Tomasulo Loop Example Loop: LD MULTD SD SUBI BNEZ F0 F4 F4 0 F0 0 R1 R1 R1 F2 R1 R1 #8 Loop Assume Multiply takes 4 clocks Assume first load takes 8 clocks (cache miss?), second load takes 4 clocks (hit) 1 Loop Example Cycle 0 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 No 0 Mult2 No Register result status Clock 0 R1 80 F0 Issue S1 Vj F2 Execution Write complete Result S2 Vk F4 Busy Address No No No Qi No No No Load1 Load2 Load3 Store1 Store2 Store3 RS for j RS for k Qj Qk Code: LD F0 MULTDF4 SD F4 SUBI R1 BNEZ R1 F6 F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Qi 2 Loop Example Cycle 1 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 No 0 Mult2 No Register result status Clock 1 R1 80 F0 Qi Issue 1 S1 Vj F2 Execution Write complete Result S2 Vk F4 Busy Address Yes 80 No No Qi No No No Load1 Load2 Load3 Store1 Store2 Store3 RS for j RS for k Qj Qk Code: LD F0 MULTDF4 SD F4 SUBI R1 BNEZ R1 F6 F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Load1 3 Loop Example Cycle 2 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 No Register result status Clock 2 R1 80 F0 Qi Load1 Issue 1 2 S1 Vj Execution Write complete Result S2 Vk R(F2) F2 F4 Busy Address Yes 80 No No Qi No No No Load1 Load2 Load3 Store1 Store2 Store3 RS for j RS for k Qj Qk Code: LD F0 MULTDF4 SD F4 Load1 SUBI R1 BNEZ R1 F6 F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult1 4 Loop Example Cycle 3 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 No Register result status Clock 3 R1 80 F0 Qi Load1 Issue 1 2 3 S1 Vj Execution Write complete Result S2 Vk R(F2) F2 F4 Busy Address Yes 80 No No Qi Yes 80 Mult1 No No Load1 Load2 Load3 Store1 Store2 Store3 RS for j RS for k Qj Qk Code: LD F0 MULTDF4 SD F4 Load1 SUBI R1 BNEZ R1 F6 F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult1 • Note: MULT1 has no registers names in RS 5 Loop Example Cycle 4 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 No Register result status Clock 4 R1 72 F0 Qi Load1 Issue 1 2 3 S1 Vj Execution Write complete Result S2 Vk R(F2) F2 F4 Busy Address Yes 80 No No Qi Yes 80 Mult1 No No Load1 Load2 Load3 Store1 Store2 Store3 RS for j RS for k Qj Qk Code: LD F0 MULTDF4 SD F4 Load1 SUBI R1 BNEZ R1 F6 F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult1 6 Loop Example Cycle 5 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 No Register result status Clock 5 R1 72 F0 Qi Load1 Issue 1 2 3 S1 Vj Execution Write complete Result S2 Vk R(F2) F2 F4 Busy Address Yes 80 No No Qi Yes 80 Mult1 No No Load1 Load2 Load3 Store1 Store2 Store3 RS for j RS for k Qj Qk Code: LD F0 MULTDF4 SD F4 Load1 SUBI R1 BNEZ R1 F6 F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult1 7 Loop Example Cycle 6 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 No Register result status Clock 6 R1 72 F0 Qi Load2 Issue 1 2 3 6 S1 Vj Execution Write complete Result S2 Vk R(F2) F2 F4 Busy Address Yes 80 Yes 72 No Qi Yes 80 Mult1 No No Load1 Load2 Load3 Store1 Store2 Store3 RS for j RS for k Qj Qk Code: LD F0 MULTDF4 SD F4 Load1 SUBI R1 BNEZ R1 F6 F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult1 • Note: F0 never sees Load1 result 8 Loop Example Cycle 7 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 Yes MULTD Register result status Clock 7 R1 72 F0 Qi Load2 Issue 1 2 3 6 7 S1 Vj F2 Execution Write complete Result Busy Address Yes 80 Yes 72 No Qi Yes 80 Mult1 No No R(F2) R(F2) Load1 Load2 Load3 Store1 Store2 Store3 RS for j RS for k Qj Qk Code: LD F0 MULTDF4 SD F4 Load1 SUBI R1 Load2 BNEZ R1 F4 F6 S2 Vk F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult2 • Note: MULT2 has no registers names in RS 9 Loop Example Cycle 8 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 Yes MULTD Register result status Clock 8 R1 72 F0 Qi Load2 Issue 1 2 3 6 7 8 S1 Vj F2 Execution Write complete Result Busy Address Yes 80 Yes 72 No Qi Yes 80 Mult1 Yes 72 Mult2 No R(F2) R(F2) Load1 Load2 Load3 Store1 Store2 Store3 RS for j RS for k Qj Qk Code: LD F0 MULTDF4 SD F4 Load1 SUBI R1 Load2 BNEZ R1 F4 F6 S2 Vk F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult2 10 Loop Example Cycle 9 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 Yes MULTD Register result status Clock 9 R1 64 F0 Qi Load2 Issue 1 2 3 6 7 8 S1 Vj F2 Execution Write complete Result 9 Load1 Load2 Load3 Store1 Store2 Store3 S2 RS for j RS for k Vk Qj Qk R(F2) R(F2) Load1 Load2 F4 F6 F8 Busy Address Yes 80 Yes 72 No Qi Yes 80 Mult1 Yes 72 Mult2 No Code: LD F0 MULTDF4 SD F4 SUBI R1 BNEZ R1 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult2 • Load1 completing; what is waiting for it? 11 Loop Example Cycle 10 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 4 Mult1 Yes MULTD 0 Mult2 Yes MULTD Register result status Clock 10 R1 64 F0 Qi Load2 Issue 1 2 3 6 7 8 S1 Vj Execution Write complete Result 9 10 Load1 Load2 Load3 10 Store1 Store2 Store3 S2 RS for j RS for k Vk Qj Qk M(80) R(F2) R(F2) Load2 F2 F6 F4 F8 Busy Address No Yes 72 No Qi Yes 80 Mult1 Yes 72 Mult2 No Code: LD F0 MULTDF4 SD F4 SUBI R1 BNEZ R1 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult2 • Load2 completing; what is waiting for it? 12 Loop Example Cycle 11 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 3 Mult1 Yes MULTD 4 Mult2 Yes MULTD Register result status Clock 11 R1 64 F0 Qi Load3 Issue 1 2 3 6 7 8 S1 Vj Execution Write complete Result 9 10 Load1 Load2 Load3 10 11 Store1 Store2 Store3 S2 RS for j RS for k Vk Qj Qk M(80) R(F2) M(72) R(F2) F2 F4 F6 F8 Busy Address No No Yes 64 Qi Yes 80 Mult1 Yes 72 Mult2 No Code: LD F0 MULTDF4 SD F4 SUBI R1 BNEZ R1 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult2 13 Loop Example Cycle 12 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 2 Mult1 Yes MULTD 3 Mult2 Yes MULTD Register result status Clock 12 R1 64 F0 Qi Load3 Issue 1 2 3 6 7 8 S1 Vj Execution Write complete Result 9 10 Load1 Load2 Load3 10 11 Store1 Store2 Store3 S2 RS for j RS for k Vk Qj Qk M(80) R(F2) M(72) R(F2) F2 F4 F6 F8 Busy Address No No Yes 64 Qi Yes 80 Mult1 Yes 72 Mult2 No Code: LD F0 MULTDF4 SD F4 SUBI R1 BNEZ R1 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult2 14 Loop Example Cycle 13 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 1 Mult1 Yes MULTD 2 Mult2 Yes MULTD Register result status Clock 13 R1 64 F0 Qi Load3 Issue 1 2 3 6 7 8 S1 Vj Execution Write complete Result 9 10 Load1 Load2 Load3 10 11 Store1 Store2 Store3 S2 RS for j RS for k Vk Qj Qk M(80) R(F2) M(72) R(F2) F2 F4 F6 F8 Busy Address No No Yes 64 Qi Yes 80 Mult1 Yes 72 Mult2 No Code: LD F0 MULTDF4 SD F4 SUBI R1 BNEZ R1 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult2 15 Loop Example Cycle 14 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 1 Mult2 Yes MULTD Register result status Clock 14 R1 64 F0 Qi Load3 Issue 1 2 3 6 7 8 S1 Vj Execution Write complete Result 9 10 Load1 14 Load2 Load3 10 11 Store1 Store2 Store3 S2 RS for j RS for k Vk Qj Qk M(80) R(F2) M(72) R(F2) F2 F4 F6 F8 Busy Address No No Yes 64 Qi Yes 80 Mult1 Yes 72 Mult2 No Code: LD F0 MULTDF4 SD F4 SUBI R1 BNEZ R1 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult2 • Mult1 completing; what is waiting for it? 16 Loop Example Cycle 15 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 No 0 Mult2 Yes MULTD Register result status Clock 15 R1 64 F0 Qi Load3 Issue 1 2 3 6 7 8 S1 Vj Execution Write complete Result 9 10 Load1 14 15 Load2 Load3 10 11 Store1 15 Store2 Store3 S2 RS for j RS for k Vk Qj Qk M(72) R(F2) F2 F4 F6 F8 Busy Address No No Yes 64 Qi Yes 80 M(80)*R(F2) Yes 72 Mult2 No Code: LD F0 MULTDF4 SD F4 SUBI R1 BNEZ R1 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult2 • Mult2 completing; what is waiting for it? 17 Loop Example Cycle 16 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 No Register result status Clock 16 R1 64 F0 Qi Load3 Issue 1 2 3 6 7 8 S1 Vj F2 Execution Write complete Result 9 10 Load1 14 15 Load2 Load3 10 11 Store1 15 16 Store2 Store3 S2 RS for j RS for k Vk Qj Qk R(F2) Load3 F4 F6 F8 Busy Address No No Yes 64 Qi Yes 80 M(80)*R(F2) Yes 72 M(72)*R(72) No Code: LD F0 MULTDF4 SD F4 SUBI R1 BNEZ R1 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult1 18 Loop Example Cycle 17 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 No Register result status Clock 17 R1 64 F0 Qi Load3 Issue 1 2 3 6 7 8 S1 Vj F2 Execution Write complete Result 9 10 Load1 14 15 Load2 Load3 10 11 Store1 15 16 Store2 Store3 S2 RS for j RS for k Vk Qj Qk R(F2) Load3 F4 F6 F8 Busy Address No No Yes 64 Qi Yes 80 M(80)*R(F2) Yes 72 M(72)*R(72) Yes 64 Mult1 Code: LD F0 MULTDF4 SD F4 SUBI R1 BNEZ R1 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult1 19 Loop Example Cycle 18 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 No Register result status Clock 18 R1 56 F0 Qi Load3 Issue 1 2 3 6 7 8 S1 Vj Execution Write complete Result 9 10 14 15 18 10 11 15 16 S2 Vk R(F2) F2 F4 Busy Address No No Yes 64 Qi Yes 80 M(80)*R(F2) Yes 72 M(72)*R(72) Yes 64 Mult1 Load1 Load2 Load3 Store1 Store2 Store3 RS for j RS for k Qj Qk Code: LD F0 MULTDF4 SD F4 Load3 SUBI R1 BNEZ R1 F6 F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult1 20 Loop Example Cycle 19 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 No Register result status Clock 19 R1 56 F0 Qi Load3 Issue 1 2 3 6 7 8 S1 Vj Execution Write complete Result 9 10 14 15 18 19 10 11 15 16 S2 Vk R(F2) F2 F4 Busy Address No No Yes 64 Qi No Yes 72 M(72)*R(72) Yes 64 Mult1 Load1 Load2 Load3 Store1 Store2 Store3 RS for j RS for k Qj Qk Code: LD F0 MULTDF4 SD F4 Load3 SUBI R1 BNEZ R1 F6 F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult1 21 Loop Example Cycle 20 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 No Register result status Clock 20 R1 56 F0 Qi Load3 Issue 1 2 3 6 7 8 S1 Vj Execution Write complete Result 9 10 14 15 18 19 10 11 15 16 20 S2 RS for j Vk Qj R(F2) F2 F4 Busy Address No No Yes 64 Qi No Yes 72 M(72)*R(72) Yes 64 Mult1 Load1 Load2 Load3 Store1 Store2 Store3 RS for k Qk Code: LD F0 MULTDF4 SD F4 Load3 SUBI R1 BNEZ R1 F6 F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult1 22 Loop Example Cycle 21 Instruction status Instruction j k iteration LD F0 0 R1 1 MULTDF4 F0 F2 1 SD F4 0 R1 1 LD F0 0 R1 2 MULTDF4 F0 F2 2 SD F4 0 R1 2 Reservation Stations Time Name Busy Op 0 Add1 No 0 Add2 No 0 Add3 No 0 Mult1 Yes MULTD 0 Mult2 No Register result status Clock 21 R1 56 F0 Qi Load3 Issue 1 2 3 6 7 8 S1 Vj Execution Write complete Result 9 10 14 15 18 19 10 11 15 16 20 21 S2 RS for j Vk Qj R(F2) F2 F4 Busy Address No No Yes 64 Qi No No Yes 64 Mult1 Load1 Load2 Load3 Store1 Store2 Store3 RS for k Qk Code: LD F0 MULTDF4 SD F4 Load3 SUBI R1 BNEZ R1 F6 F8 0 R1 F0 F2 0 R1 R1 #8 Loop F10 F12 ... F30 Mult1 23
© Copyright 2025 Paperzz