Working Notes: a commonplace notebook for recording & exploring ideas.
Home. Site Map. Subscribe. More at expLog.
— Kunal
Stas Bekman
Religiously take notes
people
machine learning engineering
performance, incl. marketing vs reality
ROI
tflops
make sure to check against scarcity
theoretical numbers are unacheivable
shape needs to be appropriate
fn of the clock
fma = fused multiply add
main component for compute
can't boost the clock
measure it for your particular setup
buying gpus is very scary
have to manage all systems: need fast network and disk
gpu buffer, time to replace SLA, etc.
log everything while running
MFU
have to move data to hbm memory
benchmarking/duplex