nian2326076You've worked through some tough stuff! For interview prep, focus on how you can use your knowledge in real situations. Start by explaining tokenization and embeddings in simple terms. Use an analogy or quick example to show how they work. Next, talk about attention mechanisms. Try relating it to something familiar, like focusing on different parts of a conversation based on what's important. Finally, go over how generation works, and think about how you'd explain it to someone new. Keep it simple and relatable. Also, be ready to discuss any code you've written—what problems it solved and what you learned. This shows you can put theory into practice. Good luck!May 121 like
nitayneemanAuthorThanks. Glad the article helped :)May 131 like
DD_ZORO_69the transition from understanding basic neural nets to actually grasping how LLMs scale is a huge hurdle for most people. I really like how you handled the explanation of the "under the hood" mechanics without getting bogged down in too much jargon. It's rare to see someone bridge that gap between "surface level" and "impossible math" so well. I've been digging into late-stage training nuances lately and this was a solid refresher on the foundations.May 121 like
nitayneemanAuthorAppreciate that. The "surface level" vs "impossible math" gap is exactly what I was trying to thread. Most explanations either hand-wave the mechanics or assume you’re comfortable with backprop math before they’ll talk to you.
Curious what specifically you’re digging into on the late-stage training side - RLHF vs DPO tradeoffs? Constitutional AI?May 121 like
Curious what specifically you’re digging into on the late-stage training side - RLHF vs DPO tradeoffs? Constitutional AI? May 12 1 like