Blogs on Osman's Odyssey: Byte & Build

Recent content from https://www.ahmadosman.com/blog/

All In
https://www.ahmadosman.com/blog/go-all-in/
Tue, 01 Oct 2024 22:08:52 -0500
Embrace new ideas, trust your instincts, and go all in. 42 days to launch. Let’s win this game! #GoAllIn

Serving AI From The Basement — Part II
https://www.ahmadosman.com/blog/serving-ai-from-the-basement-part-ii/
Wed, 18 Sep 2024 05:57:26 -0500
SWE Agentic Framework, MoEs, Quantizations & Mixed Precision, Batch Inference, LLM Architectures, vLLM, DeepSeek v2.5, Embedding Models, and Speculative Decoding: An LLM Brain Dump... I have been working on a multi-agent system that simulates a team of software engineers. The system assigns projects, creates teams and adds members to them based on expertise and need, and asks team members to build features, assign story points, hold pair programming sessions, etc.

Serving AI From The Basement — Part I
https://www.ahmadosman.com/blog/serving-ai-from-the-basement-part-i/
Fri, 06 Sep 2024 16:37:23 -0500
Dedicated LLM server powered by 8x RTX 3090 graphics cards, boasting a total of 192GB of VRAM.