<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Blogs on Osman&#39;s Odyssey: Byte &amp; Build</title>
    <link>https://www.ahmadosman.com/blog/</link>
    <description>Recent content in Blogs on Osman&#39;s Odyssey: Byte &amp; Build</description>
    <image>
      <title>Osman&#39;s Odyssey: Byte &amp; Build</title>
      <url>https://www.ahmadosman.com/logo/byte-and-build.png</url>
      <link>https://www.ahmadosman.com/logo/byte-and-build.png</link>
    </image>
    <generator>Hugo -- 0.145.0</generator>
    <language>en-us</language>
    <lastBuildDate>Thu, 26 Jun 2025 14:35:54 -0500</lastBuildDate>
    <atom:link href="https://www.ahmadosman.com/blog/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>First Came The Tokenizer</title>
      <link>https://www.ahmadosman.com/blog/first-came-the-tokenizer/</link>
      <pubDate>Thu, 26 Jun 2025 14:35:54 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/first-came-the-tokenizer/</guid>
      <description>A deep dive into tokenizers, the invisible first piece of your LLM stack. Learn how they control costs, context windows, and performance, and see how algorithms like BPE and SentencePiece can make or break your AI.</description>
    </item>
    <item>
      <title>So You Want to Learn LLMs? Here&#39;s the Roadmap</title>
      <link>https://www.ahmadosman.com/blog/learn-llms-roadmap/</link>
      <pubDate>Mon, 23 Jun 2025 13:02:02 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/learn-llms-roadmap/</guid>
      <description>The straight-up, no-BS roadmap for learning LLMs in 2025. Skip the ML fluff and endless prerequisites. Get the actionable phases, projects, and resources to actually build, train, and ship large language models—from the ground up.</description>
    </item>
    <item>
      <title>Software Engineers Aren&#39;t Getting Automated—Local AI Has To Win</title>
      <link>https://www.ahmadosman.com/blog/software-engineers-arent-getting-automated-local-ai-has-to-win/</link>
      <pubDate>Sat, 21 Jun 2025 07:08:00 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/software-engineers-arent-getting-automated-local-ai-has-to-win/</guid>
      <description>Stop worrying about AI replacing you—the real threat is losing technical depth. As cloud dependence grows and platforms get more opaque, local-first AI, open weights, and full-stack ownership are the only safety nets left. Why the future belongs to those who can build, debug, and own their tools from the metal up. Trust no corporate overlord.</description>
    </item>
    <item>
      <title>My Ultimate DeepResearch Prompt Builder Template and How I Use It</title>
      <link>https://www.ahmadosman.com/blog/my-ultimate-deepresearch-prompt-builder/</link>
      <pubDate>Fri, 20 Jun 2025 03:06:06 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/my-ultimate-deepresearch-prompt-builder/</guid>
      <description>I’m sharing my DeepResearch prompt builder template—the system that powers my research and learning workflows. Learn exactly how I turn chaos into clarity, force actionable insights, and get the most out of LLMs. See the template, my step-by-step process, and real-world tips for DeepResearchMaxxing in 2025.</description>
    </item>
    <item>
      <title>Just Like GPUs, We Need To Be Stress Tested</title>
      <link>https://www.ahmadosman.com/blog/just-like-gpus-we-need-to-be-stress-tested-101-days-of-blogging/</link>
      <pubDate>Wed, 18 Jun 2025 23:02:02 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/just-like-gpus-we-need-to-be-stress-tested-101-days-of-blogging/</guid>
      <description>Why 101 days of daily tech blogging? A raw, open challenge on AI, LLMs, self-hosted experiments, knowledge distillation, and why consistency beats talent. Expect rants, technical breakdowns, open hardware journeys, memes, and daily accountability from the basement AI server guy.</description>
    </item>
    <item>
      <title>Mastering the Game: How Corporate Politics Shape Your Career</title>
      <link>https://www.ahmadosman.com/blog/mastering-the-corporate-game-space/</link>
      <pubDate>Fri, 02 May 2025 14:44:44 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/mastering-the-corporate-game-space/</guid>
      <description>Corporate politics isn&#39;t just backroom deals—it&#39;s how influence, visibility, and relationships shape your career. In this candid guide, you&#39;ll learn to use titles, politics, and intentional networking to your advantage (without selling your soul). Real talk from people who&#39;ve played—and won—the game at big tech and beyond.</description>
    </item>
    <item>
      <title>Once Undesirable, Now Undeniable</title>
      <link>https://www.ahmadosman.com/blog/once-undesirable-now-undeniable/</link>
      <pubDate>Wed, 30 Apr 2025 15:46:46 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/once-undesirable-now-undeniable/</guid>
      <description>How taking risks, building in public, and refusing to play a losing game flipped the script—and why sometimes you have to become undeniable before you ever become accepted.</description>
    </item>
    <item>
      <title>Build Your Private AI Screenshot Organizer with LMStudio</title>
      <link>https://www.ahmadosman.com/blog/build-your-local-privat-ai-screenshot-organizer-with-lmstudio/</link>
      <pubDate>Tue, 22 Apr 2025 08:56:56 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/build-your-local-privat-ai-screenshot-organizer-with-lmstudio/</guid>
      <description>Build a local, privacy-first screenshot organizer using LMStudio’s Python SDK and Gemma 3 multimodal models. Keep your data off the cloud, automate screenshot categorization, and leverage the power of open-source AI—all running from your own PC. Step-by-step guide, code walkthrough, and a practical use-case for local LLMs.</description>
    </item>
    <item>
      <title>No, RAG Is NOT Dead!</title>
      <link>https://www.ahmadosman.com/blog/no-rag-is-not-dead-space/</link>
      <pubDate>Fri, 11 Apr 2025 15:31:31 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/no-rag-is-not-dead-space/</guid>
      <description>Forget the hype—here’s what actually happened when we asked “Is RAG dead?” This deep-dive explores why Retrieval-Augmented Generation (RAG) is still essential in real AI systems, what people get wrong, and how practitioners are shipping the next wave of AI with smarter retrieval, dynamic context, and hard-earned lessons from the field.</description>
    </item>
    <item>
      <title>From the Shadows to the Feed: Why I’m Finally Playing the Game</title>
      <link>https://www.ahmadosman.com/blog/from-the-shadows-to-the-feed/</link>
      <pubDate>Tue, 25 Mar 2025 14:28:28 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/from-the-shadows-to-the-feed/</guid>
      <description>After years of building in the dark, I decided to play the game of distribution. Here’s why networks—and distribution—matter more than ever, and why I’m finally sharing my journey, experiments, and ideas in public.</description>
    </item>
    <item>
      <title>Key Highlights From Running DeepSeek R-1 671B on 14x RTX 3090s &#43; Epyc 7713 &amp; 512GB RAM</title>
      <link>https://www.ahmadosman.com/blog/r1-ktransformers-inference-livestream/</link>
      <pubDate>Fri, 14 Feb 2025 02:56:56 -0600</pubDate>
      <guid>https://www.ahmadosman.com/blog/r1-ktransformers-inference-livestream/</guid>
      <description>Key takeaways from livestreaming DeepSeek R-1 671B (4-bit) on a 14x RTX 3090 basement AI server. See how KTransformers crushed llama.cpp in prompt eval speeds, compare setups, and get real-world insights into massive LLM inference with vLLM, ExLlamaV2, and more.</description>
    </item>
    <item>
      <title>Stop Wasting Your Multi-GPU Setup With llama.cpp</title>
      <link>https://www.ahmadosman.com/blog/do-not-use-llama-cpp-or-ollama-on-multi-gpus-setups-use-vllm-or-exllamav2/</link>
      <pubDate>Fri, 07 Feb 2025 05:06:36 -0600</pubDate>
      <guid>https://www.ahmadosman.com/blog/do-not-use-llama-cpp-or-ollama-on-multi-gpus-setups-use-vllm-or-exllamav2/</guid>
      <description>Exploring the intricacies of Inference Engines and why llama.cpp should be avoided when running Multi-GPU setups. Learn about Tensor Parallelism, the role of vLLM in batch inference, and why ExLlamaV2 has been a game-changer for GPU-optimized AI serving since it introduced Tensor Parallelism.</description>
    </item>
    <item>
      <title>Resources From X/Twitter Audio Space on LLMs &amp; AI - 2025-02-02</title>
      <link>https://www.ahmadosman.com/blog/deepseek-r1-space/</link>
      <pubDate>Sun, 02 Feb 2025 15:35:35 -0600</pubDate>
      <guid>https://www.ahmadosman.com/blog/deepseek-r1-space/</guid>
      <description>A curated collection of links, books, tools, and benchmarks discussed during the February 2nd, 2025 Twitter/X Audio Space on LLMs and AI. Includes practical resources, RAG leaderboards, toolkits, and perspectives on AI adoption in the Middle East and globally.</description>
    </item>
    <item>
      <title>Antifragile AI</title>
      <link>https://www.ahmadosman.com/blog/taleb-antifragile-ai-insights/</link>
      <pubDate>Tue, 03 Dec 2024 04:21:55 -0600</pubDate>
      <guid>https://www.ahmadosman.com/blog/taleb-antifragile-ai-insights/</guid>
      <description>Explore how AI systems can become antifragile, harnessing uncertainty to thrive. Learn about the shift and acceleration from traditional software to AI agentic systems and their implications for the future.</description>
    </item>
    <item>
      <title>All In</title>
      <link>https://www.ahmadosman.com/blog/go-all-in/</link>
      <pubDate>Tue, 01 Oct 2024 22:08:52 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/go-all-in/</guid>
      <description>Embrace new ideas, trust your instincts, and go all in. 42 days to launch—let’s win this game! #GoAllIn</description>
    </item>
    <item>
      <title>Serving AI From The Basement — Part II</title>
      <link>https://www.ahmadosman.com/blog/serving-ai-from-the-basement-part-ii/</link>
      <pubDate>Wed, 18 Sep 2024 05:57:26 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/serving-ai-from-the-basement-part-ii/</guid>
      <description>SWE Agentic Framework, MoEs, Quantizations &amp; Mixed Precision, Batch Inference, LLM Architectures, vLLM, DeepSeek v2.5, Embedding Models, and Speculative Decoding: An LLM Brain Dump... I have been working on a multi-agent system that simulates a team of Software Engineers; this system assigns projects, creates teams and adds members to them based on areas of expertise and need, and asks team members to build features, assign story points, have pair programming sessions together, etc.</description>
    </item>
    <item>
      <title>Serving AI From The Basement — Part I</title>
      <link>https://www.ahmadosman.com/blog/serving-ai-from-the-basement-part-i/</link>
      <pubDate>Fri, 06 Sep 2024 16:37:23 -0500</pubDate>
      <guid>https://www.ahmadosman.com/blog/serving-ai-from-the-basement-part-i/</guid>
      <description>Dedicated LLM server powered by 8x RTX 3090 Graphic Cards, boasting a total of 192GB of VRAM.</description>
    </item>
  </channel>
</rss>
