A deep dive into tokenizers, the invisible first piece of your LLM stack. Learn how they control costs, context windows, and performance, and see how algorithms like BPE and SentencePiece can make or break your AI.| Blogs on Osman's Odyssey: Byte & Build
The straight-up, no-BS roadmap for learning LLMs in 2025. Skip the ML fluff and endless prerequisites. Get the actionable phases, projects, and resources to actually build, train, and ship large language models—from the ground up.| Blogs on Osman's Odyssey: Byte & Build
Stop worrying about AI replacing you—the real threat is losing technical depth. As cloud dependence grows and platforms get more opaque, local-first AI, open weights, and full-stack ownership are the only safety nets left. Why the future belongs to those who can build, debug, and own their tools from the metal up. Trust no corporate overlord.| Blogs on Osman's Odyssey: Byte & Build
I’m sharing my DeepResearch prompt builder template—the system that powers my research and learning workflows. Learn exactly how I turn chaos into clarity, force actionable insights, and get the most out of LLMs. See the template, my step-by-step process, and real-world tips for DeepResearchMaxxing in 2025.| Blogs on Osman's Odyssey: Byte & Build
Why 101 days of daily tech blogging? A raw, open challenge on AI, LLMs, self-hosted experiments, knowledge distillation, and why consistency beats talent. Expect rants, technical breakdowns, open hardware journeys, memes, and daily accountability from the basement AI server guy.| Blogs on Osman's Odyssey: Byte & Build
Corporate politics isn't just backroom deals—it's how influence, visibility, and relationships shape your career. In this candid guide, you'll learn to use titles, politics, and intentional networking to your advantage (without selling your soul). Real talk from people who've played—and won—the game at big tech and beyond.| Blogs on Osman's Odyssey: Byte & Build
How taking risks, building in public, and refusing to play a losing game flipped the script—and why sometimes you have to become undeniable before you ever become accepted.| Blogs on Osman's Odyssey: Byte & Build
Build a local, privacy-first screenshot organizer using LMStudio’s Python SDK and Gemma 3 multimodal models. Keep your data off the cloud, automate screenshot categorization, and leverage the power of open-source AI—all running from your own PC. Step-by-step guide, code walkthrough, and a practical use-case for local LLMs.| Blogs on Osman's Odyssey: Byte & Build
Forget the hype—here’s what actually happened when we asked “Is RAG dead?” This deep-dive explores why Retrieval-Augmented Generation (RAG) is still essential in real AI systems, what people get wrong, and how practitioners are shipping the next wave of AI with smarter retrieval, dynamic context, and hard-earned lessons from the field.| Blogs on Osman's Odyssey: Byte & Build
After years of building in the dark, I decided to play the game of distribution. Here’s why networks—and distribution—matter more than ever, and why I’m finally sharing my journey, experiments, and ideas in public.| Blogs on Osman's Odyssey: Byte & Build
Key takeaways from livestreaming DeepSeek R-1 671B (4-bit) on a 14x RTX 3090 basement AI server. See how KTransformers crushed llama.cpp in prompt eval speeds, compare setups, and get real-world insights into massive LLM inference with vLLM, ExLlamaV2, and more.| Blogs on Osman's Odyssey: Byte & Build
A curated collection of links, books, tools, and benchmarks discussed during the February 2nd, 2025 Twitter/X Audio Space on LLMs and AI. Includes practical resources, RAG leaderboards, toolkits, and perspectives on AI adoption in the Middle East and globally.| Osman's Odyssey: Byte & Build
Exploring the intricacies of Inference Engines and why llama.cpp should be avoided when running Multi-GPU setups. Learn about Tensor Parallelism, the role of vLLM in batch inference, and why ExLlamaV2 has been a game-changer for GPU-optimized AI serving since it introduced Tensor Parallelism.| Osman's Odyssey: Byte & Build
Ahmad M. Osman, a Software Engineer with a deep background in Machine Learning, Gen. AI/Large Language Models (LLMs). My journey began with coding at the age of 7, leading me through various roles at Mayo Clinic, Trimble, the Federal Home Loan Bank of Des Moines, and CloudInn. With academic credentials in Computer Science and Data Science from Luther College, I've dedicated my career to not just coding, but creating, solving, and innovating · Ahmad M. Osman Website · Software Engineer with ...| Osman's Odyssey: Byte & Build
Ahmad M. Osman, a Software Engineer with a deep background in Machine Learning, Gen. AI/Large Language Models (LLMs). My journey began with coding at the age of 7, leading me through various roles at Mayo Clinic, Trimble, the Federal Home Loan Bank of Des Moines, and CloudInn. With academic credentials in Computer Science and Data Science from Luther College, I've dedicated my career to not just coding, but creating, solving, and innovating · Ahmad M. Osman Website · Software Engineer with ...| Osman's Odyssey: Byte & Build
Explore how AI systems can become antifragile, harnessing uncertainty to thrive. Learn about the shift and acceleration from traditional software to AI agentic systems and their implications for the future.| Osman's Odyssey: Byte & Build
Embrace new ideas, trust your instincts, and go all in. 42 days to launch—let’s win this game! #GoAllIn| Blogs on Osman's Odyssey: Byte & Build
SWE Agentic Framework, MoEs, Quantizations & Mixed Precision, Batch Inference, LLM Architectures, vLLM, DeepSeek v2.5, Embedding Models, and Speculative Decoding: An LLM Brain Dump... I have been working on a multi-agent system that simulates a team of Software Engineers; this system assigns projects, creates teams and adds members to them based on areas of expertise and need, and asks team members to build features, assign story points, have pair programming sessions together, etc.| Blogs on Osman's Odyssey: Byte & Build
Dedicated LLM server powered by 8x RTX 3090 Graphic Cards, boasting a total of 192GB of VRAM.| Blogs on Osman's Odyssey: Byte & Build