Inference on Osman's Odyssey: Byte & Build

Inference on Osman's Odyssey: Byte & Build https://www.ahmadosman.com/tags/inference/ Recent content in Inference on Osman's Odyssey: Byte & Build Osman's Odyssey: Byte & Build https://www.ahmadosman.com/logo/byte-and-build.png https://www.ahmadosman.com/logo/byte-and-build.png Hugo -- 0.134.3 en-us Tue, 24 Sep 2024 01:33:45 -0500 Serving AI From The Basement — Part II https://www.ahmadosman.com/blog/serving-ai-from-the-basement-part-ii/ Wed, 18 Sep 2024 05:57:26 -0500 https://www.ahmadosman.com/blog/serving-ai-from-the-basement-part-ii/ SWE Agentic Framework, MoEs, Quantizations & Mixed Precision, Batch Inference, LLM Architectures, vLLM, DeepSeek v2.5, Embedding Models, and Speculative Decoding: An LLM Brain Dump... I have been working on a multi-agent system that simulates a team of Software Engineers; this system assigns projects, creates teams and adds members to them based on areas of expertise and need, and asks team members to build features, assign story points, have pair programming sessions together, etc.