More of my notes on DeepSeek V4 - the really big news is the pricing: both DeepSeek-V4-Flash and DeepSeek-V4-Pro are the cheapest models in their categories while benchmarking close to the frontier models from other providers https://t.co/QQ2iFd4BDr https://t.co/0R0DFYYpSx
AI news, curated.
Content, created.
Delivered.
The only 5 AI stories that matter each day — across Claude, OpenAI, Google, xAI, Meta, DeepSeek and more. Transformed into tweets, LinkedIn posts, articles, and video scripts in your voice. Delivered at 8am, ready to publish.
$29–$299/mo · Cancel anytime · No posts, only news that moves the market
Today in AI
i think the most underated part of today's launch is not GPT 5.5 at all https://t.co/uwJtuJwgRx
These pelicans are kind of angry looking! Left is deepseek-v4-flash, right is deepseek-v4-pro - both generated using OpenRouter via my LLM tool https://t.co/UbUUd8Rhqr https://t.co/gZlyFk2yKy
Huge thanks to @radixark for bringing advanced RL capabilities to the SGLang ecosystem! A massive step forward for the open-source RL community. https://t.co/n47sxYNUCX
API is Available Today! 🔹 Keep base_url, just update model to deepseek-v4-pro or deepseek-v4-flash. 🔹 Supports OpenAI ChatCompletions & Anthropic APIs. 🔹 Both models support 1M context & dual modes (Thinking / Non-Thinking): https://t.co/ec3B0BDXZi ⚠️ Note: deepseek-…
Dedicated Optimizations for Agent Capabilities 🔹 DeepSeek-V4 is seamlessly integrated with leading AI agents like Claude Code, OpenClaw & OpenCode. 🔹 Already driving our in-house agentic coding at DeepSeek. The figure below showcases a sample PDF generated by DeepSeek-V4-…
DeepSeek-V4-Flash 🔹 Reasoning capabilities closely approach V4-Pro. 🔹 Performs on par with V4-Pro on simple Agent tasks. 🔹 Smaller parameter size, faster response times, and highly cost-effective API pricing. 3/n https://t.co/dAkP1f2aX0
DeepSeek-V4-Pro 🔹 Enhanced Agentic Capabilities: Open-source SOTA in Agentic Coding benchmarks. 🔹 Rich World Knowledge: Leads all current open models, trailing only Gemini-3.1-Pro. 🔹 World-Class Reasoning: Beats all current open models in Math/STEM/Coding, rivaling top https:…
🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models. 🔹 DeepSeek-V4-Flash: 284B total / 13B active para…
DeepSeek V4 by @deepseek_ai just dropped! SGLang is ready on Day 0 with a full stack of optimizations from architectures to low-level kernels. We also deliver a verified RL training pipeline in Miles (by @radixark) for V4 at launch: 1️⃣ Native "ShadowRadix" Design: DeepSeek V4's…
DeepSeek-V4 just dropped on Hugging Face https://t.co/hoGoBnfIMw https://t.co/pHb8MM8QzE
woke up to such brilliant vibes around GPT 5.5 and all the new features we released in Codex! the team really cooked with this model and the app quite literally burning through a lot of nights kudos to all the teams involved and enjoy GPT 5.5
auto-review now live in codex — using a guardian agent to evaluate the safety of proposed actions, reducing human approvals to only when they're really needed. https://t.co/B1IhkqBZaL
Introducing Grok Voice Think Fast 1.0 A state-of-the-art voice model built for complex, multi-step workflows with snappy responses and high accuracy. It takes the top spot on the Tau Voice Bench and handles real-world messiness like noise, accents, and interruptions better than
GPT-5.5 is here. Ben and I have had access for a bit. We have a lot of thoughts. 00:00:00 - Intro 00:01:34 - Vercel hack 00:05:38 - Kimi K2.6 00:14:29 - Cursor acquired? 00:37:24 - GPT Image 2 00:49:30 - GPT-5.5 01:22:02 - GPT-5.5 Pro https://t.co/x7bC5GreJD
GPT-5.5 is here. Ben and I have had access for a bit. We have a lot of thoughts. 00:01:34 - Vercel hack 00:05:38 - Kimi K2.6 00:14:29 - Cursor acquired? 00:37:24 - GPT Image 2 00:49:30 - GPT-5.5 01:22:02 - GPT-5.5 Pro https://t.co/zsXCfjH9MV
GPT-5.5 may not be in the official OpenAI API... but it's available via the apparently approved-of Codex API backdoor So I used that to make these pelicans (default and xhigh)! https://t.co/kIx2wnhpqD https://t.co/PP1JXhtEiM https://t.co/5hDFhjWMHy
GPT-5.5 is a new class of intelligence. This intelligence makes it intuitive to use; it completes challenging tasks with little micromanagement. Also very token efficient, and runs with low latency and at scale. A real step toward a new way of getting computer work done. https:…
GPT-5.5 (medium) is tied for SOTA on Artificial Analysis. GPT-5.5 (high) and GPT-5.5 (xhigh) are meaningfully ahead. xhigh is the first model to break the 50's https://t.co/vmDhaKy5eL
it's pretty insane how token efficient GPT 5.5 - SOTA performance all with significantly lower output tokens!! https://t.co/O9Fc8PQkkj https://t.co/v7zIxCeGo7
1. We believe in iterative deployment; although GPT-5.5 is already a smart model, we expect rapid improvements. Iterative deployment is a big part of our safety strategy; we believe the world will be best equipped to win at the team sport of AI resilience this way. 2. We believe
$5 per mil in, $30 per mil out. GPT-5.5 is smart. I've been using it for a bit. It's also weird, hard to wrangle, and too expensive IMO. Double the price of GPT-5.4. 20% more expensive than Opus 4.7. https://t.co/C8k68quwtw
Fun fact: Codex + GPT-5.5 even helped optimize the serving stack behind GPT-5.5, increasing token generation speeds by over 20% 🤯 https://t.co/v7zIxCeGo7
GPT-5.5 is rolling out today for Plus, Pro, Business and Enterprise users across ChatGPT and Codex. We’re also introducing GPT-5.5 Pro for Pro, Business, and Enterprise users in ChatGPT.
In ChatGPT, full-stack inference improvements enable a more capable model at faster speed. This efficiency is a game-changer for GPT-5.5 Pro, now a much more practical option for demanding tasks, and a step change in the level of difficulty and quality of work ChatGPT can take on
GPT-5.5 delivers this step up in intelligence without compromising on speed. GPT-5.5 matches GPT-5.4 per-token latency in real-world serving, while performing better across nearly every evaluation we measured. It also uses significantly fewer tokens to complete the same Codex h…
GPT-5.5 excels at writing and debugging code, researching online, analyzing data, creating documents and spreadsheets, operating software, and moving across tools until a task is finished. The gains are especially clear in agentic coding, computer use, knowledge work, and early
Introducing GPT-5.5 A new class of intelligence for real work and powering agents, built to understand complex goals, use tools, check its work, and carry more tasks through to completion. It marks a new way of getting computer work done. Now available in ChatGPT and Codex. htt…
My new cryptography puzzle is now live. Will pay $1,000 to the first person who DMs me the plaintext decryption of the first line. 2nd line is a hint. If you send me slop, AI hallucinations, or a decryption of the 2nd line, you are disqualified. https://t.co/p8qmYYRAIA https://…
🎉 Meet Hy3 preview from @TencentHunyuan , a 295B hybrid MoE (21B active) with 256K context. Day-0 support is now live in SGLang! → Strong reasoning: top results on FrontierScience-Olympiad, IMO-AnswerBench & GPQA Diamond → 256K context: big gains on CL-bench & LongBench …