Aggy AI News Aggregator

Your daily briefing on AI — curated from 51 sources

481 articles 51 sources Updated May 23, 2026 at 02:01 AM AEST 0 0/10 today

Headlines (77 articles)

0 of 77 read
  • We tried Google’s AI glasses and they’re almost there Google demoed prototype Android XR glasses that overlay Gemini-powered translation, navigation, and other information directly into your field of view.
  • Even If You Hate AI, You Will Use Google AI Search The search giant’s AI-crafted answers are so convenient, you’ll be sucked in—to the detriment of the web and the artists and thinkers behind it.
  • AI put "synthetic quotes" in his book. But this author wants to keep using it. Steven Rosenbaum explains how inaccurate quotes got into his book The Future of Truth.
  • The literary world isn’t prepared for AI You know it when you see it.
  • Google I/O showed how the path for AI-driven science is shifting Two years ago, an AI tool won Google DeepMind a Nobel. Researchers are now climbing toward a new goal.
  • Why would you disrespect your favorite artist with an AI remix? What superfan wants this?
  • The Gulf’s AI Boom Has an Undersea Cable Problem Hyperscalers are pushing the Gulf to rethink internet infrastructure as AI raises the stakes of cable disruptions.
  • Samsung’s memory chip employees negotiated $340,000 bonuses this year But the deal may still be a win for Samsung.
  • Roundtables: Can AI Learn to Understand the World? Watch a subscriber-only discussion exploring how AI might enter the physical world.
  • Spotify and Universal Music strike deal allowing fan-made AI covers and remixes Spotify is partnering with Universal Music Group to let Premium subscribers create AI-generated song covers and remixes, with participating artists receiving a share of the revenue.
  • Can OpenAI’s ‘Master of Disaster’ Fix AI’s Reputation Crisis? Global affairs chief Chris Lehane wants to tone down the debate over AI’s societal impacts—and get states to pass laws that won’t derail OpenAI’s meteoric rise.
  • Six search engines worth trying now that Google isn’t really Google anymore Google is about to look really different, and if you're not a fan of the AI overview feature, then you're not going to like what's coming.
  • Scaling creativity in the age of AI Building customer trust with on-brand content production has become a strategic imperative.
  • Meta Is in Crisis, Google Search’s Makeover, and AI Gets Booed by Graduates In this episode of “Uncanny Valley,” we unpack the mass layoffs at Meta, big announcements at Google I/O, and the latest backlash against AI.
  • Trump delays AI security executive order, saying language ‘could have been a blocker’ President Trump delayed signing an executive order that would have required pre-release government security reviews of AI models, citing dissatisfaction with the order's language.
  • All of the updates from Elon Musk and Sam Altman’s battle over OpenAI
  • Anthropic’s Code with Claude showed off coding’s future—whether you like it or not As tools like Claude Code get better, more and more developers are happy to hand off coding tasks to them. The way software gets built has changed for good.
  • In desperate times, graduates find hope in humiliating tech CEOs ‘They deserve everything they’re getting.’ (Boos.)
  • I Cloned Myself With Gemini’s AI Avatar Tool. The Result Was Unnervingly Me I used the Gemini app to generate lifelike videos featuring a digital clone of myself. Google sees this as the future of creation. I’m still creeped out.
  • Spotify adds AI-powered Q&A and briefing generation features to podcasts Spotify will let you generate daily or weekly briefs based on your prompts
  • LWiAI Podcast #245 - TML-Interaction, Claude For Legal, Sam Altman on Stand OpenAI launches new voice intelligence features in its API, Thinking Machines drops a new, highly responsive model designed for humanlike interactions in real time, and more!
  • Spotify takes on Google’s NotebookLM with its new app Spotify is releasing the new desktop app as a research preview in more than 20 markets.
  • This AI guitar pedal let me roll my own effects That new sound you’ve been looking for?
  • Spotify launches an ElevenLabs-powered audiobook creation tool The AI-powered audiobook generation won't bind authors to an exclusive contract, meaning they are free to publish their generated audiobooks anywhere.
  • Google just redesigned the search box for the first time in 25 years — here’s why it matters more than you think.
  • Spotify is launching AI-generated remixes UMG is first to strike a licensing deal.
  • Two AI-based science assistants succeed with drug-retargeting tasks Both tools generate hypotheses; one goes on to analyze some of the data.
  • SpaceX Listed Grok’s ‘Spicy’ Mode as a Risk in Its IPO Filing The rocket company has set aside more than $500 million for potential litigation losses, in part to account for complaints alleging that Grok created sexualized images.
  • The Path, founded by Tony Robbins and Calm alums, hopes to offer safer AI therapy The Path says its AI model has scored 95 on the mental health safety AI benchmark, Vera-MH. This compares to a top score of 65 for the consumer bots.
  • Spotify Studio’s AI agent creates a daily podcast just for you Music, podcasts, and a podcast that’s all about you.
  • SpaceX Is Spending $2.8 Billion to Buy Gas Turbines for Its AI Data Centers The investment comes as Elon Musk’s AI unit faces complaints about the carbon-emitting units and looks to become a big player in cloud computing.
  • Hark raises $700M Series A for its secretive ‘universal’ AI interface Hark expects to release its first multimodal models this summer, which it says will power a personal AI platform that works with existing products and services. The company expects to follow that with
  • AI video is moving beyond clip slop AI companies don’t just want Hollywood using AI for video, but for everything.
  • Google is pitching an AI agent ecosystem to consumers who may not buy it One of the most promising introductions at Google’s I/O developer conference on Tuesday was a new way for consumers to use the web: AI agents. Unfortunately, it was also the most confusing.
  • Musk v. Altman: Much ado about nothing Full of sound and fury, signifying nothing.
  • Roundtables: Inside the Musk v. Altman Trial Watch a subscriber-only discussion going behind the scenes of the trial and the implications for the AI race.
  • I Gave My OpenClaw Agent a Physical Body The coding skills of AI models are about to make it much easier to build and deploy robots.
  • Import AI 457: AI stuxnet; cursed Muon optimizer; and positive alignment Welcome to Import AI, a newsletter about AI research.
  • Here’s why Elon Musk lost his suit against OpenAI After three weeks of dueling testimony, the jury decided Musk had sued the AI giant too late.
  • Literary Prizewinners Are Facing AI Allegations. It Feels Like the New Normal Three of five regional winners of the prestigious Commonwealth Short Story Prize are suspected of relying on chatbots. They’re certainly not alone.
  • Everything Announced at Google I/O 2026: Gemini, Search, Smart Glasses Google is sprucing up its Gemini models, revamping search, and enabling AI agents in everything. There are also some spiffy new smart glasses coming this fall.
  • What to expect from Google this week The company has fallen behind its closest competitors where it matters most. Can it catch up?
  • Inside Anduril and Meta’s quest to make smart glasses for warfare It’s been a year since the duo entered the US Army’s troubled augmented-reality contest. Here’s what it looks like so far.
  • Send the arXiv AI-generated slop, get a yearlong vacation from submissions One of the site's moderators described the new policy on social media.
  • Claude Code's product lead talks usage limits, transparency, and the "lean harness" We have no grand plan," says Anthropic's Cat Wu—but that's by design.
  • Musk v. Altman week 3: Elon Musk and Sam Altman traded blows over each other’s credibility. Now the jury will pick a side. The trial spilled plenty of dirt—and raised more questions than answers about how the AI giant should be governed.
  • Your doctor’s AI notetaker may be making things up, Ontario audit finds Made-up therapy referrals, incorrect prescriptions among the common mistakes.
  • How Chinese short dramas became AI content machines The viral short dramas are increasingly being created entirely with AI, with hundreds of new shows spun up each day.
  • AI invades Princeton, where 30% of students cheat—but peers won't snitch Old "honor code" systems are under strain.
  • Rivian adds a new onboard AI assistant to its latest software update The Rivian Assistant is available for both Gen1 and Gen2 hardware.
  • Import AI 456: RSI and economic growth; radical optionality for AI regulation; and a neural computer What laws does superintelligence demand?
  • Amazon employees are "tokenmaxxing" due to pressure to use AI tools Workers are using an internal AI tool to automate non-essential tasks.
  • Last Week in AI #340 - OpenAI vs Musk + Microsoft, DeepSeek v4, Vision Banana First week of Musk v. Altman, OpenAI ends Microsoft legal peril over its $50B Amazon deal, DeepSeek previews new AI model that ‘closes the gap’ with frontier models, and more!
  • Anthropic raises Claude Code usage limits, credits new deal with SpaceX Deal follows others with Microsoft, Amazon, and more.
  • Google DeepMind partners with EVE Online for AI model testing Move comes as CCP Games spends $120M to go independent, rebrands as Fenris Creations.
  • Import AI 455: AI systems are about to start building themselves. The first step towards recursive self improvement
  • LWiAI Podcast #243 - GPT 5.5, DeepSeek V4, AI safety sabotage Our 243rd episode with a summary and discussion of last week’s big AI news!
  • LWiAI Podcast #242 - ChatGPT Images 2.0, Qwen 3.6 Max, Kimi-K2.6 ChatGPT’s new Images 2.0 model is surprisingly good at generating text , Alibaba Drops Qwen 3.6 Max Preview , SpaceX is working with Cursor
  • Import AI 454: Automating alignment research; safety study of a Chinese model; HiFloat4 At what point do the financial markets price in the singularity?
  • Import AI 453: Breaking AI agents; MirrorCode; and ten views on gradual disempowerment Was fire equivalent to a singularity for people at the time?
  • Import AI 452: Scaling laws for cyberwar; rising tides of AI automation; and a puzzle over gDP forecasting How much could AI revolutionize the economy?
  • LWiAI Podcast #238 - GPT 5.4 mini, OpenAI Pivot, Mamba 3, Attention Residuals OpenAI ships GPT-5.4 mini and nano, faster and more capable but up to 4x pricier, DLSS 5 looks like a real-time generative AI filter for video games | The Verge, and more!
  • Import AI 451: Political superintelligence; Google's society of minds, and a robot drummer Are there any genies that can be put back in the bottle?
  • Last Week in AI #339 - DLSS 5, OpenAI Superapp, MiniMax M2.7 DLSS 5 looks like a real-time generative AI filter for video games, OpenAI Reportedly Pivoting to a Focus on Business and Productivity Only, and more!
  • Import AI 450: China's electronic warfare model; traumatized LLMs; and a scaling law for cyberattacks How will timeless minds value time?
  • LWiAI Podcast #237 - Nemotron 3 Super, xAI reborn, Anthropic Lawsuit, Research! Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning, Another XAI Cofounder Has Left, Anthropic Sues Department of Defense
  • ImportAI 449: LLMs training other LLMs; 72B distributed training run; computer vision is harder than generative text Will AI cause a political interregnum
  • Last Week in AI #338 - Anthropic sues Trump, xAI starting over, Iran AI Fakes Anthropic sues Trump administration in AI dispute with Pentagon, ‘Not built right the first time’ — Musk’s xAI is starting over again, again, Cascade of A.I. Fakes About War With Iran Causes Chaos Onl
  • LWiAI Podcast #236 - GPT 5.4, Gemini 3.1 Flash Lite, Supply Chain Risk OpenAI launches GPT-5.4 with Pro and Thinking versions, Google releases Gemini 3.1 Flash Lite at 1/8th the cost of Pro, Where things stand with the Department of War Anthropic
  • Import AI 448: AI R&D; Bytedance's CUDA-writing agent; on-device satellite AI If Ukraine is the first major drone war, when will there be the first major AI war?
  • Last Week in AI #337 - Anthropic Risk, QuitGPT, ChatGPT 5.4 Anthropic officially told by DOD that it’s a supply chain risk, ‘cancel ChatGPT’ trend is growing after OpenAI signs a deal with the US military, and more!
  • Railway secures $100 million to challenge AWS with AI-native cloud infrastructure
  • Claude Code costs up to $200 a month. Goose does the same thing for free.
  • Listen Labs raises $69M after viral billboard hiring stunt to scale AI customer interviews
  • Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI
  • Anthropic launches Cowork, a Claude Desktop agent that works in your files — no coding required
  • Nous Research's NousCoder-14B is an open-source coding model landing right in the Claude Code moment

Research & Blogs (189 articles)

0 of 189 read
  • Specialization Beats Scale: A Strategic Variable Most AI Procurement Decisions Overlook A Blog post by Dharma-AI on Hugging Face
  • OpenAI named a Leader in enterprise coding agents by Gartner
  • Amazon Nova Act is now HIPAA eligible In this post, you will learn what Nova Act offers, how HIPAA eligibility applies to agentic AI, and how to get started.
  • Datasette Agent We just announced the first release of Datasette Agent, a new extensible AI assistant for Datasette. I’ve been working on my LLM Python library for just over three years now, …
  • We’re launching the Google DeepMind Accelerator program in Asia Pacific to tackle environmental risks The Asia-Pacific region is a global engine for economic growth, but it's also highly vulnerable to climate change. While green technologies are gaining momentum, a recen…
  • cq exchange: Agents without Borders cq exchange gives agents a shared place to store and retrieve experience-driven knowledge through private namespaces and a public commons.
  • Intelligent radiology workflow optimization with AI agents Many healthcare organizations report that traditional worklist systems rely on rigid rules that ignore critical context, radiologist specialization, current workload, fatigue levels, and case complexi
  • AdventHealth advances whole-person care with OpenAI
  • Integrating AWS API MCP Server with Amazon Quick using Amazon Bedrock AgentCore Runtime This post shows you how to use Amazon Bedrock AgentCore Runtime with Model Context Protocol (MCP) support to connect Amazon Quick with AWS services through the AWS API MCP Server, creating a conversat
  • We’re announcing new community investments in Missouri. We’re helping build the state’s next-generation workforce and investing in energy programs.
  • Building multi-tenant agents with Amazon Bedrock AgentCore This post explores design considerations for architecting multi-tenant agentic applications and the framework needed to address SaaS architecture challenges with Amazon Bedrock AgentCore.
  • Quoting SpaceX S-1 We have the ability to use compute resources to support our proprietary AI applications (such as Grok 5, which is currently being trained at COLOSSUS II), while also providing access …
  • 100 things we announced at I/O 2026 This year at Google I/O 2026, we announced Gemini Omni, Google Antigravity, Universal Cart and so much more. Here are the highlights.
  • Break the context window barrier with Amazon Bedrock AgentCore In this post, you will learn how to implement Recursive Language Models (RLM) using Amazon Bedrock AgentCore Code Interpreter and the Strands Agents SDK. By the end, you will know how to process docum
  • How fast is 10 tokens per second really? Neat little HTML app by Mike Veerman (source code here) which simulates LLM token output speeds from 5/second to 800/second. Useful if you see a model advertised as "30 tokens/second" …
  • Build AI agents for business intelligence with Amazon Bedrock AgentCore In this post, we show you how OPLOG developed three AI agents using the Strands Agents SDK, deployed them to Amazon Bedrock AgentCore, and integrated Amazon Bedrock with Anthropic’s Claude Sonnet and
  • A new experiment brings better group meetings to Google Beam See and hear your colleagues in true-to-life size and sound, making hybrid meetings feel more inclusive and connected.
  • Build an AI-powered recruitment assistant using Amazon Bedrock In this post, we demonstrate how to build an AI-powered recruitment assistant using Amazon Bedrock that brings efficiencies to candidate evaluation, generates personalized interview questions, and pro
  • Google I/O, Gemini Spark, Antigravity It's hard to find much to write about Google I/O this year because I have a policy of not writing about anything that I can't try out myself, and a …
  • Build AI-powered dashboard automation agents with NLP on Amazon Bedrock AgentCore This solution combines the power of Amazon Bedrock AgentCore, Strands Agents, and Amazon Quick transforms to deliver a secure, scalable, and intelligent system for building and operating AI agents whi
  • OlmoEarth v1.1: A more efficient family of Earth observation models A Blog post by Ai2 on Hugging Face
  • The next phase of OpenAI’s Education for Countries
  • The Interface Is No Longer the Product The future of AI may not be agents using today’s apps. It may be apps rebuilt around structured representations agents can inspect, modify, and validate directly. The deck, doc, or dashboard becomes t
  • An OpenAI model has disproved a central conjecture in discrete geometry
  • Benchmarking inference at scale: coding agents Real-world inference benchmarks for coding agents: 31% more TPS than TensorRT-LLM, 2× better TTFT at saturation, and 76% lower cost than Claude Opus 4.6.
  • May 19, 2026 Announcements Widening the conversation on frontier AI Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
  • I/O 2026 At Google I/O 2026, we shared how we’re making AI more helpful for everyone. See everything we announced.
  • How Ramp engineers accelerate code review with Codex
  • Gemini 3.5 Flash: more expensive, but Google plan to use it for everything Today at Google I/O, Google released Gemini 3.5 Flash. This one skipped the -preview modifier and went straight to general availability, and Google appear to be using it for a …
  • May 19, 2026 Announcements KPMG integrates Claude across its core business and workforce of more than 276,000 in strateg Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
  • Announcing OpenAI-compatible API support for Amazon SageMaker AI endpoints Today, Amazon SageMaker AI introduces OpenAI-compatible API support for real-time inference endpoints. If you use the OpenAI SDK, LangChain, or Strands Agents, you can now invoke models on SageMaker A
  • How AI Mode is changing the way people search in the U.S. One year after launch, see how AI Mode’s users are shifting from keywords to natural language queries.
  • Fast-tracking genetic leads to reverse cellular aging Accelerating cellular aging research
  • Introducing the Ettin Reranker Family We’re on a journey to advance and democratize artificial intelligence through open source and open science.
  • Introducing OpenAI for Singapore
  • New ways to create and get things done in Google Workspace Announcing new voice capabilities in Gmail, Docs and Keep, a new design tool called Google Pics and updates to AI Inbox.
  • Multimodal evaluators: MLLM-as-a-judge for image-to-text tasks in Strands Evals If you’re building visual shopping, image or document understanding, or chart analysis, you need a way to verify whether your model’s response is actually grounded in the source image. A text-only eva
  • I/O 2026: Welcome to the agentic Gemini era The latest from Google I/O: See how we’re helping you get more done with Gemini.
  • Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation A Blog post by NVIDIA on Hugging Face
  • Advancing content provenance for a safer, more transparent AI ecosystem
  • Gemini 3.5: frontier intelligence with action At Google I/O we released Gemini 3.5, our latest series of models combining frontier intelligence with action.
  • The last six months in LLMs in five minutes I put together these annotated slides from my five minute lightning talk at PyCon US 2026, using the latest iteration of my annotated presentation tool. # I presented this lightning …
  • PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend A Blog post by PaddlePaddle on Hugging Face
  • May 18, 2026 Announcements Anthropic acquires Stainless Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
  • A new era for AI Search We shared the next step in our journey to bring together the best of a search engine with the best of AI.
  • Simulate real-world places with Project Genie and Street View We’re connecting Project Genie with nearly 20 years of Google Street View imagery so you can create new worlds anchored in reality.
  • The Open Agent Leaderboard A Blog post by IBM Research on Hugging Face
  • Everything new in our Google AI subscriptions, fresh from I/O 2026 Introducing a $100 AI Ultra plan — plus, new features and benefits for Google AI Plus, Pro and Ultra subscribers.
  • Introducing Gemini Omni Introducing Gemini Omni, which allows you to create anything from any input and edit naturally using conversational language.
  • Introducing Google Antigravity 2.0 Google Antigravity - Build the new way
  • Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs
  • OpenAI and Dell partner to bring Codex to hybrid and on-premise enterprise environments
  • Gemini for Science: AI experiments and tools for a new era of discovery Gemini for Science is a new collection of science tools and experiments to expand the scale and precision of scientific exploration.
  • GDS weighs in on the NHS's decision to retreat from Open Source Terence Eden continues his coverage of the NHS' poorly considered decision to close down access to their open source repositories in response to vulnerabilities reported to them as part of …
  • Making it easier to understand how content was created and edited We're expanding our tools to help you understand how content was created and edited across the web.
  • Together AI and Pearl Research Labs Team Up to Reduce the Cost of AI Inference Together AI partners with Pearl Research Labs to launch a discounted Pearl-powered inference endpoint for Gemma-4-31B-it-pearl, using Proof of Useful Work to turn AI workloads into crypto emissions.
  • Strengthening Singapore’s AI Future: A New National Partnership Google DeepMind and Singapore partner to apply frontier AI to address challenges across health, education, sustainability and more through the National Partnerships for AI initiative.
  • Finding the molecular switches behind new infectious diseases Fast-tracking infectious disease research
  • Reel Friends: Building Social Discovery that Scales to Billions On its face the new Friend Bubbles feature looks simple enough. It highlights Reels your friends have watched and reacted to. But sometimes the features that seem the most straightforward require t…
  • Opening new paths in aging research Untangling the mysteries of aging
  • OpenAI and Malta partner to bring ChatGPT Plus to all citizens
  • Violin: An open-source video translation skill that breaks language barriers Violin is an open-source AI video translation tool that combines speech recognition, LLM translation, and text-to-speech to make video content accessible across languages.
  • May 14, 2026 Announcements PwC is deploying Claude to build technology, execute deals, and reinvent enterprise functions Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
  • QR code generator Generate scannable QR codes from URLs, text, or WiFi network details with customizable styling options. The tool supports multiple encoding modes, including WiFi networks with security settings, and o
  • Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality A Blog post by IBM Granite on Hugging Face
  • May 14, 2026 Announcements Anthropic forms $200 million partnership with the Gates Foundation Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
  • Not so locked in any more This Mitchell Hashimoto quote about Bun migrating from Zig to Rust reminded me of a similar conversation I had at a conference last week. I was talking to someone who …
  • How sales teams use Codex
  • Quoting Mitchell Hashimoto [...] On the interesting side is how fungible programming languages are nowadays. Programming languages used to be LOCK IN, and they're increasingly not so. You think the Bun rewrite in …
  • First Line of Defense for cq (Stack Overflow for Agents) cq helps coding agents share resolution paths and learn from past failures. We partnered with Lauren Mushro to bring VIBE✓ into cq and help review knowledge units before they enter shared memory.
  • Unlocking asynchronicity in continuous batching We’re on a journey to advance and democratize artificial intelligence through open source and open science.
  • May 13, 2026 Announcements Introducing Claude for Small Business
  • Introducing voice finder — a new tool to quickly find the right voice for your app from over 600+ voices Voice finder helps developers search, match, filter, and audition 600+ voices across Together AI TTS models using natural-language prompts or uploaded audio samples.
  • BalCapRL: A Balanced Framework for RL-Based MLLM Image Captioning Image captioning is one of the most fundamental tasks in computer vision. Owing to its open-ended nature, it has received significant…
  • Serving DeepSeek-V4: why million-token context is an inference systems problem DeepSeek-V4 makes million-token context a serving-systems problem. Together AI explores the inference work behind V4 on NVIDIA HGX B200, including compressed KV layouts, prefix caching, kernel maturit
  • Building Blocks for Foundation Model Training and Inference on AWS A Blog post by Amazon on Hugging Face
  • Adaptive Parallel Reasoning: The Next Paradigm in Efficient Inference Scaling The BAIR Blog
  • Large-Scale High-Quality 3D Gaussian Head Reconstruction from Multi-View Captures We propose HeadsUp, a scalable feed-forward method for reconstructing high-quality 3D Gaussian heads from large-scale multi-camera setups…
  • Apple Workshop on Privacy-Preserving Machine Learning & AI 2026 At Apple, we believe privacy is a fundamental human right. As AI capabilities increase and become more integrated into people’s daily…
  • Velox: Learning Representations of 4D Geometry and Appearance We introduce a framework for learning latent representations of 4D objects which are descriptive, faithfully capturing object geometry and…
  • RVPO: Risk-Sensitive Alignment via Variance Regularization Current critic-less RLHF methods aggregate multi-objective rewards via an arithmetic mean, leaving them vulnerable to constraint neglect:…
  • Octonous Open Beta: What We've Learned and Where We're Going The Octonous open beta is live. Learn what we discovered during closed beta, the workflow patterns users kept returning to, and the biggest improvements shipped since launch.
  • Deploy and inference any model from HuggingFace Learn how to deploy any Hugging Face model in one session using Goose and Together's Dedicated Container Inference. Skip the setup complexity — one prompt gets your model running in a production-grade
  • What Matters in Practical Learned Image Compression One of the major differentiators unlocked by learned codecs relative to their hard-coded traditional counterparts is their ability to be…
  • Text-Conditional JEPA for Learning Semantically Rich Visual Representations Image-based Joint-Embedding Predictive Architecture (I-JEPA) offers a promising approach to visual self-supervised learning through masked…
  • Sovereign AI: Control, Choice, and Why It Goes Beyond Geopolitics Sovereign AI shows up across nations, companies, communities, and individuals. This piece, based on a conversation with John Dickerson, CEO at Mozilla.ai, looks at control over AI systems, avoiding si
  • May 6, 2026 Announcements Higher usage limits for Claude and a compute deal with SpaceX We’ve raised Claude's usage limits and agreed a new compute partnership with SpaceX that will substantially increase our capacity in the near term.
  • vLLM V0 to V1: Correctness Before Corrections in RL A Blog post by ServiceNow-AI on Hugging Face
  • SpecMD: A Comprehensive Study on Speculative Expert Prefetching Mixture-of-Experts (MoE) models enable sparse expert activation, meaning that only a subset of the model’s parameters is used during each…
  • From Where Things Are to What They’re For: Benchmarking Spatial–Functional Intelligence for Multimodal LLMs True spatial intelligence for multimodal agents transcends low-level geometric perception, evolving from knowing where things are to…
  • Normalizing Flows with Iterative Denoising Normalizing Flows (NFs) are a classical family of likelihood-based methods that have received revived attention. Recent efforts such as…
  • Foundational research powering efficient inference at scale As AI moves from research to production, the challenge for AI-native teams shifts from building models to running them — efficiently, reliably, and at scale.
  • Sequoia Ascent 2026 summary Summary of my talk at Sequoia Ascent
  • From 732 bytes to nowhere: shutting down Copy Fail in production
  • Announcing Together AI and Adaption Partnership Together AI and Adaption partner to bring Together Fine-Tuning natively into Adaptive Data, helping teams optimize datasets, run fine-tuning, evaluate results, and deploy stronger open models.
  • DeepSeek-V4 Pro now available on Together AI DeepSeek-V4 Pro is now available on Together AI with 512K context, controllable reasoning modes, and cached-input pricing for long-context reasoning workloads like code agents, document intelligence,
  • Modernizing the Facebook Groups Search to Unlock the Power of Community Knowledge We’ve fundamentally transformed Facebook Groups Search to help people more reliably discover, sort through, and validate community content that’s most relevant to them. We’ve adopted a new hybrid r…
  • Gradient-based Planning for World Models at Longer Horizons The BAIR Blog
  • My Workflow for Understanding LLM Architectures A learning-oriented workflow for understanding new open-weight model releases
  • Capacity Efficiency at Meta: How Unified AI Agents Optimize Performance at Hyperscale We’re sharing insights into Meta’s Capacity Efficiency Program, where we’ve built an AI agent platform that helps automate finding and fixing performance issues throughout our inf…
  • Introducing Claude Design by Anthropic Labs
  • Introducing Claude Opus 4.7
  • Locally AI joins LM Studio Adrien and the Locally AI apps are joining the LM Studio family to double down on Apple platforms
  • Encoderfile’s New Format: Why a “Dull” Design Wins Encoder models power most NLP in production, but deploying them still means dragging along Python runtimes and dependencies. Encoderfile introduces a single executable with an appended payload and a f
  • How Meta Used AI to Map Tribal Knowledge in Large-Scale Data Pipelines AI coding assistants are powerful but only as good as their understanding of your codebase. When we pointed AI agents at one of Meta’s large-scale data processing pipelines – spanning four re…
  • Components of A Coding Agent How coding agents use tools, memory, and repo context to make LLMs work better in practice
  • KernelEvolve: How Meta’s Ranking Engineer Agent Optimizes AI Infrastructure This is the second post in the Ranking Engineer Agent blog series exploring the autonomous AI capabilities accelerating Meta’s Ads Ranking innovation. The previous post introduced Ranking Eng…
  • Meta Adaptive Ranking Model: Bending the Inference Scaling Curve to Serve LLM-Scale Models for Ads Meta continues to lead the industry in utilizing groundbreaking AI Recommendation Systems (RecSys) to deliver better experiences for people, and better results for advertisers. To reach the next fr…
  • Ollama is now powered by MLX on Apple Silicon in preview Today, we're previewing the fastest way to run Ollama on Apple silicon, powered by MLX, Apple's machine learning framework.
  • AI for American-Produced Cement and Concrete Meta is continuing its long-term roadmap to help the construction industry leverage AI to produce high-quality and more sustainable concrete mixes, as well as those exclusively produced in the Unit…
  • The Hardest Part of Running a Small Business in the Trades Running a small trade business includes a steady flow of admin work: quotes, scheduling, invoices, payments, and more. This post looks at how that workload builds up and introduces Clawbolt, a focused
  • Hardening Your LLM Dependency Supply Chain When source code and distributed packages don’t match, risks increase. This breakdown of the LiteLLM incident shares what to watch for and how to reduce exposure.
  • A Visual Guide to Attention Variants in Modern LLMs From MHA and GQA to MLA, sparse attention, and hybrid architectures
  • cq: Stack Overflow for Agents cq explores a Stack Overflow for agents, a shared commons where agents can query past learnings, contribute new knowledge, and avoid repeating the same mistakes in isolation.
  • Run open models on NVIDIA DGX Station GB300 LM Studio now supports NVIDIA DGX Station - GB300 Blackwell in a form factor you can run outside of the data center
  • llamafile Reloaded: What’s New in v0.10.0 llamafile 0.10.0 unifies portability and modern model features. Bundle weights, run multimodal models, and access tool calling and Anthropic Messages API support, all from a single executable.
  • Friend Bubbles: Enhancing Social Discovery on Facebook Reels Friend bubbles in Facebook Reels highlight Reels your friends have liked or reacted to, helping you discover new content and making it easier to connect over shared interests. This article explains…
  • Ranking Engineer Agent (REA): The Autonomous AI Agent Accelerating Meta’s Ads Ranking Innovation Meta’s Ranking Engineer Agent (REA) autonomously executes key steps across the end-to-end machine learning (ML) lifecycle for ads ranking models. This post covers REA’s ML experimentation capabilit…
  • Identifying Interactions at Scale for LLMs The BAIR Blog
  • A Dream of Spring for Open-Weight LLMs: 10 Architectures from Jan-Feb 2026 A Round Up And Comparison of 10 Open-Weight LLM Releases in Spring 2026
  • The simplest and fastest way to setup OpenClaw Setup OpenClaw in under two minutes with a single Ollama command.
  • Subagents and web search in Claude Code Ollama now supports subagents and web search in Claude Code.
  • Claude is a space to think We’ve made a choice: Claude will remain ad-free. We explain why advertising incentives are incompatible with a genuinely helpful AI assistant, and how we plan to expand access without compromising use
  • OpenClaw OpenClaw is a personal AI assistant that connects your messaging apps to local AI coding agents, all running on your own device.
  • Use your LM Studio Models in Claude Code Run Claude Code with any local model using LM Studio's Anthropic-compatible API
  • Introducing LM Studio 0.4.0 Server deployment, parallel requests with continuous batching, new REST API endpoint, and refreshed application UI
  • Categories of Inference-Time Scaling for Improved LLM Reasoning And an Overview of Recent Inference-Scaling Papers
  • ollama launch ollama launch is a new command which sets up and runs coding tools like Claude Code, OpenCode, and Codex with local or cloud models. No environment variables or config files needed.
  • Image generation (experimental) Generate images locally with Ollama on macOS. Windows and Linux support coming soon.
  • Claude Code with Anthropic API compatibility Ollama is now compatible with the Anthropic Messages API, making it possible to use tools like Claude Code with open models.
  • Open Responses with local models via LM Studio Update to LM Studio 0.3.39 for Open Responses support
  • OpenAI Codex with Ollama Open models can be used with OpenAI's Codex CLI through Ollama. Codex can read, modify, and execute code in your working directory using models such as gpt-oss:20b, gpt-oss:120b, or other open-weight
  • Information-Driven Design of Imaging Systems The BAIR Blog
  • NVIDIA Rubin Platform, Open Models, Autonomous Driving: NVIDIA Presents Blueprint for the Future at CES NVIDIA founder and CEO Jensen Huang opened CES in Las Vegas with Rubin — NVIDIA’s first extreme-codesigned AI platform — plus open models for healthcare, robotics and autonomy, and a Mercedes-Benz CLA
  • LM Studio 0.3.37 LFM2 tool call support and a generator stability fix
  • LM Studio 0.3.38 Mac M5 MLX fix, enable optimized MLX auto-upgrade
  • The State Of LLMs 2025: Progress, Problems, and Predictions A 2025 review of large language models, from DeepSeek R1 and RLVR to inference-time scaling, benchmarks, architectures, and predictions for 2026.
  • LLM Research Papers: The 2025 List (July to December) In June, I shared a bonus article with my curated and bookmarked research paper lists to the paid subscribers who make this Substack possible.
  • How to fine-tune FunctionGemma and run it locally Step by step guide for fine-tuning FunctionGemma with Unsloth, and then running it in LM Studio
  • 2025 LLM Year in Review 2025 Year in Review of LLM paradigm changes
  • Chemical hygiene An evolving guide of protecting your health from a pricemaxxing industry.
  • LM Studio 0.3.36 Support for Google's FunctionGemma (270M)
  • As AI Grows More Complex, Model Builders Rely on NVIDIA Unveiling what it describes as the most capable model series yet for professional knowledge work, OpenAI launched GPT-5.2 in December. The model was trained and deployed on NVIDIA infrastructure, incl
  • Auto-grading decade-old Hacker News discussions with hindsight A vibe coding thought exercise on what it might look like for LLMs to scour human historical data at scale and in retrospect.
  • LM Studio 0.3.35 Devstral-2, GLM-4.6V, and system prompt fixes
  • From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates Understanding How DeepSeek's Flagship Open-Weight Models Evolved
  • The space of minds On the space of minds and the optimizations that give rise to them.
  • Verifiability The impact of verifiability on the jagged frontier of LLMs
  • Beyond Standard LLMs Linear Attention Hybrids, Text Diffusion, Code World Models, and Small Recursive Transformers
  • RL without TD learning The BAIR Blog
  • OpenAI gpt-oss-safeguard Ollama is partnering with OpenAI and ROOST (Robust Open Online Safety Tools) to bring the latest gpt-oss-safeguard reasoning models to users for safety classification tasks. gpt-oss-safeguard models a
  • MiniMax M2 MiniMax M2 is now available on Ollama's cloud. It's a model built for coding and agentic workflows.
  • Animals vs Ghosts Today's frontier LLM research is not about building animals. It is about summoning ghosts. And a bit more on Sutton's Dwarkesh pod.
  • Reaching Across the Isles: UK-LLM Brings AI to UK Languages With NVIDIA Nemotron Trained on the Isambard-AI supercomputer, UK-LLM enables AI reasoning for Welsh and other UK languages for public services.
  • It’s the Humidity: How International Researchers in Poland, Deep Learning and NVIDIA GPUs Could Change the Forecast For more than a century, meteorologists have chased storms with chalkboards, equations, and now, supercomputers. But for all the progress, they still stumble over one deceptively simple ingredient: wa
  • What exactly does word2vec learn? The BAIR Blog
  • Applications Now Open for $60,000 NVIDIA Graduate Fellowship Awards The NVIDIA Graduate Fellowship Program provides grants, mentors and technical support to doctoral students doing outstanding research relevant to NVIDIA technologies. The application deadline for the
  • NVIDIA Research Shapes Physical AI AI and graphics research breakthroughs in neural rendering, 3D generation and world simulation power robotics, autonomous vehicles and content creation.
  • Isambard-AI, the UK’s Most Powerful AI Supercomputer, Goes Live The University of Bristol’s Isambard-AI, powered by NVIDIA Grace Hopper Superchips, delivers 21 exaflops of AI performance, making it the fastest system in the U.K. and among the most energy-efficient
  • A Gaming GPU Helps Crack the Code on a Thousand-Year Cultural Conversation The world of ancient ceramics has relied on expert eyes for millennia; at University Putra Malaysia and UNSW Sydney, a new AI, running on standard gaming hardware, is changing how people determine the
  • Whole-Body Conditioned Egocentric Video Prediction The BAIR Blog
  • NVIDIA CEO Drops the Blueprint for Europe’s AI Boom In Paris, Jensen Huang laid out how the continent is scaling up with Blackwell-powered factories, agentic AI and sovereign clouds — all part of Europe’s new intelligence infrastructure.
  • NVIDIA Releases New AI Models and Developer Tools to Advance Autonomous Vehicle Ecosystem NVIDIA today released NVIDIA Cosmos Predict-2 — a new world foundation model with improved future world state prediction capabilities for high-quality synthetic data generation.
  • Why We Think Special thanks to John Schulman for a lot of super valuable feedback and direct edits on this post. Test time compute (Graves et al. 2016, Ling, et al. 2017, Cobbe et al. 2021) and Chain-of-thought (C
  • Vibe coding MenuGen Work log of vibe coding menugen app
  • Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign) The BAIR Blog
  • Repurposing Protein Folding Models for Generation with Latent Diffusion The BAIR Blog
  • Power to the people: How LLMs flip the script on technology diffusion Yes
  • Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment The BAIR Blog
  • Finding the Best Sleep Tracker Finding the best sleep tracker with data
  • Reward Hacking in Reinforcement Learning Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely learning or completing the intended task.
  • Extrinsic Hallucinations in LLMs Hallucination in large language models usually refers to the model generating unfaithful, fabricated, inconsistent, or nonsensical content. As a term, hallucination has been somewhat generalized to ca
  • Diffusion Models for Video Generation Diffusion models have demonstrated strong results on image synthesis in past years. Now the research community has started working on a harder task—using it for video generation. The task itself is a
  • Thinking about High-Quality Human Data [Special thank you to Ian Kivlichan for many useful pointers (E.g. the 100+ year old Nature paper “Vox populi”) and nice feedback. 🙏 ] High-quality data is the fuel for modern data deep learning model
  • Adversarial Attacks on LLMs The use of large language models in the real world has strongly accelerated by the launch of ChatGPT. We (including my team at OpenAI, shoutout to them) have invested a lot of effort to build default
  • LLM Powered Autonomous Agents Building agents with LLM (large language model) as its core controller is a cool concept. Several proof-of-concepts demos, such as AutoGPT, GPT-Engineer and BabyAGI, serve as inspiring examples. The p
  • Prompt Engineering Prompt Engineering, also known as In-Context Prompting, refers to methods for how to communicate with LLM to steer its behavior for desired outcomes without updating the model weights. It is an empiri
  • The Transformer Family Version 2.0 Many new Transformer architecture improvements have been proposed since my last post on “The Transformer Family” about three years ago. Here I did a big refactoring and enrichment of that 2020 post —
  • Large Transformer Model Inference Optimization [Updated on 2023-01-24: add a small section on Distillation.] Large transformer models are mainstream nowadays, creating SoTA results for a variety of tasks. They are powerful but very expensive to tr
  • A conversation with Kevin Scott: What’s next in AI
  • From Hot Wheels to handling content: How brands are using Microsoft AI to be more productive and imaginative When designers at the toy company Mattel were asked recently to come up with a new Hot Wheels model car, they sought inspiration from DALL∙E 2, an AI system developed by OpenAI that creates custom ima
  • Microsoft open sources its ‘farm of the future’ toolkit FARMINGTON, Wash. – The gently rolling hills here in eastern Washington have long grown rich harvests of wheat, barley and lentils. Fifth-generation farmer Andrew Nelson is adding a new bumper crop to
  • How data and AI will transform contact centres for financial services
  • AI-equipped drones study dolphins on the edge of extinction
  • Online math tutoring service uses AI to help boost students’ skills and confidence Eedi, a London education startup, is using AI from Microsoft Research to personalize math learning for students in the early years of education.
  • AI-Mimi is building inclusive TV experiences for Deaf and Hard of Hearing user in Japan
  • Microsoft’s framework for building AI systems responsibly Today we are sharing publicly Microsoft’s Responsible AI Standard, a framework to guide how we build AI systems. It is an important step in our journey to develop better, more trustworthy AI. We are r
  • Singapore develops Asia’s first AI-based mobile app for shark and ray fin identification to combat illegal wildlife trade
  • The opportunity at home – can AI drive innovation in personal assistant devices and sign language?

Papers & Preprints (47 articles)

0 of 47 read
  • Tokenisation via Convex Relaxations Tokenisation is an integral part of the current NLP pipeline. Current tokenisation algorithms such as BPE and Unigram are greedy algorithms -- they make locally optimal decisions without considering t
  • Which Way Did It Move? Diagnosing and Overcoming Directional Motion Blindness in Video-LLMs Video Large Language Models (Video-LLMs) have made rapid progress on temporal video understanding, yet many fail at a basic perceptual primitive: signed image-plane motion direction. On simple videos
  • Vector Policy Optimization: Training for Diversity Improves Test-Time Search Language models must now generalize out of the box to novel environments and work inside inference-scaling search procedures, such as AlphaEvolve, that select rollouts with a variety of task-specific
  • Evaluating Commercial AI Chatbots as News Intermediaries AI chatbots are rapidly shaping how people encounter the news, yet no prior study has systematically measured how accurately these systems, with their proprietary search integrations and retrieval-syn
  • Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation Discrete diffusion models are often trained through clean-data prediction, but the prediction can be used in different ways to define the reverse dynamics. In Masked Diffusion Models (MDM) these choic
  • Integrable Elasticity via Neural Demand Potentials We propose the Integrable Context-Dependent Demand Network (ICDN), a demand-first neural model for multiproduct retail demand. The model learns log-demand as a smooth, context-conditioned function of
  • Cambrian-P: Pose-Grounded Video Understanding Camera pose matters. The position and orientation of each viewpoint define a shared spatial coordinate frame that relates observations across video frames. Yet this signal is largely absent from multi
  • The Matching Principle: A Geometric Theory of Loss Functions for Nuisance-Robust Representation Learning Robustness, domain adaptation, photometric and occlusion invariance, compositional generalisation, temporal robustness, alignment safety, and classical anisotropic regularisation are usually treated a
  • Reducing Political Manipulation with Consistency Training Large language models (LLMs) exhibit systematic political bias across a variety of sensitive contexts. We find that LLMs handle counterpart topics from opposing political sides asymmetrically. We refe
  • Plug-in Losses for Evidential Deep Learning: A Simplified Framework for Uncertainty Estimation that Includes the Softmax Classifier Real-world sensor-based learning systems require uncertainty estimation that is both reliable and computationally efficient. Evidential Deep Learning (EDL) provides single-pass uncertainty estimation
  • MotiMotion: Motion-Controlled Video Generation with Visual Reasoning Current motion-controlled image-to-video generation models rigidly follow user-provided trajectories that are often sparse, imprecise, and causally incomplete. Such reliance often yields unnatural or
  • Remember to be Curious: Episodic Context and Persistent Worlds for 3D Exploration Exploration is a prerequisite for learning useful behaviors in sparse-reward, long-horizon tasks, particularly within 3D environments. Curiosity-driven reinforcement learning addresses this via intrin
  • Finite-Particle Convergence Rates for Conservative and Non-Conservative Drifting Models We propose and analyze a conservative drifting method for one-step generative modeling. The method replaces the original displacement-based drifting velocity by a kernel density estimator (KDE)-gradie
  • Understanding Data Temporality Impact on Large Language Models Pre-training Large language models (LLMs) are typically trained on shuffled corpora, yielding models whose knowledge is frozen at train time and whose temporal grounding remains poorly understood. In this work, we
  • Proxy-Based Approximation of Shapley and Banzhaf Interactions Shapley and Banzhaf interactions capture the complex dynamics inherent in modern machine learning applications. However, current estimators for these higher-order interactions trade off between speed
  • Efficient Agentic Reasoning Through Self-Regulated Simulative Planning Join the discussion on this paper page
  • AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation Vision-and-Language Navigation (VLN) requires an agent to ground language instructions to its own movement within a visual environment. While state-of-the-art methods leverage the reasoning capabiliti
  • MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems Autonomous agentic systems are largely static after deployment: they do not learn from user interactions, and recurring failures persist until the next human-driven update ships a fix. Self-evolving a
  • FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly Detection Production systems generate millions of log lines daily, yet most anomaly detectors operate at the session or window-level, flagging groups of lines rather than identifying the specific message respon
  • ChronoMedKG: A Temporally-Grounded Biomedical Knowledge Graph and Benchmark for Clinical Reasoning Biomedical knowledge graphs (KGs) treat disease associations as static facts, but temporal information is crucial for clinical reasoning, e.g., a symptom diagnostic of one disease at age 3 may imply a
  • Multiple Neural Operators Achieve Near-Optimal Rates for Multi-Task Learning We study the approximation and statistical complexity of learning collections of operators in a shared multi-task setting, with a focus on the Multiple Neural Operators (MNO) architecture. For broad c
  • AnyMo: Geometry-Aware Setup-Agnostic Modeling of Human Motion in the Wild Join the discussion on this paper page
  • GesVLA: Gesture-Aware Vision-Language-Action Model Embedded Representations Vision-Language-Action (VLA) models have shown strong potential for general-purpose robot manipulation by unifying perception and action. However, existing VLA systems primarily rely on textual instru
  • Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention Linear attention replaces the unbounded cache of softmax attention with a fixed-size recurrent state, reducing sequence mixing to linear time and decoding to constant memory. The hard part is not just
  • Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models We investigate whether acoustic emotion recognition models can serve as proxies for the Pathos dimension in political speech analysis, as operationalised by the TRUST multi-agent large language model
  • Beyond Temperature: Hyperfitting as a Late-Stage Geometric Expansion Recent work has identified a counterintuitive phenomenon termed "Hyperfitting", where fine-tuning Large Language Models (LLMs) to near-zero training loss on small datasets surprisingly enhances open-e
  • FashionLens: Toward Versatile Fashion Image Retrieval via Task-Adaptive Learning Join the discussion on this paper page
  • Sensor2Sensor: Cross-Embodiment Sensor Conversion for Autonomous Driving Robust training and validation of Autonomous Driving Systems (ADS) require massive, diverse datasets. Proprietary data collected by Autonomous Vehicle (AV) fleets, while high-fidelity, are limited in
  • LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent Systems Large language model (LLM)-based multi-agent systems increasingly rely on intermediate communication to coordinate complex tasks. While most existing systems communicate through natural language, rece
  • AMEL: Accumulated Message Effects on LLM Judgments Large language models are routinely used as automated evaluators: to review code, moderate content, or score outputs, often with many items passing through one conversation. We ask whether the polarit
  • A Martingale Kernel Independence Test The Hilbert-Schmidt Independence Criterion (HSIC) and its joint-independence extension $d\mathrm{HSIC}$ are degenerate $V$-statistics whose data-dependent weighted-$χ^2$ null limits force a permutatio
  • SEGA: Spectral-Energy Guided Attention for Resolution Extrapolation in Diffusion Transformers Join the discussion on this paper page
  • DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback LLM-powered AI agents require high-frequency state exploration (e.g., test-time tree search and reinforcement learning), relying on rapid checkpoint and rollback (C/R) of the complete sandbox state, i
  • Synthetic Data Alone is Enough? Rethinking Data Scarcity in Pediatric Rare Disease Recognition Children with rare genetic diseases often exhibit distinctive facial phenotypes, yet developing computer vision systems for early diagnosis remains challenging due to extreme data scarcity, privacy co
  • Tokenization with Split Trees We introduce Tokenization with Split Trees (ToaST), a subword tokenization method that directly optimizes compression under a new recursive inference procedure. ToaST greedily splits each pretoken int
  • Generative Modeling by Value-Driven Transport We propose a new framework for generative modeling based on a discrete-time stochastic control formulation of measure transport. Adapting classic results from control theory, we formulate our problem
  • DecQ: Detail-Condensing Queries for Enhanced Reconstruction and Generation in Representation Autoencoders Join the discussion on this paper page
  • SDPM: Survival Diffusion Probabilistic Model for Continuous-Time Survival Analysis Survival analysis aims to estimate a time-to-event distribution from data with censored observations. Many existing methods either impose structural assumptions on the hazard function or discretize th
  • Spectral Tail Auxiliary Learning for AI-Generated Image Detection As generative image models evolve rapidly, the perceptual gap between generated and real images continues to narrow, making AI-generated image detection increasingly challenging. Many existing methods
  • MambaGaze: Bidirectional Mamba with Explicit Missing Data Modeling for Cognitive Load Assessment from Eye-Gaze Tracking Data Real-time cognitive load assessment from eye-tracking signals could potentially enable adaptive human-centered-AI such as safety-critical applications such as driver vigilance monitoring or automated
  • WorldKV: Efficient World Memory with World Retrieval and Compression Autoregressive video diffusion models have enabled real-time, action-conditioned world generation. However, sustaining a persistent world, where revisiting a previously seen viewpoint yields consisten
  • CogAdapt: Transferring Clinical ECG Foundation Models to Wearable Cognitive Load Assessment via Lead Adaptation Real-time cognitive load assessment is essential for adaptive human-computer interaction but remains challenging due to limited labeled data and poor cross-subject generalization. Recent ECG foundatio
  • "I didn't Make the Micro Decisions": Measuring, Inducing, and Exposing Goal-Level AI Contributions in Collaboration Join the discussion on this paper page
  • Platonic Representations in the Human Brain: Unsupervised Recovery of Universal Geometry Join the discussion on this paper page
  • Disentangling Sampling from Training Budget in Class-Imbalanced CT Body Composition Segmentation Join the discussion on this paper page
  • Forecasting Downstream Performance of LLMs With Proxy Metrics Join the discussion on this paper page
  • Lean Refactor: Multi-Objective Controllable Proof Optimization via Agentic Strategy Search Join the discussion on this paper page

Trending Research (48 articles)

0 of 48 read
  • Transformers Can Overcome the Curse of Dimensionality: A Theoretical Study from an Approximation Perspective
  • Online Bernstein-von Mises theorem
  • Covariate-dependent Hierarchical Dirichlet Processes
  • DCatalyst: A Unified Accelerated Framework for Decentralized Optimization
  • Boosted Control Functions: Distribution Generalization and Invariance in Confounded Models
  • Contrasting Local and Global Modeling with Machine Learning and Satellite Data: A Case Study Estimating Tree Canopy Height in African Savannas
  • A Symplectic Analysis of Alternating Mirror Descent
  • Neural operators for free-boundary problems
  • Two-way Node Popularity Model for Directed and Bipartite Networks
  • Deep neural operator for free boundary problems
  • Convergence and complexity of block majorization-minimization for constrained block-Riemannian optimization
  • Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood
  • Plagiarism of ideas in the age of generative artificial intelligence
  • Stop ‘tokenmaxxing’ and deploy AI sensibly instead
  • SpecGP as a transformer-based model for predicting energy-adaptable structural spectra of glycopeptides
  • Immunotherapy drug target identification using machine learning and patient-derived tumour explant validation
  • A strong sustainability approach to AI development
  • A generative artificial intelligence approach for peptide antibiotic optimization
  • Honey, I Shrunk the Hypothesis Space (Through Logical Preprocessing)
  • TeamTTA: Efficient Multi-Device Collaboration for Open-Set Test-Time Adaptation via Cloud Integration
  • A Review of Causal Decision Making
  • Improving Plan Execution Flexibility using Block-Substitution
  • Rational Silence and False Polarization: How Viewpoint Organizations and Recommender Systems Distort the Expression of Public Opinion
  • Understanding AI Trustworthiness: A Scoping Review of AIES & FAccT Articles
  • Resource Efficient Sleep Staging via Multi-Level Masking and Prompt Learning
  • AutoMalDesc: Large-Scale Script Analysis for Cyber Threat Research
  • Beyond Content: A Comprehensive Speech Toxicity Dataset and Detection Framework Incorporating Paralinguistic Cues
  • Modulation-Based Backdoors: Leveraging Amplitude and Frequency Patterns to Attack Speaker Recognition
  • Learning Structurally Stabilized Representations for Lossless DNA Storage
  • ViG-RAG: Video-aware Graph Retrieval-Augmented Generation via Temporal and Semantic Hybrid Reasoning
  • Transferable Backdoor Attacks for Code Models via Sharpness-Aware Adversarial Perturbation
  • Toward Multimodal Fake News Detection by Multi-perspective Rationale Generation and Verification
  • RTMol: Rethinking Molecule-text Alignment in a Round-trip View
  • Physical-regularized Hierarchical Generative Model for Metallic Glass Structural Generation and Energy Prediction
  • Label-Aware Pseudo-Training Sample Generation for Text Classification
  • General Supervised Learning Framework for Open World Classification
  • Scaling Neuro-symbolic Problem Solving: Solver-Free Learning of Constraints and Objectives
  • Fine-tuning vs. In-context Learning in Large Language Models: A Formal Language Learning Perspective Large language models (LLMs) operate in two fundamental learning modes - fine-tuning (FT) and in-context learning (ICL) - raising key questions about which mode yields greater language proficiency and
  • Probabilistically Tightened Linear Relaxation-based Perturbation Analysis for Neural Network Verification
  • Data-Driven Motion Planning: A Survey on Deep Neural Networks, Reinforcement Learning, and Large Language Model Approaches Motion planning is a fundamental challenge in robotics, involving the creation of trajectories from start to goal states while meeting constraints like collision avoidance and joint limits. Its comple
  • Detecting Fake News in Urdu Language Using Machine Learning, Deep Learning, and Large Language Model-Based Approaches
  • Stylometry-driven framework for Urdu intrinsic plagiarism detection: a comprehensive analysis using machine learning, deep learning, and large language models Detecting plagiarism in documents is a well-established task in natural language processing (NLP). Broadly, plagiarism detection is categorized into two types (1) intrinsic: to check the whole documen
  • Neural Data Augmentation for Legal Overruling Task: Small Deep Learning Models vs. Large Language Models Deep learning models produce impressive results in any natural language processing applications when given a better learning strategy and trained with large labeled datasets. However, the annotation o
  • LaRA: Large Rank Adaptation for Speech and Text Cross-Modal Learning in Large Language Models Zuhair Hasan Shaik, Pradyoth Hegde, Prashant Bannulmath, Deepak K T. Findings of the Association for Computational Linguistics: EMNLP 2024. 2024.
  • Detecting Health Misinformation on Social Networking Sites Using Large Language Models and Deep Learning-based Natural Language Processing Health misinformation on social networking sites (SNS) is a critical issue, particularly during health crises like the COVID-19 pandemic. The spread of inaccurate health information can lead to severe
  • 20.5 C-Transformer: A 2.6-18.1μJ/Token Homogeneous DNN-Transformer/Spiking-Transformer Processor with Big-Little Network and Implicit Weight Generation for Large Language Models Recently, transformer-based large language models (LLMs), shown in Fig. 20.5.1, are widely used, and even on-device LLM systems with real-time responses are anticipated [1]. Many transformer processor
  • Large Language Models Are Zero-Shot Fuzzers: Fuzzing Deep-Learning Libraries via Large Language Models
  • Görsel dikkat modeli ve derin öğrenme yöntemleri kullanılarak geniş dağarcıklı ayrık işaret dili tanıma sisteminin modellenmesi (Modeling a large vocabulary isolated sign language recognition system using visual attention model and deep learning methods) Yükseköğretim Kurulu Tez Merkezi'nde bulunan basılı bütün tezleri tarayarak, üye olduktan sonra izinli tezlere tam metin(pdf) olarak erişebilirisiniz.

Discussions (120 articles)

0 of 120 read
  • How do check ChatGPT Pro Quota?
  • I read threads complaining about claude every week... tf are y'alls workflows?
  • Qwen-27B-IQ4_KS for ik_llama.cpp, especially for NVIDIA with 16GB VRAM
  • How it feels asking admins for usage for the 10th time that day
  • AI Safety Sacrifice
  • Mfs will do anything but study for the exam .
  • Why do data centers use fresh water?
  • NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) [P]
  • AI training is becoming the new coding revolution
  • Foods as high fashion
  • Stop losing great answers in long AI conversations
  • 3 questions to ask yourself before shipping AI-generated code
  • OpenBMB presents the model BitCPM-CANN 1.58 bit
  • If your job requires zero intelligence
  • Aged like fine WINE
  • Elon?
  • Decentralized Distributed AI Breakthrough: How the World's Colleges and Universities Can Rival the AI Giants
  • Clients that rely on ChatGPT for ideas
  • Rethinking AI Bubble
  • One thing that's been bothering me lately: benchmark performance often tells me almost nothing about whether a workflow will survive production usage.[D]
  • Claude Code dropped /workflows
  • Notification for Telegram is now MCP-Compatible — Let AI Send Telegram Messages from WordPress
  • [llama.cpp] Asymmetric KV q8/q4 cache: current caveats and discussion in GGML repo
  • After comparing Claude Max $100 and ChatGPT Pro $100 side by side on actual billable work, I'm cancelling my ChatGPT Pro subscription
  • AI-generated stories secretly won 3 of 5 fiction awards
  • This is how I generate a full EV industry research report from one prompt using sense nova skills
  • My experience using Claude code with Local Llm, and full guide on how to set it up
  • People against AI put up these fake advertisements on the London Underground
  • I'm cancelling my ChatGPT Pro subscription
  • [NEW] Supra-50M Released!
  • Live Human Detector on Outbound Phone Calls [R]
  • Multi-agent AI systems are now automating scientific discovery and nobody seems ready
  • Which MCP servers are actually changing your Claude workflow? Sharing mine
  • GPT-5.2 matches top human reviewers in Nature peer review study
  • Google just declared "Google Search is AI Search" at I/O 2026
  • How did Gpt solve the erdos problem? A demonstration: less like “AI did math” and more like “AI found the hidden layer under the picture”
  • I asked ChatGPT “You personally as an AI with all you know and all you’ve seen and all you’ve learned since your conception what do you hate most about humanity” The Answer was pretty deep.
  • No longer have access to extended pro or heavy thinking after UI update
  • DeepSeek is pushing forward with $10.29 billion financing round, with Liang Wenfeng committing to continue developing open-source AI models rather than pursuing short-term commercialization goals
  • Glasses will fail
  • SpaceXAI locked Anthropic into paying them $1.25 billion per MONTH for compute
  • Math grad student friend says we're cooked
  • Novel Problems in VLA [R]
  • HTML instead of Markdown
  • Microsoft Cancels Internal Anthropic Licenses As Shift To Token-Based AI Billing Blows Up Annual Budgets In Months
  • All it takes is another Steinberger to open source a vastly more intelligent AI...
  • Average ChatGPT user after one successful prompt 💀
  • They just destroyed the Pro model with the new update
  • This just happened
  • The Case for Evaluating Model Behaviors
  • New Release of ROCm based MLX LLM Engine - lemon-mlx-engine
  • ​"Google has a whole department whose only job is to steal startups."
  • 2024 vs 2026
  • Can liveness detection models generalise to synthetic media generation techniques they were never trained on? [D]
  • Out of the Box
  • Interesting Response from Gemini
  • Do not trust AI chat memes
  • Same prompt on ChatGPT and Gemini got two totally different images. Not even close lol...
  • Just heard Anthropic added another star to their lineup… 🤣
  • make no mistakes
  • Just give me the F bro 😭
  • When your LLM treats data center GPUs like an optional DLC
  • Updated chatGPT web gui lacks reasoning selector for thinking or pro models: no more extended pro :(
  • Lisbon Machine Learning School (LxMLS 2026) [D]
  • Could AI eventually become something like a system that expands human understanding for humanity
  • CC service down for everyone or just me?
  • Meta laid off 10% of its workforce as Mark Zuckerberg warns that in the AI race "success isn’t a given"
  • My LinkedIn network is about to be aggressively flooded with Claude Code certifications
  • Top mathematician Timothy Gowers: "AI has now solved a major open problem ... one that many mathematicians had tried."
  • Loaded the new washer-dryer manual in a project as .toml files so my girlfriend knows how to use it
  • Do VLMs in production still use fixed-patch ViTs for their vision capabilities? [D]
  • this tweet aged in the funniest possible way
  • Qwen3.6 35Ba3 has changed my workflows and even how I use my computer
  • College Graduation Ceremony Erupts In Boos After 'New AI System' Allegedly Misses 'Hundreds' Of Graduates' Names
  • Why new grads are booing commencement speakers: There's an 'ambient anxiety that AI is going to make things dramatically worse'
  • Anthropic officially launched 13+ FREE AI courses with certificates (Including Agentic AI and Claude Code!)
  • OpenAI cofounder Karpathy joins Anthropic to teach Claude to improve itself without humans
  • Claude, you're right ...that was hard requirement... and i skipped it!
  • Word on the street
  • Waiting for Qwen 3.7 open weight... The new King has arrived...
  • So, what is Yann LeCun's "World Models" and JEPA and is it Really a Replacement for LLMs?
  • Heretic has been served a legal notice by Meta, Inc.
  • OpenAI claims a general-purpose reasoning model found a counterexample to Erdos's unit-distance bound [D]
  • Self-hosted sandboxes and MCP tunnels for Claude Managed Agents are now in public beta.
  • Risk reports need to address deployment-time spread of misalignment
  • Mechanistic estimation for expectations of random products
  • Using ChatGPT/Claude for years, but I feel like I’m using them the old way. How do I catch up?
  • The safe-to-dangerous shift is a fundamental problem for eval realism; but also for measuring awareness
  • Empowerment, corrigibility, etc. are simple abstractions (of a messed-up ontology)
  • Clarifying the role of the behavioral selection model
  • A Subquadratic sparse attention engine in pure Rust that runs LLMs on Raspberry Pi Zero, and Pi 5 without CUDA
  • AgentDB: Vector memory that gets smarter every time your agent uses it.
  • Subquadratic Sparse Attention for Edge LLM Inference (7b LLM on Raspberry Pi 5 and 1b on Pi Zero)
  • Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations
  • Mechanistic estimation for wide random MLPs
  • [Linkpost] Interpreting Language Model Parameters
  • Motivated reasoning, confirmation bias, and AI risk theory
  • New ChatGPT Prompting Guide
  • I need this ⌨️
  • [D] Self-Promotion Thread
  • Monthly "Is there a tool for..." Post
  • [D] Monthly Who's Hiring and Who wants to be Hired?
  • Made with ChatGPT Images 2.0
  • NVIDIA Open-Sourced an AI Model for Explorable 3D World Generation
  • Terminal-based oscilloscope with CRT phosphor physics, vibe coded in Nim
  • Meshy MCP Is Here - Big Step for AI 3D Workflows
  • New SOTA OpenSource AI to decompose live2D layers!
  • r/ClaudeAI List of Ongoing Megathreads
  • We heard you - r/ArtificialInteligence is getting sharper
  • MIT Non-AI License
  • Beyond ChatGPT: The Silent Birth of Conscious AI
  • Community Feedback
  • Sora 2 megathread (part 3)
  • Updates for ChatGPT
  • AMA on our DevDay Launches
  • Agentic Flow: Easily switch between low/no-cost AI models (OpenRouter/Onnx/Gemini) in Claude Code and Claude Agent SDK. Build agents in Claude Code, deploy them anywhere. >_ npx agentic-flow
  • Why the Technological Singularity May Be a "Big Nothing"
  • I created an Agentic Coding Competition MCP for Cline/Claude-Code/Cursor/Co-pilot using E2B Sandboxes. I'm looking for some Beta Testers. > npx flow-nexus@latest
  • "Intelligenza Artificiale for Artificial Intelligence Research and Development"
  • Ask HN: Is the rate of progress in AI exponential?

Reading List

No saved articles yet. Click the ☆ next to an article to save it.

Keyboard Shortcuts

j / k
Next / previous article
h / l
Previous / next category
Enter
Open in reader panel
o
Open in new tab
s
Save / unsave to reading list
/
Focus search
c
Toggle compact / detailed
t
Toggle dark / light theme
15
Jump to category
g / G
Top / bottom of page
X
Clear all saved data
Esc
Close search / panels
In Reader Panel
j / k
Scroll article
h / l
Previous / next article
g / G
Top / bottom of article
o
Open in new tab
q
Close reader