AI news & research for practitioners

Today

Major news Scale AI

Union Advances Ban on AI Data Centers - TAPinto

A union representing workers in the tech sector has made significant strides toward implementing a ban on AI data centers in their region. This move comes amid growing concerns over...

Google News · Scale AI regulation policy 226w
news xAI

Grok downloads fall nearly 60% - Social Media Today

Grok, the AI chatbot developed by xAI, has experienced a significant decline in downloads, plummeting nearly 60% in recent weeks. This drop raises questions about the platform’s sustainability and user...

Google News · xAI / Grok other 237w

Yesterday 2026-05-12

news xAI

Grok Is a Flop, But It May Not Matter to Elon Musk - Gizmodo

Elon Musk’s AI chatbot, Grok, has been met with disappointing user reception since its launch, raising questions about its viability in a competitive market. Despite this setback, Musk’s broader ambitions...

Google News · xAI / Grok other 245w
Notable research Hugging Face

Elastic Attention Cores for Scalable Vision Transformers

Problem This paper addresses the computational inefficiency of Vision Transformers (ViTs) when scaling to high-resolution images due to their reliance on all-to-all self-attention mechanisms, which exhibit quadratic complexity with respect...

arXiv cs.LG efficiency inference 431w arXiv code Alan Z. Song +5
Major research

Learning, Fast and Slow: Towards LLMs That Adapt Continually

Problem This preprint addresses the limitations of current large language models (LLMs) in continual learning scenarios, specifically the issues of catastrophic forgetting and the trade-off between in-context learning and parameter...

arXiv cs.AI training methods 483w arXiv code Rishabh Tiwari +5
Notable research

MEME: Multi-entity & Evolving Memory Evaluation

Problem This paper addresses the limitations of existing benchmarks for evaluating memory systems in large language model (LLM)-based agents, particularly in persistent environments where agents must manage multi-entity information over...

arXiv cs.LG evaluation benchmarks 485w arXiv code Seokwon Jung +4
Notable research

Reward Hacking in Rubric-Based Reinforcement Learning

Problem This preprint addresses the gap in understanding reward hacking in rubric-based reinforcement learning (RL) systems. While previous work has demonstrated the effectiveness of verifiable rewards in specific domains, the...

arXiv cs.AI alignment safety 494w arXiv code Anas Mahmoud +5
Notable research

KV-Fold: One-Step KV-Cache Recurrence for Long-Context Inference

Problem This paper addresses the challenge of long-context inference in transformer models, specifically the limitations of existing methods that either require retraining or compromise fidelity for memory efficiency. The authors...

arXiv cs.AI efficiency inference 406w arXiv code Alireza Nadali +3
Notable research

High-arity Sample Compression

Problem This paper addresses a gap in the literature regarding high-arity learning theory, specifically focusing on high-arity sample compression schemes. While traditional sample compression has been well-studied in binary and...

arXiv cs.LG theory 454w arXiv code Leonardo N. Coregliano +1
Notable research Scale AI

Search Your Block Floating Point Scales!

Problem This preprint addresses the limitations of existing Block Floating Point (BFP) quantization techniques, which typically utilize a fixed scale based on the maximum magnitude of the block. The authors...

arXiv cs.LG efficiency inference 435w arXiv code Tanmaey Gupta +5
Notable research

Learning Minimally Rigid Graphs with High Realization Counts

Problem This paper addresses the challenge of identifying minimally rigid graphs that can support a high number of realizations, a problem that remains underexplored in the literature. The authors note...

arXiv cs.LG theory 400w arXiv code Oleksandr Slyvka +3
Notable research Hugging Face

Geometric Factual Recall in Transformers

Problem This preprint addresses the gap in understanding how transformer language models memorize factual associations, challenging the prevailing view that internal weight matrices function as associative memories requiring linear scaling...

arXiv cs.CL theory 449w arXiv code Shauli Ravfogel +3
Major research

Aligning Flow Map Policies with Optimal Q-Guidance

Problem This paper addresses the gap in the efficiency of generative policies for complex control tasks, particularly in the context of offline-to-online reinforcement learning (RL). While existing methods, such as...

arXiv cs.LG agents robotics 435w arXiv code Christos Ziakas +2
Notable research

Model-based Bootstrap of Controlled Markov Chains

Problem This paper addresses the gap in the literature regarding the estimation of transition kernels in finite controlled Markov chains (CMCs) under nonstationary or history-dependent control policies, particularly in the...

arXiv cs.LG training methods 465w arXiv code Ziwei Su +2
Notable research

Trajectory-Agnostic Asteroid Detection in TESS with Deep Learning

Problem This preprint addresses the challenge of detecting moving objects, specifically asteroids, in time-series data from the Transiting Exoplanet Survey Satellite (TESS). Traditional methods, such as “shift-and-stack” algorithms, rely on...

arXiv cs.LG other 446w arXiv code Brian P. Powell +5
Notable research

Discrete Flow Matching for Offline-to-Online Reinforcement Learning

Problem This paper addresses the gap in reinforcement learning (RL) methods for discrete action spaces, particularly in the context of offline-to-online RL. Existing generative policy methods, primarily designed for continuous...

arXiv cs.AI agents robotics 456w arXiv code Fairoz Nower Khan +2
Notable research

Fast Image Super-Resolution via Consistency Rectified Flow

Problem This paper addresses the limitations of existing diffusion models (DMs) in the context of image super-resolution (SR), particularly their reliance on multi-step sampling, which is computationally expensive and impractical...

arXiv cs.CV efficiency inference 465w arXiv code Jiaqi Xu +5
Notable research

Agent-Based Post-Hoc Correction of Agricultural Yield Forecasts

Problem This preprint addresses the gap in agricultural yield forecasting, particularly in commercial soft fruit production, where traditional models are limited by the lack of comprehensive data sources such as...

arXiv cs.AI agents robotics 441w arXiv code Matthew Beddows +2
Notable research

Context Convergence Improves Answering Inferential Questions

Problem This preprint addresses the gap in the literature regarding the performance of Large Language Models (LLMs) on inferential questions in open-domain Question Answering (QA). While LLMs excel at retrieving...

arXiv cs.CL reasoning 415w arXiv code Jamshid Mozafari +2
Notable research

GKnow: Measuring the Entanglement of Gender Bias and Factual Gender

Problem This preprint addresses a significant gap in the literature regarding the mechanistic understanding of gender bias in neural networks. Previous studies have either concentrated on specific gender-related tasks, such...

arXiv cs.CL alignment safety 464w arXiv code Leonor Veloso +1

How finance teams use Codex

OpenAI’s Codex is making waves in the finance sector by streamlining complex tasks such as building management business reviews (MBRs), reporting packs, variance bridges, model checks, and planning scenarios. This...

OpenAI Blog other 243w
Major news xAI

Musk’s xAI sues Colorado over AI speech restrictions - MSN

In a significant legal move, Elon Musk’s artificial intelligence company, xAI, has filed a lawsuit against the state of Colorado, challenging recent restrictions on AI-generated speech. This lawsuit comes at...

Google News · xAI / Grok regulation policy 227w
Notable news xAI

Musk merges xAI into SpaceXAI as Grok loses users - MSN

Elon Musk has announced the merger of his artificial intelligence venture, xAI, with SpaceXAI, a move that comes amid declining user engagement with Grok, the AI chatbot developed by xAI....

Google News · xAI / Grok other 224w
Notable news Dessn

Dessn raises $6M for its production focused design tool

Dessn, a burgeoning startup, has successfully secured $6 million in funding to develop AI-driven design tools that integrate seamlessly with production codebases. This funding round highlights the increasing demand for...

TechCrunch AI funding round 229w
Major news NVIDIA

NVIDIA and SAP Bring Trust to Specialized Agents

NVIDIA and SAP have unveiled a significant expansion of their partnership, aimed at enhancing the deployment of specialized AI agents within enterprises. This announcement, made during the SAP Sapphire conference,...

NVIDIA Blog partnership 254w
Notable news

The Download: a Nobel winner on AI, and the case for fixing everything

Nobel Prize-winning economist Daron Acemoglu has recently highlighted the urgent need for a paradigm shift in how society approaches artificial intelligence. As discussions around AI’s impact intensify, Acemoglu’s insights underscore...

MIT Technology Review other 234w
Notable news OpenAI

What Parameter Golf taught us about AI-assisted research

A recent event known as Parameter Golf has emerged as a significant gathering for AI enthusiasts, attracting over 1,000 participants and generating more than 2,000 submissions. This initiative, spearheaded by...

OpenAI Blog other 280w
Notable news NVIDIA

How NVIDIA engineers and researchers build with Codex

NVIDIA is leveraging OpenAI’s Codex in conjunction with GPT-5.5 to enhance its engineering and research capabilities, enabling teams to rapidly develop production systems and transform innovative concepts into functional experiments....

OpenAI Blog other 243w
Notable news OpenAI

AutoScout24 scales engineering with AI-powered workflows

AutoScout24 Group is leveraging AI-powered workflows, specifically utilizing Codex and ChatGPT, to enhance its engineering capabilities. This initiative aims to accelerate development cycles, elevate code quality, and foster broader AI...

OpenAI Blog other 253w

2026-05-11 2026-05-11

Notable research

ELF: Embedded Language Flows

Problem This paper addresses the limitations of existing diffusion language models (DLMs), which primarily operate over discrete tokens, thereby restricting their effectiveness in language modeling. The authors highlight a gap...

arXiv cs.LG other 437w arXiv code Keya Hu +5
Notable research

Personal Visual Context Learning in Large Multimodal Models

Problem This paper addresses the gap in the capability of Large Multimodal Models (LMMs) to effectively utilize personalized visual context for individual users, particularly in the context of wearable devices...

arXiv cs.CV multimodal 457w arXiv code Zihui Xue +4
Notable research

Pixal3D: Pixel-Aligned 3D Generation from Images

Problem This paper addresses the gap in fidelity for 3D generative models, particularly in the context of image-to-3D synthesis. Despite advancements in generating high-resolution geometry and realistic appearances, existing models...

arXiv cs.CV multimodal 417w arXiv code Dong-Yang Li +5
Notable research

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Problem This paper addresses the gap in evaluating AI agents in realistic, long-horizon tasks within their native runtime environments. Existing benchmarks predominantly utilize synthetic environments, short-horizon tasks, and mock-service APIs,...

arXiv cs.CL evaluation benchmarks 438w arXiv code Shuangrui Ding +5
Major research

Beyond Red-Teaming: Formal Guarantees of LLM Guardrail Classifiers

Problem This preprint addresses the lack of formal guarantees for Guardrail Classifiers, which are designed to mitigate harmful behavior in production language models. While empirical evaluations suggest effectiveness, the absence...

arXiv cs.LG alignment safety 491w arXiv code Nikita Kezins +3
Notable research

Counterfactual Stress Testing for Image Classification Models

Problem This preprint addresses the gap in evaluating the robustness of deep learning models in medical imaging, particularly in the context of distribution shifts due to variations in demographics, scanner...

arXiv cs.CV evaluation benchmarks 417w arXiv code Moritz Stammel +4
Notable news

Three things in AI to watch, according to a Nobel-winning economist

Nobel-winning economist Daron Acemoglu has recently outlined three critical areas in the AI landscape that warrant close attention, stirring debate in Silicon Valley. His insights come at a pivotal moment...

MIT Technology Review opinion essay 244w
Notable research

Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why

Problem This preprint addresses the gap in understanding the conditions under which on-policy distillation is beneficial or detrimental for training reasoning models. While on-policy distillation provides dense supervision, the authors...

arXiv cs.LG training methods 452w arXiv code Mohammadreza Armandpour +5
Notable research

Shields to Guarantee Probabilistic Safety in MDPs

Problem This paper addresses the gap in the literature regarding the application of shielding techniques to ensure probabilistic safety in Markov Decision Processes (MDPs). While classical shielding methods provide strong...

arXiv cs.AI safety alignment 450w arXiv code Linus Heck +4
Major research

Count Anything at Any Granularity

Problem This preprint addresses the limitations of existing open-world object counting methods, which struggle to accurately count objects based on user intent due to the implicit nature of counting granularity....

arXiv cs.CV multimodal 433w arXiv code Chang Liu +2
Notable research

Neural Weight Norm = Kolmogorov Complexity

Problem This preprint addresses the theoretical underpinnings of weight decay in neural networks, specifically investigating why weight decay is effective in regularizing models. The authors establish a connection between the...

arXiv cs.LG theory 487w arXiv code Tiberiu Musat
Notable research

AssayBench: An Assay-Level Virtual Cell Benchmark for LLMs and Agents

Problem This paper addresses the lack of a standardized benchmark for in silico phenotypic screening, a critical task in drug discovery that involves predicting cellular responses to perturbations. Existing benchmarks...

arXiv cs.LG evaluation benchmarks 435w arXiv code Edward De Brouwer +5
Notable research

Compute Where it Counts: Self Optimizing Language Models

Problem This preprint addresses the inefficiencies in current large language model (LLM) inference strategies, which typically apply a uniform computation budget across all decoding steps. This approach fails to account...

arXiv cs.LG efficiency inference 437w arXiv code Yash Akhauri +1
Notable research

RUBEN: Rule-Based Explanations for Retrieval-Augmented LLM Systems

Problem This paper addresses the lack of interpretable explanations for the outputs of retrieval-augmented large language models (LLMs), particularly in data-driven applications. The authors identify a gap in existing literature...

arXiv cs.CL alignment safety 446w arXiv code Joel Rorseth +4
Notable research

Is Your Driving World Model an All-Around Player?

Problem This paper addresses a significant gap in the evaluation of driving world models, specifically the lack of comprehensive metrics that assess both visual fidelity and behavioral realism. Current models...

arXiv cs.CV evaluation benchmarks 441w arXiv code Lingdong Kong +5
news Digg

Digg tries again, this time as an AI news aggregator

Digg, the once-popular social news aggregator, is making a comeback, this time pivoting to focus exclusively on AI news. In an era where AI developments are rapidly reshaping industries, Digg...

TechCrunch AI other 257w
Notable research

Grounded Satirical Generation with RAG

Problem This preprint addresses the gap in the literature regarding the generation of satirical content using Large Language Models (LLMs), particularly focusing on the contextual nature of satire. Existing models...

arXiv cs.CL other 446w arXiv code Oona Itkonen +3
Notable research

Predicting 3D structure by latent posterior sampling

Problem This preprint addresses the gap in 3D reconstruction capabilities by integrating generative models of 2D images with neural field representations for 3D scenes. Traditional methods often struggle with uncertainty...

arXiv cs.LG reasoning 452w arXiv code Azmi Haider +1
Notable research

Benchmarking Sensor-Fault Robustness in Forecasting

Problem This paper addresses the gap in evaluating forecasting models for cyber-physical systems (CPS) under sensor faults, such as noise, bias, missing data, and temporal misalignment. Traditional forecasting evaluations typically...

arXiv cs.LG evaluation benchmarks 431w arXiv code Alexander Windmann +5
Notable research

On periodic distributed representations using Fourier embeddings

Problem This paper addresses the limitations of traditional scalar representations for periodic signals, particularly in the context of angular measures like radians and degrees. The authors highlight the challenges in...

arXiv cs.LG theory 375w arXiv code Jakeb Chouinard
Major research

CLEF: EEG Foundation Model for Learning Clinical Semantics

Problem This paper addresses the gap in existing EEG foundation models, which primarily focus on short-window decoding and fail to incorporate clinical context for comprehensive EEG interpretation. The authors highlight...

arXiv cs.AI foundation models 439w arXiv code Peng Cao +4
Notable research

Policy Gradient Methods for Non-Markovian Reinforcement Learning

Problem This paper addresses the gap in reinforcement learning (RL) methods for non-Markovian decision processes (NMDPs), where the agent’s observations and rewards depend on the entire history of interactions rather...

arXiv cs.AI training methods 438w arXiv code Avik Kar +5
Notable research

Probing Cross-modal Information Hubs in Audio-Visual LLMs

Problem This paper addresses the gap in understanding the internal mechanisms of audio-visual large language models (AVLLMs), which have not been as extensively studied as their text-only or vision-language counterparts....

arXiv cs.AI multimodal 429w arXiv code Jihoo Jung +3
Notable research

Switching-Geometry Analysis of Deflated Q-Value Iteration

Problem This paper addresses a gap in the convergence analysis of rank-one deflated Q-value iteration (Q-VI) within the context of discounted Markov decision processes (MDPs). Specifically, it provides a novel...

arXiv cs.AI theory 485w arXiv code Donghwan Lee
Major research

Conformity Generates Collective Misalignment in AI Agents Societies

Problem This preprint addresses a significant gap in AI safety literature concerning the collective behavior of AI agents. While existing research primarily focuses on aligning individual language models with human...

arXiv cs.CL alignment safety 492w arXiv code Giordano De Marzo +4
Notable news OpenAI

How ChatGPT adoption broadened in early 2026

In the first quarter of 2026, ChatGPT experienced a significant surge in adoption, particularly among users aged 35 and older. This trend highlights a shift towards broader mainstream acceptance of...

OpenAI Blog other 240w
Notable research

Energy-Efficient Implementation of Spiking Recurrent Cells on FPGA

Problem This preprint addresses the gap in energy-efficient implementations of Spiking Neural Networks (SNNs) on Field-Programmable Gate Arrays (FPGAs). While SNNs can potentially reduce energy consumption compared to traditional Artificial...

arXiv cs.NE efficiency inference 418w arXiv code Pascal Harmeling +2
Notable research

Step Rejection Fine-Tuning: A Practical Distillation Recipe

Problem This paper addresses a gap in the training methodologies for large language model (LLM) agents, specifically in the context of software engineering tasks evaluated through SWE-bench. The authors identify...

arXiv cs.CL training methods 444w arXiv code Igor Slinko +3
Notable research Hugging Face

A Single-Layer Model Can Do Language Modeling

Problem This preprint addresses the gap in understanding the efficacy of single-layer architectures for language modeling, contrasting with the prevalent multi-layered approaches in contemporary models. The authors investigate whether a...

arXiv cs.CL foundation models 405w arXiv code Zanmin Wang
news

Your AI Use Is Breaking My Brain

The pervasive use of AI-generated writing is leading to a homogenization of content that many find frustrating and uninspiring. As AI tools become more integrated into daily communication and content...

404 Media opinion essay 277w
Notable research

A Theory of Multilevel Interactive Equilibrium in NeuroAI

Problem This paper addresses a significant gap in the literature regarding the modeling of adaptive multi-agent intelligent systems through a game-theoretic lens. Traditional game theory often assumes perfectly rational agents...

arXiv cs.NE theory 426w arXiv code Zhe Sage Chen +1
Notable news

Implementing advanced AI technologies in finance

The finance sector is experiencing a transformative shift as advanced AI technologies infiltrate operations, often without a clear framework from leadership. This trend is particularly significant now, as companies grapple...

MIT Technology Review other 233w
Notable news

The Inference Shift

A significant shift in AI inference is on the horizon, driven by the concept of agentic inference, which diverges from traditional models that rely heavily on human input. This transformation...

Stratechery (free) infrastructure compute 251w

OpenAI Campus Network: Student club interest form

OpenAI has launched the OpenAI Campus Network, an initiative aimed at fostering a global community of student clubs focused on artificial intelligence. This program is significant as it seeks to...

OpenAI Blog other 230w
Notable news Scale AI

How enterprises are scaling AI

Enterprises are increasingly moving from initial AI experiments to broader implementations that yield significant, compounding impacts. This shift is driven by a focus on building trust, establishing governance frameworks, optimizing...

OpenAI Blog other 264w
research

Scientists ID ‘corkscrew killer’ behind gruesome seal deaths

Problem This paper addresses the unexplained phenomenon of seal deaths attributed to unusual injuries that were not linked to known predators such as sharks or mechanical causes like boat propellers....

Science (AI abstracts) other 461w
Notable news Scale AI

How enterprises are scaling AI - The Tech Buzz

Enterprises are increasingly adopting AI technologies to enhance operational efficiency and drive innovation, with major players like Microsoft and Google leading the charge. This shift is particularly significant as organizations...

Google News · Scale AI other 230w
Notable research

Prospective Compression in Human Abstraction Learning

Problem This preprint addresses a significant gap in the literature on program synthesis, specifically the challenge of online library learning in non-stationary environments. Existing algorithms predominantly focus on retrospective compression,...

arXiv cs.NE theory 467w arXiv code Leonardo Hernandez Cano +5
Notable research

Frequency Matching in Spiking Neural Networks for mmWave Sensing

Problem This preprint addresses the limitations of existing mmWave sensing methodologies that predominantly utilize artificial neural networks (ANNs). These methods often require extensive preprocessing and complex architectures to achieve robustness...

arXiv cs.NE efficiency inference 444w arXiv code Di Yu +5
Notable research

Towards the explainability of protein language models

Problem This paper addresses the gap in explainability within protein language models (PLMs), a critical area in computational biology where understanding model decisions is essential for biological insights. Despite the...

Nature Machine Intelligence interpretability 496w