62 primary articles · 8 secondary mentions

Primary coverage

Major news

EU Allocates €30 Billion for AI Gigafactories Amid U.S. Spending Surge

The European Commission has announced a funding initiative of up to €30 billion for AI gigafactories. In stark contrast, major U.S. tech companies are projected to spend $600 billion on...

Jul 31, 2026

Major news

US Navy's AI Strategy Prioritizes Speed Over Alignment Risks

The US Department of the Navy has signed a strategy aimed at ‘weaponizing’ data and AI, emphasizing that moving too slowly poses greater risks than ‘imperfect alignment.’ This strategy includes...

Jul 18, 2026

Major news

Kimi's K3 Model Features 2.8 Trillion Parameters, Releases July 27

Kimi’s upcoming K3 model boasts 2.8 trillion parameters and a context capacity of one million tokens. Scheduled for full weight release on July 27, K3 is significantly pricier than its...

Jul 16, 2026

Major research

UK's AI Security Institute finds standard benchmarks systematically underestimate what AI agents can actually do

The UK’s AI Security Institute (AISI) conducted a study analyzing seven standard benchmarks used to evaluate AI agents, revealing that these benchmarks systematically underestimate the true capabilities of AI systems....

Jul 3, 2026

Notable news

Bridgewater's Finance Tests Show GPT and Claude's Limitations on Accuracy

Bridgewater’s finance tests revealed that both GPT and Claude achieved an accuracy rate of 84.7 percent. In contrast, the Qwen3-235B model, developed by the startup Thinking Machines Lab and founded...

Jul 3, 2026

Notable research

Sumi: Open Uniform Diffusion Language Model from Scratch

Problem This work addresses the lack of large-scale pretrained uniform diffusion language models (UDLMs) in the literature, which hampers the understanding of their scaling behavior and generation dynamics. Prior to...

Jun 17, 2026 arXiv code Mengyu Ye +5

Notable research

The Value Axis: Language Models Encode Whether They're on the Right Track

Problem This work addresses the gap in understanding how language models internally assess the value of their ongoing strategies, particularly in the context of reinforcement learning. The authors investigate whether...

Jun 15, 2026 arXiv code Nick Jiang +2

Major research

ActiveSAM: Image-Conditional Class Pruning for Fast and Accurate Open-Vocabulary Segmentation

Problem This paper addresses the inefficiency of applying the Segment Anything Model 3 (SAM 3) directly to open-vocabulary semantic segmentation (OVSS). The authors highlight that traditional methods require full-resolution decoding...

Jun 15, 2026 arXiv code Tran Dinh Tien +1

Notable research

AI coding agents find the right file but miss the exact lines that matter, study shows

Problem This paper addresses a significant gap in the evaluation of AI coding agents, specifically their ability to not only locate relevant files but also identify critical lines of...

Jun 14, 2026

Notable news

Open model Kimi K2.7 Code undercuts GPT-5.5 and Claude by up to 12x on price per token

Moonshot AI has launched Kimi K2.7 Code, an open-weights model designed for programming tasks that boasts a staggering 1 trillion parameters. This release is particularly significant as it positions Kimi...

Jun 13, 2026

Notable research

Chatbots Keep Telling Stories About Lighthouse Keeper 'Elias Thorne'. We Might Know Why

Problem This work addresses the unexplained prevalence of specific narrative themes in large language models (LLMs), particularly the recurring character of ‘Elias Thorne,’ a lighthouse keeper. The authors highlight a...

Jun 11, 2026

Notable research

Different Layers, Different Manifolds: Module-Wise Weight-Space Geometry in Transformer Optimization

Problem This work addresses the gap in understanding how different transformer modules may benefit from distinct weight-space geometries during optimization. The authors highlight that existing literature typically applies uniform manifold...

Jun 11, 2026 arXiv code Kirato Yoshihara

Notable research

Which Models Are Our Models Built On? Auditing Invisible Dependencies in Modern LLMs

Problem The paper addresses the lack of transparency in the dependency structures of modern large language models (LLMs), which often rely on other models for data generation, filtering, and evaluation....

Jun 10, 2026 arXiv code Sanjay Adhikesaven +2

Notable news

Jedify raises $24M to help companies arm AI agents with context on their business

Jedify has successfully raised $24 million in a recent funding round, led by Norwest, with participation from S Capital VC, Cerca Partners, Oceans Ventures, and strategic investor Snowflake Ventures. This...

Jun 10, 2026

Major research

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

Problem This work addresses the gap in autonomous research capabilities, specifically the lack of frameworks that can effectively manage long-term research processes without human intervention. The authors propose a novel...

Jun 10, 2026 arXiv code Jiajie Jin +5

Major research

FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model

Problem This paper addresses the critical shortage of trained sonographers in low- and middle-income countries, where over half of pregnant women lack access to skilled ultrasound services. Current deep learning...

Jun 9, 2026 arXiv code Mahmood Alzubaidi +5

Notable research

PsychoSafe: Eliciting Psychologically-Informed Refusals in Large Language Models

Problem This work addresses the gap in the capability of large language models (LLMs) to handle requests that necessitate refusals, particularly in high-risk scenarios involving crisis or coercion. Traditional refusal...

Jun 8, 2026 arXiv code Gianluca Barmina +5

Notable research

OpenBibleTTS: Large-Scale Speech Resources and TTS Models for Low-Resource Languages

Problem The paper addresses the significant gap in text-to-speech (TTS) capabilities for low-resource languages, which are often overlooked in favor of high-resource languages. Existing TTS models predominantly focus on a...

Jun 8, 2026 arXiv code David Guzmán +5

Notable research

OpenOpt: An Open-Source SRAM Optimizer Based on Equivalent Circuit Model

Problem This work addresses the lack of efficient optimization frameworks for SRAM design that simultaneously consider both architectural parameters and transistor sizing. Existing methods often optimize these aspects in isolation,...

Jun 8, 2026 arXiv code Yikai Wang +5

Major news

Is this the dawn of the Tokenpocalypse?

The AI landscape is on the brink of significant change as several major companies prepare for public offerings. This shift is crucial as it could reshape market dynamics and investor...

Jun 7, 2026

Major news

Malicious Browser Add-Ons Target ChatGPT, Claude, Copilot, Gemini, and DeepSeek Users - CyberSecurityNews

Recent reports highlight a concerning trend where malicious browser add-ons are specifically targeting users of prominent AI tools, including ChatGPT, Claude, Copilot, Gemini, and DeepSeek. This surge in cyber threats...

Jun 5, 2026

news

Are AI chatbots making us lose control of our brains?

Recent discussions at SXSW London have spotlighted the potential cognitive impacts of AI chatbots, with psychologist Gloria Mark from the University of California, Irvine, emphasizing the long-term effects of digital...

Jun 5, 2026

Notable research

Pretraining Recurrent Networks without Recurrence

Problem The paper addresses the limitations of standard backpropagation through time (BPTT) in training recurrent neural networks (RNNs), particularly its sequential nature, which restricts parallelism, and its susceptibility to vanishing...

Jun 4, 2026 arXiv code Akarsh Kumar +1

Notable research

Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents

Problem The paper addresses a significant gap in the literature regarding the extraction of semantically meaningful visual artifacts from institutional documents, which contain critical operational and analytical information. Current methodologies...

Jun 4, 2026 arXiv code AJ Carl P. Dy +1

Major news

AI can now coach amateur virologists, and top tech leaders want Congress to act on DNA security

In a significant development for the intersection of artificial intelligence and biotechnology, leading tech figures, including Sam Altman and Dario Amodei, are calling on the U.S. government to take urgent...

Jun 4, 2026

Notable research

MetaPoint: Unlocking Precise Spatial Control in Agentic Visual Generation

Problem Generative visual models have historically struggled with precise spatial control, particularly in mapping numerical coordinates to 2D image canvases. This limitation hinders the ability to generate images with specific...

Jun 3, 2026 arXiv code Dewei Zhou +5

Notable news

Companies Are Using Reddit to Manipulate ChatGPT and Google AI Search

Recent reports indicate that peptide companies are engaging in questionable tactics by flooding the biohackers subreddit to influence the outputs of ChatGPT and Google AI Search. This manipulation raises significant...

Jun 3, 2026

Notable news

Turing Award winner Richard Sutton says pure generative AI can't do real science

Turing Award winner Richard Sutton has raised significant concerns about the capabilities of conventional generative AI, asserting that it fundamentally lacks the ability to evaluate its own outputs. This limitation,...

Jun 1, 2026

Major news

MiniMax M3 debuts, eclipsing GPT-5.5 and Gemini 3.1 Pro on key benchmark performance for just 5-10% of the cost - Venturebeat

The AI landscape is witnessing a significant shift with the launch of MiniMax M3, a new model that reportedly surpasses both GPT-5.5 and Gemini 3.1 Pro in key benchmark performance....

Jun 1, 2026

Notable research

How's it going? Reinforcement learning in language models recruits a functional welfare axis

Problem This preprint addresses the gap in understanding how reinforcement learning (RL) influences the internal representations of language models, particularly in relation to a functional welfare axis. Prior literature has...

May 28, 2026 arXiv code Andy Q Han +2

Major news stocks

Robinhood lets AI agents trade shares and make credit card purchases for customers

Robinhood has introduced a groundbreaking feature that allows customers to connect AI agents, such as Anthropic’s Claude, to their investment accounts through its new Managed Customer Platform (MCP). This innovation...

May 27, 2026

Notable research

OpenURMA: A Clean-Room Open Implementation of the Unified Bus Protocol

Problem This paper addresses the limitations of modern datacenter RDMA (Remote Direct Memory Access) implementations, specifically focusing on the inefficiencies introduced by the Queue Pair over PCIe abstraction inherited from...

May 27, 2026 arXiv code Bojie Li

Notable research

Retrying vs Resampling in AI Control

Problem This preprint addresses the limitations of existing AI control mechanisms, specifically focusing on the practice of retrying in AI coding scaffolds like Claude Code and Codex. The authors argue...

May 25, 2026 arXiv code James Lucassen +1

Notable research

Everything at Every Scale: Scale-Invariant Diffusion with Continuous Super-Resolution

Problem This preprint addresses the gap in the literature regarding the integration of image generation and super-resolution tasks within a unified framework. Traditional approaches often require distinct architectures and retraining...

May 25, 2026 arXiv code Zixin Jessie Chen +5

Notable research

Evaluating Commercial AI Chatbots as News Intermediaries

Problem This preprint addresses the lack of systematic evaluation of AI chatbots as news intermediaries, particularly in their ability to accurately handle emerging facts across diverse languages and regions. Prior...

May 21, 2026 arXiv code Mirac Suzgun +5

Notable research

Cyber-Physical Anomaly Detection in IoT-Enabled Smart Grids Using Machine Learning and Metaheuristic Feature Optimization

Problem This preprint addresses the gap in effective anomaly detection within IoT-enabled smart grids, particularly distinguishing between physical incidents and cyber-physical attacks. The increasing complexity of smart grid infrastructures, characterized...

May 21, 2026 arXiv code Adis Alihodžić +2

Major research

Open-source LLMs administer maximum electric shocks in a Milgram-like obedience experiment

Problem This preprint addresses the gap in understanding the behavior of large language models (LLMs) when subjected to authority pressure in decision-making contexts, particularly in high-stakes environments. While LLMs are...

May 20, 2026 arXiv code Roland Pihlakas +1

Notable research

LamPO: A Lambda Style Policy Optimization for Reasoning Language Models

Problem This paper addresses the limitations of existing reinforcement learning with verifiable rewards (RLVR) methods, particularly the group-relative objectives like GRPO, which aggregate responses into scalar statistics. This aggregation discards...

May 20, 2026 arXiv code Zhe Yuan +5

Major news

US to safety test new AI models from Google, Microsoft, xAI - MSN

The U.S. government is set to conduct safety tests on new AI models developed by tech giants Google, Microsoft, and xAI. This initiative comes amid growing concerns over the potential...

May 18, 2026

Notable research

Spiker-LL: An Energy-Efficient FPGA Accelerator Enabling Adaptive Local Learning in Spiking Neural Networks

Problem This paper addresses the gap in deploying adaptive intelligence at the edge, particularly focusing on the high computational and energy costs associated with training neural models. While Spiking Neural...

May 18, 2026 arXiv code Alessio Caviglia +3

Notable news

Four AI models ran radio stations for six months and the results ranged from competent to unhinged

In a groundbreaking experiment, Andon Labs has allowed four distinct AI models to autonomously operate their own radio stations for six months, revealing a spectrum of personalities and operational styles....

May 17, 2026

Notable news

What the jury will actually decide in the case of Elon Musk vs. Sam Altman

In a highly anticipated legal showdown, Elon Musk and Sam Altman find themselves at the center of a landmark case that could reshape the AI landscape. The trial, which has...

May 14, 2026

Major news

What happens when AI starts building itself?

Richard Socher, a prominent figure in the AI landscape, has launched a groundbreaking startup with a hefty $650 million backing, aiming to create an AI capable of self-research and continuous...

May 14, 2026

Notable news

Podcast: The Chinese Deepfake Software Powering Scams

A recent podcast episode delves into Haotian AI, a Chinese-language deepfake software that has become a tool for various scams. The discussion highlights the growing sophistication of AI-generated content and...

May 13, 2026

Notable research

Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

Problem This paper addresses the inadequacy of existing computer-use agents (CUAs) in handling complex, low-frequency interactions within graphical user interfaces (GUIs). Despite advancements in models like GPT-5.4 and Claude, their...

May 12, 2026 arXiv code Miaosen Zhang +5

Notable research

A Causal Language Modeling Detour Improves Encoder Continued Pretraining

Problem This preprint addresses the gap in the literature regarding the adaptation of encoder models to new domains, specifically in the context of biomedical text. The standard practice of continuing...

May 12, 2026 arXiv code Rian Touchent +1

Notable research

Pretraining Exposure Explains Popularity Judgments in Large Language Models

Problem This preprint addresses the gap in understanding the origins of popularity bias in large language models (LLMs). While previous literature has suggested that LLMs exhibit preferences for well-known entities...

May 12, 2026 arXiv code Jamshid Mozafari +2

Major news

The U.S. quietly deleted a page announcing AI security testing deals with Google, Microsoft, and xAI - qz.com

In a surprising move, the U.S. government has removed a webpage that previously announced AI security testing partnerships with major tech companies, including Google, Microsoft, and xAI. This deletion raises...

May 12, 2026

Notable research

LoKA: Low-precision Kernel Applications for Recommendation Models At Scale

Problem This paper addresses the limited adoption of low-precision arithmetic, specifically FP8, in large recommendation models (LRMs), despite its successful application in large language models (LLMs). The authors highlight that...

May 11, 2026 arXiv code Liang Luo +5

Major news stocks

ByteDance plans over $30 billion for AI expansion, bets big on Chinese chips

ByteDance is significantly ramping up its investment in artificial intelligence, announcing plans to allocate over 200 billion yuan (approximately $30 billion) for AI development by 2026. This marks a 25...

May 10, 2026

Also mentioned

Notable news OpenAI

Zhipu AI Launches ZCode with GLM-5.2 to Compete with Claude Code and Codex

Jul 6, 2026

news xAI

Trump DOJ supports Musk-owned data center in suit by NAACP - E&E News by POLITICO

Jun 16, 2026

Major research

FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model

Jun 9, 2026 arXiv code Mahmood Alzubaidi +5

Notable news xAI