Volterra Generative Models
Problem The paper addresses the limitations of existing score-based diffusion models, which typically rely on Brownian perturbations that impose memoryless noising. This work is particularly relevant as it presents a...
38 primary articles · 6 secondary mentions
Problem The paper addresses the limitations of existing score-based diffusion models, which typically rely on Brownian perturbations that impose memoryless noising. This work is particularly relevant as it presents a...
Problem This work addresses the gap in open-source spatial question answering (SQA) systems for service robots navigating long egocentric routes. Existing methods predominantly rely on closed-source models like GPT-4o, which...
Problem The deployment of Spiking Neural Networks (SNNs) is hindered by the challenge of efficiently managing their inherent parallelism on physical hardware. This work addresses a significant gap in the...
Problem The paper addresses the gap in simulating credible rainfall conditions for autonomous-driving perception tests, which is crucial for identifying system boundaries and supporting Safety of the Intended Functionality (SOTIF)...
Problem The paper addresses the gap in the literature regarding the integration of stochastic differential equations (SDEs) in generative modeling, particularly the challenge of defining a precise distillation procedure for...
Problem This work addresses the limitations of existing reward backpropagation methods in text-to-image flow matching models, particularly the inefficiencies caused by storing activations across the full sampling trajectory and the...
Problem The paper addresses the limitations of existing multi-token prediction (MTP) methods in large language models (LLMs), which suffer from a fundamental architectural flaw: the competition between the MTP head...
Problem The paper addresses the reliability risks associated with using large language models (LLMs) for generating solver code in finite element simulations, particularly in multi-physics contexts. While LLMs can streamline...
Problem Reconstructing local stress fields in heterogeneous microstructures subjected to non-linear, history-dependent loading is a significant computational challenge in multi-scale simulations. Existing methods often struggle with the scale mismatch between...
Problem Generating realistic financial time series is a significant challenge due to the limited availability of historical data, which often leads to overfitting, particularly in adversarial training scenarios. Existing methods...
Problem Current feature attribution methods, particularly path-based techniques like Integrated Gradients, face limitations in their reliance on input-space trajectories. These methods lack control over the resolution of feature queries, leading...
Problem Backdoor attacks in Large Language Models (LLMs) pose significant security risks, as they allow adversaries to manipulate model outputs through specific triggers. Existing defenses primarily focus on known backdoors,...
Problem This paper addresses the challenge of embedding irregular and asynchronous data into continuous-time models, specifically focusing on Neural Controlled Differential Equations (NCDEs). Traditional methods rely on interpolation or imputation...
Problem This paper addresses the limitations of existing radiology report generation (RRG) systems, which typically rely on a single-path generation approach using multimodal large language models (MLLMs). The authors identify...
ClickHouse, a rapidly growing database provider, has announced that it has tripled its annualized revenue to $250 million, positioning itself for a potential initial public offering (IPO) within the next...
Problem This preprint addresses the limitations of existing estimators for Shapley and Banzhaf interactions, which are critical for understanding complex dynamics in machine learning applications. Current methods often struggle with...
Problem This paper addresses a significant gap in the performance of Gaussian Denoising Diffusion Probabilistic Models (DDPMs) by focusing on the path-space KL divergence between the exact reverse chain and...
Problem This paper addresses the limitations of existing subword tokenization methods, specifically focusing on the inefficiencies in token count and vocabulary selection. Current methods like Byte Pair Encoding (BPE), WordPiece,...
Problem This tutorial addresses the gap in understanding the theoretical foundations of diffusion models by framing them within the context of differential equations. It is particularly relevant as it is...
The Path, a new venture co-founded by motivational speaker Tony Robbins and former executives from Calm, is making waves in the AI therapy space with its innovative approach to mental...
Problem This paper addresses the limitations of natively trained spiking language models in achieving Transformer-like language quality, stable multi-domain pre-training, and high activation sparsity. The authors present SymbolicLight V1, a...
Problem This paper addresses the limitations of existing inference-time guidance techniques in diffusion-based generative models, which often require repeated score or gradient evaluations. These methods introduce bias and high computational...
Problem This preprint addresses a gap in psycholinguistic literature regarding the incremental processing of linguistic input, specifically focusing on “noisy-channel garden-path” sentences. These sentences initially appear grammatically correct but later...
Problem This preprint addresses the challenge of negative transfer in multi-physics foundation models, particularly in the context of scientific machine learning (SciML). The simultaneous co-training of disparate partial differential equation...
Problem This paper addresses the limitations of traditional autonomous UAV search missions, which typically rely on geometric coverage patterns that neglect the semantic context of the target objects. This oversight...
Problem This paper addresses the computational inefficiency of existing Gromov-Wasserstein (GW) solvers, particularly in high-dimensional spaces. The authors propose a novel approach, min Generalized Sliced Gromov-Wasserstein (min-GSGW), which leverages generalized...
AI startup Recursive has made a significant entrance into the market, unveiling its ambitious plans to develop self-improving artificial intelligence with a substantial funding boost of $650 million. This announcement...
Problem This paper addresses the challenge of optimal path selection in hybrid action spaces for Computer Use Agents (CUAs), which can perform both atomic GUI actions and high-level tool calls....
Problem This paper addresses the challenge of anonymous multi-agent path finding (MAPF) on finite, connected graphs, a problem that has significant implications in robotics and automated systems. The authors identify...
Problem This paper addresses the limitations of traditional self-distillation in reinforcement learning (RL) for large language models (LLMs), particularly in scenarios where the teacher model’s guidance can suppress the student’s...
Problem This paper addresses the limitations of self-distillation (SD) in autoregressive large language models (LLMs), particularly the challenges posed by free-form self-generated trajectories, task-dependent correctness, and the instability of plausible...
Problem This preprint addresses the significant gap in the deployment of large language models (LLMs) on consumer-grade hardware, specifically personal computers, which are often underutilized for AI workloads due to...
Problem This preprint addresses the inefficiencies in sampling from diffusion models, which typically require numerous iterative steps to transform samples from a simple noise distribution into a target distribution. The...
Problem This paper addresses the theoretical underpinnings of the Generative Modeling via Drifting (GMD) framework proposed by Deng et al. (2026), which is currently a preprint and unreviewed. The authors...
Problem This paper addresses the challenge of real-time double-directional beam management for vehicle-to-everything (V2X) connectivity in millimeter-wave (mmWave) networks. Existing methods are hindered by high training overhead and limited generalization...
Problem This paper addresses the lack of systematic empirical evaluations of retrieval strategies in the context of Retrieval-Augmented Generation (RAG) for biomedical applications. While RAG has been established as a...
Problem This paper addresses the gap in the detection of multi-turn prompt injection attacks on large language models (LLMs), specifically focusing on covert attacks where individual turns appear benign. Existing...
Problem This paper addresses the challenge of efficient dynamic model merging for multi-task adaptation, specifically focusing on the storage overhead associated with maintaining independent parameters for each task. Existing dynamic...