ByteDance Launches iLLaDA, an 8B Diffusion Language Model Competing with Qwen2.5
ByteDance has introduced iLLaDA, an 8 billion parameter diffusion language model that matches the performance of Qwen2.5 at the base level. Researchers from Renmin University and ByteDance noted that iLLaDA...