Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.LG
arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Machine Learning

Authors and titles for recent submissions

  • Thu, 2 Oct 2025
  • Wed, 1 Oct 2025
  • Tue, 30 Sep 2025
  • Mon, 29 Sep 2025
  • Fri, 26 Sep 2025

See today's new changes

Total of 1554 entries : 1-50 51-100 101-150 151-200 ... 1551-1554
Showing up to 50 entries per page: fewer | more | all

Thu, 2 Oct 2025 (showing first 50 of 241 entries )

[1] arXiv:2510.01185 [pdf, html, other]
Title: Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs
Leyla Mirvakhabova, Babak Ehteshami Bejnordi, Gaurav Kumar, Hanxue Liang, Wanru Zhao, Paul Whatmough
Subjects: Machine Learning (cs.LG)
[2] arXiv:2510.01184 [pdf, html, other]
Title: Temporal Score Rescaling for Temperature Sampling in Diffusion and Flow Models
Yanbo Xu, Yu Wu, Sungjae Park, Zhizhuo Zhou, Shubham Tulsiani
Subjects: Machine Learning (cs.LG)
[3] arXiv:2510.01180 [pdf, html, other]
Title: BroRL: Scaling Reinforcement Learning via Broadened Exploration
Jian Hu, Mingjie Liu, Ximing Lu, Fang Wu, Zaid Harchaoui, Shizhe Diao, Yejin Choi, Pavlo Molchanov, Jun Yang, Jan Kautz, Yi Dong
Comments: 16 pages, 4 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[4] arXiv:2510.01179 [pdf, html, other]
Title: TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments
Zhangchen Xu, Adriana Meza Soria, Shawn Tan, Anurag Roy, Ashish Sunil Agrawal, Radha Poovendran, Rameswar Panda
Comments: 35 pages, 13 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[5] arXiv:2510.01178 [pdf, html, other]
Title: COM-BOM: Bayesian Exemplar Search for Efficiently Exploring the Accuracy-Calibration Pareto Frontier
Gaoxiang Luo, Aryan Deshwal
Comments: Accepted by EMNLP 2025 Main, Code: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[6] arXiv:2510.01175 [pdf, html, other]
Title: On the Benefits of Weight Normalization for Overparameterized Matrix Sensing
Yudong Wei, Liang Zhang, Bingcong Li, Niao He
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Optimization and Control (math.OC); Machine Learning (stat.ML)
[7] arXiv:2510.01169 [pdf, html, other]
Title: Fiaingen: A financial time series generative method matching real-world data quality
Jože M. Rožanec, Tina Žezlin, Laurentiu Vasiliu, Dunja Mladenić, Radu Prodan, Dumitru Roman
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[8] arXiv:2510.01167 [pdf, html, other]
Title: Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards
Yiran Shen, Yu Xia, Jonathan Chang, Prithviraj Ammanabrolu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[9] arXiv:2510.01163 [pdf, other]
Title: How Does the Pretraining Distribution Shape In-Context Learning? Task Selection, Generalization, and Robustness
Waïss Azizian, Ali Hasan
Comments: 52 pages, 12 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[10] arXiv:2510.01161 [pdf, html, other]
Title: Prosperity before Collapse: How Far Can Off-Policy RL Reach with Stale Data on LLMs?
Haizhong Zheng, Jiawei Zhao, Bedi Chen
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[11] arXiv:2510.01159 [pdf, html, other]
Title: Multi-Marginal Flow Matching with Adversarially Learnt Interpolants
Oskar Kviman, Kirill Tamogashev, Nicola Branchini, Víctor Elvira, Jens Lagergren, Nikolay Malkin
Subjects: Machine Learning (cs.LG)
[12] arXiv:2510.01153 [pdf, html, other]
Title: Neural Hamilton--Jacobi Characteristic Flows for Optimal Transport
Yesom Park, Shu Liu, Mo Zhou, Stanley Osher
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[13] arXiv:2510.01137 [pdf, html, other]
Title: Sample-Efficient Differentially Private Fine-Tuning via Gradient Matrix Denoising
Ali Dadsetan, Frank Rudzicz
Subjects: Machine Learning (cs.LG)
[14] arXiv:2510.01136 [pdf, html, other]
Title: TabINR: An Implicit Neural Representation Framework for Tabular Data Imputation
Vincent Ochs, Florentin Bieder, Sidaty el Hadramy, Paul Friedrich, Stephanie Taha-Mehlitz, Anas Taha, Philippe C. Cattin
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[15] arXiv:2510.01135 [pdf, other]
Title: Prompt Curriculum Learning for Efficient LLM Post-Training
Zhaolin Gao, Joongwon Kim, Wen Sun, Thorsten Joachims, Sid Wang, Richard Yuanzhe Pang, Liang Tan
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[16] arXiv:2510.01132 [pdf, html, other]
Title: A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning
Ruiyi Wang, Prithviraj Ammanabrolu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[17] arXiv:2510.01123 [pdf, html, other]
Title: Rethinking Thinking Tokens: LLMs as Improvement Operators
Lovish Madaan, Aniket Didolkar, Suchin Gururangan, John Quan, Ruan Silva, Ruslan Salakhutdinov, Manzil Zaheer, Sanjeev Arora, Anirudh Goyal
Comments: 21 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[18] arXiv:2510.01118 [pdf, html, other]
Title: Breaking the Euclidean Barrier: Hyperboloid-Based Biological Sequence Analysis
Sarwan Ali, Haris Mansoor, Murray Patterson
Subjects: Machine Learning (cs.LG)
[19] arXiv:2510.01116 [pdf, html, other]
Title: Eliciting Chain-of-Thought Reasoning for Time Series Analysis using Reinforcement Learning
Felix Parker, Nimeesha Chan, Chi Zhang, Kimia Ghobadi
Subjects: Machine Learning (cs.LG)
[20] arXiv:2510.01113 [pdf, html, other]
Title: Privacy Preserved Federated Learning with Attention-Based Aggregation for Biometric Recognition
Kassahun Azezew, Minyechil Alehegn, Tsega Asresa, Bitew Mekuria, Tizazu Bayh, Ayenew Kassie, Amsalu Tesema, Animut Embiyale
Subjects: Machine Learning (cs.LG)
[21] arXiv:2510.01111 [pdf, html, other]
Title: Augmenting LLMs for General Time Series Understanding and Prediction
Felix Parker, Nimeesha Chan, Chi Zhang, Kimia Ghobadi
Subjects: Machine Learning (cs.LG)
[22] arXiv:2510.01105 [pdf, html, other]
Title: Geometric Properties of Neural Multivariate Regression
George Andriopoulos, Zixuan Dong, Bimarsha Adhikari, Keith Ross
Comments: 22 pages, 12 figures
Subjects: Machine Learning (cs.LG)
[23] arXiv:2510.01089 [pdf, html, other]
Title: Dynamical system reconstruction from partial observations using stochastic dynamics
Viktor Sip, Martin Breyton, Spase Petkoski, Viktor Jirsa
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[24] arXiv:2510.01083 [pdf, html, other]
Title: Multi-Actor Multi-Critic Deep Deterministic Reinforcement Learning with a Novel Q-Ensemble Method
Andy Wu, Chun-Cheng Lin, Rung-Tzuo Liaw, Yuehua Huang, Chihjung Kuo, Chia Tong Weng
Subjects: Machine Learning (cs.LG)
[25] arXiv:2510.01074 [pdf, html, other]
Title: Predicting Diabetic Retinopathy Using a Two-Level Ensemble Model
Mahyar Mahmoudi, Tieming Liu
Comments: Accepted for presentation at the IISE Annual Conference & Expo 2025, 6 pages, 2 tables, 1 figure
Subjects: Machine Learning (cs.LG)
[26] arXiv:2510.01070 [pdf, html, other]
Title: Eliciting Secret Knowledge from Language Models
Bartosz Cywiński, Emil Ryd, Rowan Wang, Senthooran Rajamanoharan, Neel Nanda, Arthur Conmy, Samuel Marks
Subjects: Machine Learning (cs.LG)
[27] arXiv:2510.01051 [pdf, html, other]
Title: GEM: A Gym for Agentic LLMs
Zichen Liu, Anya Sims, Keyu Duan, Changyu Chen, Simon Yu, Xiangxin Zhou, Haotian Xu, Shaopan Xiong, Bo Liu, Chenmien Tan, Chuen Yang Beh, Weixun Wang, Hao Zhu, Weiyan Shi, Diyi Yang, Michael Shieh, Yee Whye Teh, Wee Sun Lee, Min Lin
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[28] arXiv:2510.01039 [pdf, html, other]
Title: Gated X-TFC: Soft Domain Decomposition for Forward and Inverse Problems in Sharp-Gradient PDEs
Vikas Dwivedi, Enrico Schiassi, Monica Sigovan, Bruno Sixou
Subjects: Machine Learning (cs.LG)
[29] arXiv:2510.01037 [pdf, html, other]
Title: CurES: From Gradient Analysis to Efficient Curriculum Learning for Reasoning LLMs
Yongcheng Zeng, Zexu Sun, Bokai Ji, Erxue Min, Hengyi Cai, Shuaiqiang Wang, Dawei Yin, Haifeng Zhang, Xu Chen, Jun Wang
Comments: 25 pages, 10 Figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[30] arXiv:2510.01032 [pdf, html, other]
Title: Meaningless Tokens, Meaningful Gains: How Activation Shifts Enhance LLM Reasoning
Zeru Shi, Yingjia Wan, Zhenting Wang, Qifan Wang, Fan Yang, Elisa Kreiss, Ruixiang Tang
Subjects: Machine Learning (cs.LG)
[31] arXiv:2510.01022 [pdf, html, other]
Title: Equivariant Geometric Scattering Networks via Vector Diffusion Wavelets
David R. Johnson, Rishabh Anand, Smita Krishnaswamy, Michael Perlmutter
Comments: Accepted for presentation at the NeurIPS workshop on New Perspectives in Advancing Graph Machine Learning
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Machine Learning (stat.ML)
[32] arXiv:2510.01020 [pdf, other]
Title: The Good, the Bad, and the Sampled: a No-Regret Approach to Safe Online Classification
Tavor Z. Baharav, Spyros Dragazis, Aldo Pacchiano
Comments: 43 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Statistics Theory (math.ST); Machine Learning (stat.ML)
[33] arXiv:2510.01012 [pdf, html, other]
Title: Random Feature Spiking Neural Networks
Maximilian Gollwitzer, Felix Dietrich
Comments: 34 pages incl. references & appendix, 3 figures, 4 tables
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[34] arXiv:2510.00983 [pdf, html, other]
Title: Riemannian Consistency Model
Chaoran Cheng, Yusong Wang, Yuxin Chen, Xiangxin Zhou, Nanning Zheng, Ge Liu
Comments: Accepted to NeurIPS 2025
Subjects: Machine Learning (cs.LG)
[35] arXiv:2510.00977 [pdf, html, other]
Title: It Takes Two: Your GRPO Is Secretly DPO
Yihong Wu, Liheng Ma, Lei Ding, Muzhi Li, Xinyu Wang, Kejia Chen, Zhan Su, Zhanguang Zhang, Chenyang Huang, Yingxue Zhang, Mark Coates, Jian-Yun Nie
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[36] arXiv:2510.00938 [pdf, other]
Title: Large Reasoning Models Learn Better Alignment from Flawed Thinking
ShengYun Peng, Eric Smith, Ivan Evtimov, Song Jiang, Pin-Yu Chen, Hongyuan Zhan, Haozhu Wang, Duen Horng Chau, Mahesh Pasupuleti, Jianfeng Chi
Subjects: Machine Learning (cs.LG)
[37] arXiv:2510.00915 [pdf, html, other]
Title: Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers
Xin-Qiang Cai, Wei Wang, Feng Liu, Tongliang Liu, Gang Niu, Masashi Sugiyama
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[38] arXiv:2510.00911 [pdf, html, other]
Title: RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM Post-Training
Tao Ren, Jinyang Jiang, Hui Yang, Wan Tian, Minhao Zou, Guanghao Li, Zishi Zhang, Qinghao Wang, Shentao Qin, Yanjun Zhao, Rui Tao, Hui Shao, Yijie Peng
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[39] arXiv:2510.00907 [pdf, html, other]
Title: BoMGene: Integrating Boruta-mRMR feature selection for enhanced Gene expression classification
Bich-Chung Phan, Thanh Ma, Huu-Hoa Nguyen, Thanh-Nghi Do
Subjects: Machine Learning (cs.LG)
[40] arXiv:2510.00885 [pdf, html, other]
Title: Rectifying Regression in Reinforcement Learning
Alex Ayoub, David Szepesvári, Alireza Baktiari, Csaba Szepesvári, Dale Schuurmans
Subjects: Machine Learning (cs.LG)
[41] arXiv:2510.00883 [pdf, html, other]
Title: GLAI: GreenLightningAI for Accelerated Training through Knowledge Decoupling
Jose I. Mestre, Alberto Fernández-Hernández, Cristian Pérez-Corral, Manuel F. Dolz, Jose Duato, Enrique S. Quintana-Ortí
Comments: 20 pages, 2 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[42] arXiv:2510.00873 [pdf, html, other]
Title: Reducción de ruido por medio de autoencoders: caso de estudio con la señal GW150914
Fernanda Zapata Bascuñán, Darío Fernando Mendieta
Comments: in Spanish language, Presented at the RPIC 2023 (Information Processing and Control work Reunion)
Subjects: Machine Learning (cs.LG); Instrumentation and Methods for Astrophysics (astro-ph.IM)
[43] arXiv:2510.00872 [pdf, other]
Title: A Visual Diagnostics Framework for District Heating Data: Enhancing Data Quality for AI-Driven Heat Consumption Prediction
Kristoffer Christensen, Bo Nørregaard Jørgensen, Zheng Grace Ma
Comments: Energy this http URL Conference 2025 (EI.A 2025), 3-6 December 2025, Universiti Tenaga Nasional (UNITEN), Kuala Lumpur, Malaysia
Subjects: Machine Learning (cs.LG); Human-Computer Interaction (cs.HC)
[44] arXiv:2510.00871 [pdf, html, other]
Title: Target Population Synthesis using CT-GAN
Tanay Rastogi, Daniel Jonsson
Comments: Submitted for journal and is under review
Subjects: Machine Learning (cs.LG)
[45] arXiv:2510.00866 [pdf, html, other]
Title: The data-quality illusion: Rethinking Classifier-based quality filtering for LLM Pretraining
Thiziri Nait Saada, Louis Bethune, Michal Klein, David Grangier, Marco Cuturi, Pierre Ablin
Comments: 21 pages, 20 figures, 2 tables, preprint
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[46] arXiv:2510.00859 [pdf, html, other]
Title: Population Synthesis using Incomplete Information
Tanay Rastogi, Daniel Jonsson, Anders Karlström
Comments: Presented at 25th Euro Working Group on Transportation (EWGT) Meeting
Journal-ref: Rastogi, Tanay, Daniel Jonsson, and Anders Karlstr\"om. "Population Synthesis Using Incomplete Microsample." Transportation Research Procedia 86 (2025): 80-87
Subjects: Machine Learning (cs.LG)
[47] arXiv:2510.00845 [pdf, html, other]
Title: Mechanistic Interpretability as Statistical Estimation: A Variance Analysis of EAP-IG
Maxime Méloux, Maxime Peyrard, François Portet
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[48] arXiv:2510.00841 [pdf, html, other]
Title: LLM Routing with Dueling Feedback
Chao-Kai Chiang, Takashi Ishida, Masashi Sugiyama
Subjects: Machine Learning (cs.LG)
[49] arXiv:2510.00819 [pdf, html, other]
Title: Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning
Luckeciano C. Melo, Alessandro Abate, Yarin Gal
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[50] arXiv:2510.00815 [pdf, html, other]
Title: Learn to Guide Your Diffusion Model
Alexandre Galashov, Ashwini Pokle, Arnaud Doucet, Arthur Gretton, Mauricio Delbracio, Valentin De Bortoli
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Total of 1554 entries : 1-50 51-100 101-150 151-200 ... 1551-1554
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • Click here to contact arXiv Contact
  • Click here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack