Machine Learning

Authors and titles for recent submissions

Total of 1554 entries

[1] arXiv:2510.01185: Title: Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs

Leyla Mirvakhabova, Babak Ehteshami Bejnordi, Gaurav Kumar, Hanxue Liang, Wanru Zhao, Paul Whatmough

Subjects: Machine Learning (cs.LG)

[2] arXiv:2510.01184: Title: Temporal Score Rescaling for Temperature Sampling in Diffusion and Flow Models

Yanbo Xu, Yu Wu, Sungjae Park, Zhizhuo Zhou, Shubham Tulsiani

Subjects: Machine Learning (cs.LG)

[3] arXiv:2510.01180: Title: BroRL: Scaling Reinforcement Learning via Broadened Exploration

Jian Hu, Mingjie Liu, Ximing Lu, Fang Wu, Zaid Harchaoui, Shizhe Diao, Yejin Choi, Pavlo Molchanov, Jun Yang, Jan Kautz, Yi Dong

Comments: 16 pages, 4 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)

[4] arXiv:2510.01179: Title: TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Zhangchen Xu, Adriana Meza Soria, Shawn Tan, Anurag Roy, Ashish Sunil Agrawal, Radha Poovendran, Rameswar Panda

Comments: 35 pages, 13 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

[5] arXiv:2510.01178: Title: COM-BOM: Bayesian Exemplar Search for Efficiently Exploring the Accuracy-Calibration Pareto Frontier

Gaoxiang Luo, Aryan Deshwal

Comments: Accepted by EMNLP 2025 Main, Code: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

[6] arXiv:2510.01175: Title: On the Benefits of Weight Normalization for Overparameterized Matrix Sensing

Yudong Wei, Liang Zhang, Bingcong Li, Niao He

Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Optimization and Control (math.OC); Machine Learning (stat.ML)

[7] arXiv:2510.01169: Title: Fiaingen: A financial time series generative method matching real-world data quality

Jože M. Rožanec, Tina Žezlin, Laurentiu Vasiliu, Dunja Mladenić, Radu Prodan, Dumitru Roman

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

[8] arXiv:2510.01167: Title: Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards

Yiran Shen, Yu Xia, Jonathan Chang, Prithviraj Ammanabrolu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

[9] arXiv:2510.01163: Title: How Does the Pretraining Distribution Shape In-Context Learning? Task Selection, Generalization, and Robustness

Waïss Azizian, Ali Hasan

Comments: 52 pages, 12 figures

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)

[10] arXiv:2510.01161: Title: Prosperity before Collapse: How Far Can Off-Policy RL Reach with Stale Data on LLMs?

Haizhong Zheng, Jiawei Zhao, Beidi Chen

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

[11] arXiv:2510.01159: Title: Multi-Marginal Flow Matching with Adversarially Learnt Interpolants

Oskar Kviman, Kirill Tamogashev, Nicola Branchini, Víctor Elvira, Jens Lagergren, Nikolay Malkin

Subjects: Machine Learning (cs.LG)

[12] arXiv:2510.01153: Title: Neural Hamilton–Jacobi Characteristic Flows for Optimal Transport

Yesom Park, Shu Liu, Mo Zhou, Stanley Osher

Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)

[13] arXiv:2510.01137: Title: Sample-Efficient Differentially Private Fine-Tuning via Gradient Matrix Denoising

Ali Dadsetan, Frank Rudzicz

Subjects: Machine Learning (cs.LG)

[14] arXiv:2510.01136: Title: TabINR: An Implicit Neural Representation Framework for Tabular Data Imputation

Vincent Ochs, Florentin Bieder, Sidaty el Hadramy, Paul Friedrich, Stephanie Taha-Mehlitz, Anas Taha, Philippe C. Cattin

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

[15] arXiv:2510.01135: Title: Prompt Curriculum Learning for Efficient LLM Post-Training

Zhaolin Gao, Joongwon Kim, Wen Sun, Thorsten Joachims, Sid Wang, Richard Yuanzhe Pang, Liang Tan

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)

[16] arXiv:2510.01132: Title: A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning

Ruiyi Wang, Prithviraj Ammanabrolu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

[17] arXiv:2510.01123: Title: Rethinking Thinking Tokens: LLMs as Improvement Operators

Lovish Madaan, Aniket Didolkar, Suchin Gururangan, John Quan, Ruan Silva, Ruslan Salakhutdinov, Manzil Zaheer, Sanjeev Arora, Anirudh Goyal

Comments: 21 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

[18] arXiv:2510.01118: Title: Breaking the Euclidean Barrier: Hyperboloid-Based Biological Sequence Analysis

Sarwan Ali, Haris Mansoor, Murray Patterson

Subjects: Machine Learning (cs.LG)

[19] arXiv:2510.01116: Title: Eliciting Chain-of-Thought Reasoning for Time Series Analysis using Reinforcement Learning

Felix Parker, Nimeesha Chan, Chi Zhang, Kimia Ghobadi

Subjects: Machine Learning (cs.LG)

[20] arXiv:2510.01113: Title: Privacy Preserved Federated Learning with Attention-Based Aggregation for Biometric Recognition

Kassahun Azezew, Minyechil Alehegn, Tsega Asresa, Bitew Mekuria, Tizazu Bayh, Ayenew Kassie, Amsalu Tesema, Animut Embiyale

Subjects: Machine Learning (cs.LG)

[21] arXiv:2510.01111: Title: Augmenting LLMs for General Time Series Understanding and Prediction

Felix Parker, Nimeesha Chan, Chi Zhang, Kimia Ghobadi

Subjects: Machine Learning (cs.LG)

[22] arXiv:2510.01105: Title: Geometric Properties of Neural Multivariate Regression

George Andriopoulos, Zixuan Dong, Bimarsha Adhikari, Keith Ross

Comments: 22 pages, 12 figures

Subjects: Machine Learning (cs.LG)

[23] arXiv:2510.01089: Title: Dynamical system reconstruction from partial observations using stochastic dynamics

Viktor Sip, Martin Breyton, Spase Petkoski, Viktor Jirsa

Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)

[24] arXiv:2510.01083: Title: Multi-Actor Multi-Critic Deep Deterministic Reinforcement Learning with a Novel Q-Ensemble Method

Andy Wu, Chun-Cheng Lin, Rung-Tzuo Liaw, Yuehua Huang, Chihjung Kuo, Chia Tong Weng

Subjects: Machine Learning (cs.LG)

[25] arXiv:2510.01074: Title: Predicting Diabetic Retinopathy Using a Two-Level Ensemble Model

Mahyar Mahmoudi, Tieming Liu

Comments: Accepted for presentation at the IISE Annual Conference & Expo 2025, 6 pages, 2 tables, 1 figure

Subjects: Machine Learning (cs.LG)

[26] arXiv:2510.01070: Title: Eliciting Secret Knowledge from Language Models

Bartosz Cywiński, Emil Ryd, Rowan Wang, Senthooran Rajamanoharan, Neel Nanda, Arthur Conmy, Samuel Marks

Subjects: Machine Learning (cs.LG)

[27] arXiv:2510.01051: Title: GEM: A Gym for Agentic LLMs

Zichen Liu, Anya Sims, Keyu Duan, Changyu Chen, Simon Yu, Xiangxin Zhou, Haotian Xu, Shaopan Xiong, Bo Liu, Chenmien Tan, Chuen Yang Beh, Weixun Wang, Hao Zhu, Weiyan Shi, Diyi Yang, Michael Shieh, Yee Whye Teh, Wee Sun Lee, Min Lin

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

[28] arXiv:2510.01039: Title: Gated X-TFC: Soft Domain Decomposition for Forward and Inverse Problems in Sharp-Gradient PDEs

Vikas Dwivedi, Enrico Schiassi, Monica Sigovan, Bruno Sixou

Subjects: Machine Learning (cs.LG)

[29] arXiv:2510.01037: Title: CurES: From Gradient Analysis to Efficient Curriculum Learning for Reasoning LLMs

Yongcheng Zeng, Zexu Sun, Bokai Ji, Erxue Min, Hengyi Cai, Shuaiqiang Wang, Dawei Yin, Haifeng Zhang, Xu Chen, Jun Wang

Comments: 25 pages, 10 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

[30] arXiv:2510.01032: Title: Meaningless Tokens, Meaningful Gains: How Activation Shifts Enhance LLM Reasoning

Zeru Shi, Yingjia Wan, Zhenting Wang, Qifan Wang, Fan Yang, Elisa Kreiss, Ruixiang Tang

Subjects: Machine Learning (cs.LG)

[31] arXiv:2510.01022: Title: Equivariant Geometric Scattering Networks via Vector Diffusion Wavelets

David R. Johnson, Rishabh Anand, Smita Krishnaswamy, Michael Perlmutter

Comments: Accepted for presentation at the NeurIPS workshop on New Perspectives in Advancing Graph Machine Learning

Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Machine Learning (stat.ML)

[32] arXiv:2510.01020: Title: The Good, the Bad, and the Sampled: a No-Regret Approach to Safe Online Classification

Tavor Z. Baharav, Spyros Dragazis, Aldo Pacchiano

Comments: 43 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Statistics Theory (math.ST); Machine Learning (stat.ML)

[33] arXiv:2510.01012: Title: Random Feature Spiking Neural Networks

Maximilian Gollwitzer, Felix Dietrich

Comments: 34 pages incl. references & appendix, 3 figures, 4 tables

Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)

[34] arXiv:2510.00983: Title: Riemannian Consistency Model

Chaoran Cheng, Yusong Wang, Yuxin Chen, Xiangxin Zhou, Nanning Zheng, Ge Liu

Comments: Accepted to NeurIPS 2025

Subjects: Machine Learning (cs.LG)

[35] arXiv:2510.00977: Title: It Takes Two: Your GRPO Is Secretly DPO

Yihong Wu, Liheng Ma, Lei Ding, Muzhi Li, Xinyu Wang, Kejia Chen, Zhan Su, Zhanguang Zhang, Chenyang Huang, Yingxue Zhang, Mark Coates, Jian-Yun Nie

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)

[36] arXiv:2510.00938: Title: Large Reasoning Models Learn Better Alignment from Flawed Thinking

ShengYun Peng, Eric Smith, Ivan Evtimov, Song Jiang, Pin-Yu Chen, Hongyuan Zhan, Haozhu Wang, Duen Horng Chau, Mahesh Pasupuleti, Jianfeng Chi

Subjects: Machine Learning (cs.LG)

[37] arXiv:2510.00915: Title: Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers

Xin-Qiang Cai, Wei Wang, Feng Liu, Tongliang Liu, Gang Niu, Masashi Sugiyama

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

[38] arXiv:2510.00911: Title: RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM Post-Training

Tao Ren, Jinyang Jiang, Hui Yang, Wan Tian, Minhao Zou, Guanghao Li, Zishi Zhang, Qinghao Wang, Shentao Qin, Yanjun Zhao, Rui Tao, Hui Shao, Yijie Peng

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

[39] arXiv:2510.00907: Title: BoMGene: Integrating Boruta-mRMR feature selection for enhanced Gene expression classification

Bich-Chung Phan, Thanh Ma, Huu-Hoa Nguyen, Thanh-Nghi Do

Subjects: Machine Learning (cs.LG)

[40] arXiv:2510.00885: Title: Rectifying Regression in Reinforcement Learning

Alex Ayoub, David Szepesvári, Alireza Baktiari, Csaba Szepesvári, Dale Schuurmans

Subjects: Machine Learning (cs.LG)

[41] arXiv:2510.00883: Title: GLAI: GreenLightningAI for Accelerated Training through Knowledge Decoupling

Jose I. Mestre, Alberto Fernández-Hernández, Cristian Pérez-Corral, Manuel F. Dolz, Jose Duato, Enrique S. Quintana-Ortí

Comments: 20 pages, 2 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

[42] arXiv:2510.00873: Title: Reducción de ruido por medio de autoencoders: caso de estudio con la señal GW150914 [Noise reduction using autoencoders: a case study with the GW150914 signal]

Fernanda Zapata Bascuñán, Darío Fernando Mendieta

Comments: In Spanish. Presented at RPIC 2023 (Working Meeting on Information Processing and Control)

Subjects: Machine Learning (cs.LG); Instrumentation and Methods for Astrophysics (astro-ph.IM)

[43] arXiv:2510.00872: Title: A Visual Diagnostics Framework for District Heating Data: Enhancing Data Quality for AI-Driven Heat Consumption Prediction

Kristoffer Christensen, Bo Nørregaard Jørgensen, Zheng Grace Ma

Comments: Energy Informatics.Academy Conference 2025 (EI.A 2025), 3-6 December 2025, Universiti Tenaga Nasional (UNITEN), Kuala Lumpur, Malaysia

Subjects: Machine Learning (cs.LG); Human-Computer Interaction (cs.HC)

[44] arXiv:2510.00871: Title: Target Population Synthesis using CT-GAN

Tanay Rastogi, Daniel Jonsson

Comments: Submitted to a journal; under review

Subjects: Machine Learning (cs.LG)

[45] arXiv:2510.00866: Title: The data-quality illusion: Rethinking Classifier-based quality filtering for LLM Pretraining

Thiziri Nait Saada, Louis Bethune, Michal Klein, David Grangier, Marco Cuturi, Pierre Ablin

Comments: 21 pages, 20 figures, 2 tables, preprint

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)

[46] arXiv:2510.00859: Title: Population Synthesis using Incomplete Information

Tanay Rastogi, Daniel Jonsson, Anders Karlström

Comments: Presented at 25th Euro Working Group on Transportation (EWGT) Meeting

Journal-ref: Rastogi, Tanay, Daniel Jonsson, and Anders Karlström. "Population Synthesis Using Incomplete Microsample." Transportation Research Procedia 86 (2025): 80-87

Subjects: Machine Learning (cs.LG)

[47] arXiv:2510.00845: Title: Mechanistic Interpretability as Statistical Estimation: A Variance Analysis of EAP-IG

Maxime Méloux, Maxime Peyrard, François Portet

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

[48] arXiv:2510.00841: Title: LLM Routing with Dueling Feedback

Chao-Kai Chiang, Takashi Ishida, Masashi Sugiyama

Subjects: Machine Learning (cs.LG)

[49] arXiv:2510.00819: Title: Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning

Luckeciano C. Melo, Alessandro Abate, Yarin Gal

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

[50] arXiv:2510.00815: Title: Learn to Guide Your Diffusion Model

Alexandre Galashov, Ashwini Pokle, Arnaud Doucet, Arthur Gretton, Mauricio Delbracio, Valentin De Bortoli

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)

Thu, 2 Oct 2025 (showing first 50 of 241 entries)