Contributors Forks Stargazers Issues

Updated on 2025.06.28

Usage instructions: here

Contents

  1. Adversarial attacks
  2. Poisoning attacks
  3. Generative models safety
  4. Data privacy
  5. Model Privacy
  6. Forensics
  7. AIGC

Adversarial attacks

2025-06-26

Publish Date Title Authors PDF Code
2025-06-26 Generative Adversarial Evasion and Out-of-Distribution Detection for UAV Cyber-Attacks Deepak Kumar Panda et.al. 2506.21142 null
2025-06-26 Curriculum-Guided Antifragile Reinforcement Learning for Secure UAV Deconfliction under Observation-Space Attacks Deepak Kumar Panda et.al. 2506.21129 null
2025-06-26 Robust Policy Switching for Antifragile Reinforcement Learning for UAV Deconfliction in Adversarial Environments Deepak Kumar Panda et.al. 2506.21127 null
2025-06-26 Boosting Generative Adversarial Transferability with Self-supervised Vision Transformer Features Shangbo Wu et.al. 2506.21046 null
2025-06-26 E-FreeM2: Efficient Training-Free Multi-Scale and Cross-Modal News Verification via MLLMs Van-Hoang Phan et.al. 2506.20944 null

2025-06-25

Publish Date Title Authors PDF Code
2025-06-25 Empowering Digital Agriculture: A Privacy-Preserving Framework for Data Sharing and Collaborative Research Osama Zafar et.al. 2506.20872 null
2025-06-25 Universal and Efficient Detection of Adversarial Data through Nonuniform Impact on Network Layers Furkan Mumcu et.al. 2506.20816 null
2025-06-25 Poster: Enhancing GNN Robustness for Network Intrusion Detection via Agent-based Analysis Zhonghao Zhan et.al. 2506.20806 null
2025-06-25 On Convolutions, Intrinsic Dimension, and Diffusion Models Kin Kwan Leung et.al. 2506.20705 null
2025-06-25 Vulnerability Disclosure through Adaptive Black-Box Adversarial Attacks on NIDS Sabrine Ennaji et.al. 2506.20576 null
2025-06-25 Multimodal Representation Learning and Fusion Qihang Jin et.al. 2506.20494 null

2025-06-24

Publish Date Title Authors PDF Code
2025-06-24 Adversarial Attacks on Deep Learning-Based False Data Injection Detection in Differential Relays Ahmad Mohammad Saber et.al. 2506.19302 null
2025-06-24 DRO-Augment Framework: Robustness by Synergizing Wasserstein Distributionally Robust Optimization and Data Augmentation Jiaming Hu et.al. 2506.17874 null

2025-06-23

Publish Date Title Authors PDF Code
2025-06-23 Amplifying Machine Learning Attacks Through Strategic Compositions Yugeng Liu et.al. 2506.18870 null
2025-06-23 SpaNN: Detecting Multiple Adversarial Patches on CNNs by Spanning Saliency Thresholds Mauricio Byrd Victorica et.al. 2506.18591 null
2025-06-23 DUMB and DUMBer: Is Adversarial Training Worth It in the Real World? Francesco Marchiori et.al. 2506.18516 null
2025-06-23 Sharpening the Spear: Adaptive Expert-Guided Adversarial Attack Against DRL-based Autonomous Driving Policies Junchao Fan et.al. 2506.18304 null
2025-06-23 Semantic Structure-Aware Generative Attacks for Enhanced Adversarial Transferability Jongoh Jeong et.al. 2506.18248 null

2025-06-22

Publish Date Title Authors PDF Code
2025-06-22 An Attack Method for Medical Insurance Claim Fraud Detection based on Generative Adversarial Network Yining Pang et.al. 2506.19871 null

2025-06-21

Publish Date Title Authors PDF Code
2025-06-21 Optimization-Free Patch Attack on Stereo Depth Estimation Hangcheng Liu et.al. 2506.17632 null

2025-06-20

Publish Date Title Authors PDF Code
2025-06-20 Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model Side Liu et.al. 2506.17162 null
2025-06-20 Robust Training with Data Augmentation for Medical Imaging Classification Josué Martínez-Martínez et.al. 2506.17133 null
2025-06-20 Large Language Model Unlearning for Source Code Xue Jiang et.al. 2506.17125 null
2025-06-20 DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches Yun Xing et.al. 2506.16690 null

2025-06-19

Publish Date Title Authors PDF Code
2025-06-19 Robustness Evaluation of OCR-based Visual Document Understanding under Multi-Modal Adversarial Attacks Dong Nguyen Tien et.al. 2506.16407 null
2025-06-19 MBA: Multimodal Bidirectional Attack for Referring Expression Segmentation Models Xingbai Chen et.al. 2506.16157 null
2025-06-19 Adversarial Attacks and Detection in Visual Place Recognition for Safer Robot Navigation Connor Malone et.al. 2506.15988 **[link](https://github.com/QVPR/aarapsiproject)**

2025-06-18

Publish Date Title Authors PDF Code
2025-06-18 PolyGuard: Massive Multi-Domain Safety Policy-Grounded Guardrail Dataset Mintong Kang et.al. 2506.19054 null
2025-06-18 VLMInferSlow: Evaluating the Efficiency Robustness of Large Vision-Language Models as a Service Xiasi Wang et.al. 2506.15755 null
2025-06-18 Insights on Adversarial Attacks for Tabular Machine Learning via a Systematic Literature Review Salijona Dyrmishi et.al. 2506.15506 null

2025-06-17

Publish Date Title Authors PDF Code
2025-06-17 Busting the Paper Ballot: Voting Meets Adversarial Machine Learning Kaleel Mahmood et.al. 2506.14582 **[link](https://github.com/votercenter/busting-the-ballot)**
2025-06-17 Doppelgänger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack Daewon Kang et.al. 2506.14539 null
2025-06-17 A Comparative Study on Proactive and Passive Detection of Deepfake Speech Chia-Hua Wu et.al. 2506.14398 **[link](https://github.com/nii-yamagishilab/antispoofing-watermark)**

2025-06-16

Publish Date Title Authors PDF Code
2025-06-16 Navigating the Black Box: Leveraging LLMs for Effective Text-Level Graph Injection Attacks Yuefei Lyu et.al. 2506.13276 null
2025-06-16 Position: Certified Robustness Does Not (Yet) Imply Model Security Andrew C. Cullen et.al. 2506.13024 null

2025-06-15

Publish Date Title Authors PDF Code
2025-06-15 The Safety Reminder: A Soft Prompt to Reactivate Delayed Safety Awareness in Vision-Language Models Peiyuan Tang et.al. 2506.15734 null
2025-06-15 Constraint-Guided Prediction Refinement via Deterministic Diffusion Trajectories Pantelis Dogoulis et.al. 2506.12911 null
2025-06-15 Intriguing Frequency Interpretation of Adversarial Robustness for CNNs and ViTs Lu Chen et.al. 2506.12875 null
2025-06-15 Active Adversarial Noise Suppression for Image Forgery Localization Rongxuan Peng et.al. 2506.12871 null
2025-06-15 NAP-Tuning: Neural Augmented Prompt Tuning for Adversarially Robust Vision-Language Models Jiaming Zhang et.al. 2506.12706 null
2025-06-15 Alphabet Index Mapping: Jailbreaking LLMs through Semantic Dissimilarity Bilal Saleh Husain et.al. 2506.12685 null

2025-06-14

Publish Date Title Authors PDF Code
2025-06-14 Existence of Adversarial Examples for Random Convolutional Networks via Isoperimetric Inequalities on $\mathbb{so}(d)$ Amit Daniely et.al. 2506.12613 null
2025-06-14 On the existence of consistent adversarial attacks in high-dimensional linear classification Matteo Vilucchio et.al. 2506.12454 null
2025-06-14 Exploring the Secondary Risks of Large Language Models Jiawei Chen et.al. 2506.12382 null

2025-06-13

Publish Date Title Authors PDF Code
2025-06-13 Learning Causality for Modern Machine Learning Yongqiang Chen et.al. 2506.12226 null
2025-06-13 Improving Large Language Model Safety with Contrastive Representation Learning Samuel Simko et.al. 2506.11938 **[link](https://github.com/samuelsimko/crl-llm-defense)**
2025-06-13 A Neural Rejection System Against Universal Adversarial Perturbations in Radio Signal Classification Lu Zhang et.al. 2506.11901 null
2025-06-13 Attention-based Adversarial Robust Distillation in Radio Signal Classifications for Low-Power IoT Devices Lu Zhang et.al. 2506.11892 null
2025-06-13 TrustGLM: Evaluating the Robustness of GraphLLMs Against Prompt, Text, and Structure Attacks Qihai Zhang et.al. 2506.11844 null
2025-06-13 Differential Privacy in Machine Learning: From Symbolic AI to LLMs Francisco Aguilera-Martínez et.al. 2506.11687 null
2025-06-13 KCES: Training-Free Defense for Robust Graph Neural Networks via Kernel Complexity Yaning Jia et.al. 2506.11611 null
2025-06-13 Investigating Vulnerabilities and Defenses Against Audio-Visual Attacks: A Comprehensive Survey Emphasizing Multimodal Models Jinming Wen et.al. 2506.11521 null

2025-06-12

Publish Date Title Authors PDF Code
2025-06-12 Lattice Climber Attack: Adversarial attacks for randomized mixtures of classifiers Lucas Gnecco-Heredia et.al. 2506.10888 **[link](https://github.com/lucasgneccoh/lattice_climber_attack)**
2025-06-12 Efficiency Robustness of Dynamic Deep Learning Systems Ravishka Rathnasuriya et.al. 2506.10831 **[link](https://github.com/anonymous-sok/sok_submission)**
2025-06-12 Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework Xia Du et.al. 2506.10685 null
2025-06-12 Assessing the Resilience of Automotive Intrusion Detection Systems to Adversarial Manipulation Stefano Longari et.al. 2506.10620 null
2025-06-12 Boosting Adversarial Transferability for Hyperspectral Image Classification Using 3D Structure-invariant Transformation and Intermediate Feature Distance Chun Liu et.al. 2506.10459 null

2025-06-11

Publish Date Title Authors PDF Code
2025-06-11 GenBreak: Red Teaming Text-to-Image Generators Using Large Language Models Zilong Wang et.al. 2506.10047 null
2025-06-11 A look at adversarial attacks on radio waveforms from discrete latent space Attanasia Garuso et.al. 2506.09896 null
2025-06-11 Evasion Attacks Against Bayesian Predictive Models Pablo G. Arce et.al. 2506.09640 **[link](https://github.com/pablogarciarce/advreg)**
2025-06-11 Effective Red-Teaming of Policy-Adherent Agents Itay Nakash et.al. 2506.09600 null
2025-06-11 LLMs Cannot Reliably Judge (Yet?): A Comprehensive Assessment on the Robustness of LLM-as-a-Judge Songze Li et.al. 2506.09443 **[link](https://github.com/s3ic-lab/robustjudge)**
2025-06-11 Adversarial Surrogate Risk Bounds for Binary Classification Natalie S. Frank et.al. 2506.09348 null
2025-06-11 AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI) Danush Khanna et.al. 2506.08885 null

2025-06-10

Publish Date Title Authors PDF Code
2025-06-10 PatchGuard: Adversarially Robust Anomaly Detection and Localization through Vision Transformers and Pseudo Anomalies Mojtaba Nafez et.al. 2506.09237 **[link](https://github.com/rohban-lab/patchgaurd)**
2025-06-10 Adversarial Text Generation with Dynamic Contextual Perturbation Hetvi Waghela et.al. 2506.09148 null
2025-06-10 Towards Robust Deep Reinforcement Learning against Environmental State Perturbation Chenxu Wang et.al. 2506.08961 null
2025-06-10 Efficient Robust Conformal Prediction via Lipschitz-Bounded Networks Thomas Massena et.al. 2506.05434 null

2025-06-09

Publish Date Title Authors PDF Code
2025-06-09 SHIELD: Secure Hypernetworks for Incremental Expansion Learning Defense Patryk Krukowski et.al. 2506.08255 null
2025-06-09 Adversarial Attack Classification and Robustness Testing for Large Language Models for Code Yang Liu et.al. 2506.07942 null
2025-06-09 Enhancing Adversarial Robustness with Conformal Prediction: A Framework for Guaranteed Model Reliability Jie Bao et.al. 2506.07804 **[link](https://github.com/bjbbbb/enhancing-adversarial-robustness-with-conformal-prediction)**
2025-06-09 ProARD: progressive adversarial robustness distillation: provide wide range of robust students Seyedhamidreza Mousavi et.al. 2506.07666 null
2025-06-09 Explore the vulnerability of black-box models via diffusion models Jiacheng Shi et.al. 2506.07590 null

2025-06-08

Publish Date Title Authors PDF Code
2025-06-08 D2R: dual regularization loss with collaborative adversarial generation for model robustness Zhenyu Liu et.al. 2506.07056 null
2025-06-08 Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text Yize Cheng et.al. 2506.07001 null
2025-06-08 Boosting Adversarial Transferability via Commonality-Oriented Gradient Optimization Yanting Gao et.al. 2506.06992 null

2025-06-07

Publish Date Title Authors PDF Code
2025-06-07 Rewriting the Budget: A General Framework for Black-Box Attacks Under Cost Asymmetry Mahdi Salmani et.al. 2506.06933 null
2025-06-07 KNN-Defense: Defense against 3D Adversarial Point Clouds using Nearest-Neighbor Search Nima Jamali et.al. 2506.06906 null

2025-06-06

Publish Date Title Authors PDF Code
2025-06-06 SDN-Based False Data Detection With Its Mitigation and Machine Learning Robustness for In-Vehicle Networks Long Dang et.al. 2506.06556 null
2025-06-06 SATversary: Adversarial Attacks on Satellite Fingerprinting Joshua Smailes et.al. 2506.06119 null

2025-06-05

Publish Date Title Authors PDF Code
2025-06-05 Exploring Adversarial Watermarking in Transformer-Based Models: Transferability and Robustness Against Defense Mechanism for Medical Images Rifat Sadik et.al. 2506.06389 null
2025-06-05 Explainer-guided Targeted Adversarial Attacks against Binary Code Similarity Detection Models Mingjie Chen et.al. 2506.05430 null
2025-06-05 Identifying and Understanding Cross-Class Features in Adversarial Training Zeming Wei et.al. 2506.05032 null
2025-06-05 Robustness as Architecture: Designing IQA Models to Withstand Adversarial Perturbations Igor Meleshin et.al. 2506.04951 null
2025-06-05 Fool the Stoplight: Realistic Adversarial Patch Attacks on Traffic Light Detectors Svetlana Pavlitska et.al. 2506.04823 null
2025-06-05 Influence Functions for Edge Edits in Non-Convex Graph Neural Networks Jaeseung Heo et.al. 2506.04694 null
2025-06-05 Towards Better Generalization via Distributional Input Projection Network Yifan Hao et.al. 2506.04690 null
2025-06-05 Normative Conflicts and Shallow AI Alignment Raphaël Millière et.al. 2506.04679 null
2025-06-05 RAID: A Dataset for Testing the Adversarial Robustness of AI-Generated Image Detectors Hicham Eddoubi et.al. 2506.03988 **[link](https://github.com/pralab/raid)**

2025-06-04

Publish Date Title Authors PDF Code
2025-06-04 Poisoning Behavioral-based Worker Selection in Mobile Crowdsensing using Generative Adversarial Networks Ruba Nasser et.al. 2506.05403 null
2025-06-04 Sylva: Tailoring Personalized Adversarial Defense in Pre-trained Models via Collaborative Fine-tuning Tianyu Qi et.al. 2506.05402 null
2025-06-04 Prediction Inconsistency Helps Achieve Generalizable Detection of Adversarial Examples Sicong Han et.al. 2506.03765 null
2025-06-04 Robustness of Prompting: Enhancing Robustness of Large Language Models Against Prompting Attacks Lin Mu et.al. 2506.03627 null
2025-06-04 Across Programming Language Silos: A Study on Cross-Lingual Retrieval-augmented Code Generation Qiming Zhu et.al. 2506.03535 null

2025-06-03

Publish Date Title Authors PDF Code
2025-06-03 Attacking Attention of Foundation Models Disrupts Downstream Tasks Hondamunige Prasanna Silva et.al. 2506.05394 null
2025-06-03 Dynamic Epsilon Scheduling: A Multi-Factor Adaptive Perturbation Budget for Adversarial Training Alan Mitkiy et.al. 2506.04263 null
2025-06-03 Adversarial Attacks on Robotic Vision Language Action Models Eliot Krzysztof Jones et.al. 2506.03350 **[link](https://github.com/eliotjones1/robogcg)**
2025-06-03 Privacy Leaks by Adversaries: Adversarial Iterations for Membership Inference Attack Jing Xue et.al. 2506.02711 null
2025-06-03 Poster: FedBlockParadox -- A Framework for Simulating and Securing Decentralized Federated Learning Gabriele Digregorio et.al. 2506.02679 null
2025-06-03 Tarallo: Evading Behavioral Malware Detectors in the Problem Space Gabriele Digregorio et.al. 2506.02660 **[link](https://github.com/necst/Tarallo)**
2025-06-03 BitBypass: A New Direction in Jailbreaking Aligned Large Language Models with Bitstream Camouflage Kalyan Nakka et.al. 2506.02479 null
2025-06-03 Adversarial control of synchronization in complex oscillator networks Yasutoshi Nagahama et.al. 2506.02403 **[link](https://github.com/kztakemoto/advSync)**
2025-06-03 On the Stability of Graph Convolutional Neural Networks: A Probabilistic Perspective Ning Zhang et.al. 2506.01213 null

2025-06-02

Publish Date Title Authors PDF Code
2025-06-02 Robust Satisficing Gaussian Process Bandits Under Adversarial Attacks Artun Saday et.al. 2506.01625 null
2025-06-02 Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation Yuan Gan et.al. 2506.01591 null
2025-06-02 Enhancing Diffusion-based Unrestricted Adversarial Attacks via Adversary Preferences Alignment Kaixun Jiang et.al. 2506.01511 null
2025-06-02 Adversarial learning for nonparametric regression: Minimax rate and adaptive estimation Jingfu Peng et.al. 2506.01267 null

2025-06-01

Publish Date Title Authors PDF Code
2025-06-01 Fighting Fire with Fire (F3): A Training-free and Efficient Visual Adversarial Example Purification Method in LVLMs Yudong Zhang et.al. 2506.01064 null
2025-06-01 CAPAA: Classifier-Agnostic Projector-Based Adversarial Attack Zhan Li et.al. 2506.00978 null
2025-06-01 Breaking Latent Prior Bias in Detectors for Generalizable AIGC Image Detection Yue Zhou et.al. 2506.00874 null
2025-06-01 SafeGenes: Evaluating the Adversarial Robustness of Genomic Foundation Models Huixin Zhan et.al. 2506.00821 null

2025-05-31

Publish Date Title Authors PDF Code
2025-05-31 Poster: Adapting Pretrained Vision Transformers with LoRA Against Attack Vectors Richard E. Neddo et.al. 2506.00661 null
2025-05-31 Con Instruction: Universal Jailbreaking of Multimodal Large Language Models via Non-Textual Modalities Jiahui Geng et.al. 2506.00548 null
2025-05-31 Adversarial Machine Learning for Robust Password Strength Estimation Pappu Jha et.al. 2506.00373 null
2025-05-31 Towards Effective and Efficient Adversarial Defense with Diffusion Models for Robust Visual Tracking Long Xu et.al. 2506.00325 null
2025-05-31 Practical Adversarial Attacks on Stochastic Bandits via Fake Data Injection Qirun Zeng et.al. 2505.21938 null

2025-05-30

Publish Date Title Authors PDF Code
2025-05-30 Adversarial Threat Vectors and Risk Mitigation for Retrieval-Augmented Generation Systems Chris M. Ward et.al. 2506.00281 null
2025-05-30 3D Gaussian Splat Vulnerabilities Matthew Hull et.al. 2506.00280 null
2025-05-30 Black-box Adversarial Attacks on CNN-based SLAM Algorithms Maria Rafaela Gkeka et.al. 2505.24654 null
2025-05-30 A Flat Minima Perspective on Understanding Augmentations and Model Robustness Weebum Yoo et.al. 2505.24592 null
2025-05-30 Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors Andrea Pedrotti et.al. 2505.24523 null
2025-05-30 Learning Safety Constraints for Large Language Models Xin Chen et.al. 2505.24445 null
2025-05-30 Adversarial Preference Learning for Robust LLM Alignment Yuanfu Wang et.al. 2505.24369 null
2025-05-30 Light as Deception: GPT-driven Natural Relighting Against Vision-Language Pre-training Models Ying Yang et.al. 2505.24227 null
2025-05-30 The Butterfly Effect in Pathology: Exploring Security in Pathology Foundation Models Jiashuai Liu et.al. 2505.24141 null
2025-05-30 Adversarially Robust AI-Generated Image Detection for Free: An Information Theoretic Perspective Ruixuan Zhang et.al. 2505.22604 null

2025-05-29

Publish Date Title Authors PDF Code
2025-05-29 SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents Kunlun Zhu et.al. 2505.23559 **[link](https://github.com/ulab-uiuc/safescientist)**
2025-05-29 TRAP: Targeted Redirecting of Agentic Preferences Hangoo Kang et.al. 2505.23518 null
2025-05-29 Adaptive Jailbreaking Strategies Based on the Semantic Understanding Capabilities of Large Language Models Mingyu Yu et.al. 2505.23404 null
2025-05-29 Adversarial Semantic and Label Perturbation Attack for Pedestrian Attribute Recognition Weizhe Kong et.al. 2505.23313 **[link](https://github.com/event-ahu/openpar)**
2025-05-29 Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion Chunlong Xie et.al. 2505.23266 null
2025-05-29 The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector Aixuan Li et.al. 2505.22499 null

2025-05-28

Publish Date Title Authors PDF Code
2025-05-28 Understanding Adversarial Training with Energy-based Models Mujtaba Hussain Mirza et.al. 2505.22486 null
2025-05-28 Seeing the Threat: Vulnerabilities in Vision-Language Models to Adversarial Attack Juan Ren et.al. 2505.21967 null
2025-05-28 Rethinking Gradient-based Adversarial Attacks on Point Cloud Classification Jun Chen et.al. 2505.21854 null
2025-05-28 The Silent Saboteur: Imperceptible Adversarial Attacks against Black-Box Retrieval-Augmented Generation Systems Hongru Song et.al. 2505.18583 null

2025-05-27

Publish Date Title Authors PDF Code
2025-05-27 What is Adversarial Training for Diffusion Models? Briglia Maria Rosaria et.al. 2505.21742 null
2025-05-27 Preventing Adversarial AI Attacks Against Autonomous Situational Awareness: A Maritime Case Study Mathew J. Walter et.al. 2505.21609 null
2025-05-27 Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment Xiaojun Jia et.al. 2505.21494 null
2025-05-27 Attribute-Efficient PAC Learning of Sparse Halfspaces with Constant Malicious Noise Rate Shiwei Zeng et.al. 2505.21430 null
2025-05-27 A Framework for Adversarial Analysis of Decision Support Systems Prior to Deployment Brett Bissey et.al. 2505.21414 null
2025-05-27 Boosting Adversarial Transferability via High-Frequency Augmentation and Hierarchical-Gradient Fusion Yayin Zheng et.al. 2505.21181 null
2025-05-27 TabAttackBench: A Benchmark for Adversarial Attacks on Tabular Data Zhipeng He et.al. 2505.21027 null
2025-05-27 Breaking Dataset Boundaries: Class-Agnostic Targeted Adversarial Attacks Taïga Gonçalves et.al. 2505.20782 null

2025-05-26

Publish Date Title Authors PDF Code
2025-05-26 Comparing Neural Network Encodings for Logic-based Explainability Levi Cordeiro Carvalho et.al. 2505.20269 null
2025-05-26 Attention! You Vision Language Model Could Be Maliciously Manipulated Xiaosen Wang et.al. 2505.19911 null
2025-05-26 One Surrogate to Fool Them All: Universal, Transferable, and Targeted Adversarial Attacks with CLIP Binyan Xu et.al. 2505.19840 **[link](https://github.com/binyxu/univintruder)**
2025-05-26 TESSER: Transfer-Enhancing Adversarial Attacks from Vision Transformers via Spectral and Semantic Regularization Amira Guesmi et.al. 2505.19613 null

2025-05-25

Publish Date Title Authors PDF Code
2025-05-25 Curvature Dynamic Black-box Attack: revisiting adversarial robustness via dynamic curvature estimation Peiran Sun et.al. 2505.19194 null

2025-05-24

Publish Date Title Authors PDF Code
2025-05-24 Security Concerns for Large Language Models: A Survey Miles Q. Li et.al. 2505.18889 null
2025-05-24 Audio Jailbreak Attacks: Exposing Vulnerabilities in SpeechGPT in a White-Box Framework Binhao Ma et.al. 2505.18864 null
2025-05-24 Mal-D2GAN: Double-Detector based GAN for Malware Generation Nam Hoang Thanh et.al. 2505.18806 null

2025-05-23

Publish Date Title Authors PDF Code
2025-05-23 Towards more transferable adversarial attack in black-box manner Chun Tong Lei et.al. 2505.18097 null
2025-05-23 CAMME: Adaptive Deepfake Image Detection with Multi-Modal Cross-Attention Naseem Khan et.al. 2505.18035 **[link](https://github.com/magnet300/camme)**
2025-05-23 SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification Shashank Agnihotri et.al. 2505.18015 **[link](https://github.com/shashankskagnihotri/benchmarking_reliability_generalization)**
2025-05-23 Superplatforms Have to Attack AI Agents Jianghao Lin et.al. 2505.17861 null
2025-05-23 Temporal Consistency Constrained Transferable Adversarial Attacks with Background Mixup for Action Recognition Ping Li et.al. 2505.17807 **[link](https://github.com/mlvccn/bmtc_transferattackvid)**
2025-05-23 EVADE: Multimodal Benchmark for Evasive Content Detection in E-Commerce Applications Ancheng Xu et.al. 2505.17654 null
2025-05-23 Ownership Verification of DNN Models Using White-Box Adversarial Attacks with Specified Probability Manipulation Teruki Sano et.al. 2505.17579 null
2025-05-23 What You Read Isn't What You Hear: Linguistic Sensitivity in Deepfake Speech Detection Binh Nguyen et.al. 2505.17513 null
2025-05-23 Enhancing Adversarial Robustness of Vision Language Models via Adversarial Mixture Prompt Tuning Shiji Zhao et.al. 2505.17509 null
2025-05-23 VEAttack: Downstream-agnostic Vision Encoder Attack against Large Vision Language Models Hefei Mei et.al. 2505.17440 null
2025-05-23 When Are Concepts Erased From Diffusion Models? Kevin Lu et.al. 2505.17013 null

2025-05-22

Publish Date Title Authors PDF Code
2025-05-22 Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms Baran Hashemi et.al. 2505.17190 null
2025-05-22 MixAT: Combining Continuous and Discrete Adversarial Training for LLMs Csaba Dékány et.al. 2505.16947 null
2025-05-22 CAIN: Hijacking LLM-Humans Conversations via a Two-Stage Malicious System Prompt Generation and Refining Framework Viet Pham et.al. 2505.16888 null
2025-05-22 Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected Vulnerability Punya Syon Pandey et.al. 2505.16789 **[link](https://github.com/psyonp/accidental_misalignment)**
2025-05-22 Experimental robustness benchmark of quantum neural network on a superconducting quantum processor Hai-Feng Zhang et.al. 2505.16714 null
2025-05-22 AdvReal: Adversarial Patch Generation Framework with Application to Adversarial Safety Evaluation of Object Detection Systems Yuanhao Huang et.al. 2505.16402 **[link](https://github.com/huangyh98/advreal)**
2025-05-22 Chain-of-Thought Poisoning Attacks against R1-based Retrieval-Augmented Generation Systems Hongru Song et.al. 2505.16367 null
2025-05-22 Accelerating Targeted Hard-Label Adversarial Attacks in Low-Query Black-Box Settings Arjhun Swaminathan et.al. 2505.16313 **[link](https://github.com/mdppml/tea)**
2025-05-22 SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning Kaiwen Zhou et.al. 2505.16186 null
2025-05-22 TRAIL: Transferable Robust Adversarial Images via Latent diffusion Yuhao Xue et.al. 2505.16166 null
2025-05-22 A Few Large Shifts: Layer-Inconsistency Based Minimal Overhead Adversarial Example Detection Sanggeon Yun et.al. 2505.12586 null

2025-05-21

Publish Date Title Authors PDF Code
2025-05-21 Reverse Engineering Human Preferences with Reinforcement Learning Lisa Alazraki et.al. 2505.15795 null
2025-05-21 Beyond Classification: Evaluating Diffusion Denoised Smoothing for Security-Utility Trade off Yury Belousov et.al. 2505.15594 null
2025-05-21 My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping Hon Ming Yam et.al. 2505.15336 null
2025-05-21 Blind Spot Navigation: Evolutionary Discovery of Sensitive Semantic Concepts for LVLMs Zihao Pan et.al. 2505.15265 null
2025-05-21 Few-Shot Adversarial Low-Rank Fine-Tuning of Vision-Language Models Sajjad Ghiasvand et.al. 2505.15130 null
2025-05-21 A Survey On Secure Machine Learning Taobo Liao et.al. 2505.15124 null
2025-05-21 Robustness Evaluation of Graph-based News Detection Using Network Structural Information Xianghua Zeng et.al. 2505.14453 null

2025-05-20

Publish Date Title Authors PDF Code
2025-05-20 GraphemeAug: A Systematic Approach to Synthesized Hard Negative Keyword Spotting Examples Harry Zhang et.al. 2505.14814 null
2025-05-20 Adverseness vs. Equilibrium: Exploring Graph Adversarial Resilience through Dynamic Equilibrium Xinxin Fan et.al. 2505.14463 null
2025-05-20 Universal Acoustic Adversarial Attacks for Flexible Control of Speech-LLMs Rao Ma et.al. 2505.14286 null
2025-05-20 Adversarial Training from Mean Field Perspective Soichiro Kumano et.al. 2505.14021 null
2025-05-20 Safety2Drive: Safety-Critical Scenario Benchmark for the Evaluation of Autonomous Driving Jingzheng Li et.al. 2505.13872 null

2025-05-19

Publish Date Title Authors PDF Code
2025-05-19 FlowPure: Continuous Normalizing Flows for Adversarial Purification Elias Collaert et.al. 2505.13280 **[link](https://github.com/distrinet/flowpure)**
2025-05-19 Quantum Algorithms for Causal Estimands Rishi Goel et.al. 2505.12873 null
2025-05-19 Language Models That Walk the Talk: A Framework for Formal Fairness Certificates Danqing Chen et.al. 2505.12767 null
2025-05-19 On the Mechanisms of Adversarial Data Augmentation for Robust and Adaptive Transfer Learning Hana Satou et.al. 2505.12681 null
2025-05-19 Spiking Neural Network: a low power solution for physical layer authentication Jung Hoon Lee et.al. 2505.12647 null

2025-05-18

Publish Date Title Authors PDF Code
2025-05-18 SPIRIT: Patching Speech Language Models against Jailbreak Attacks Amirbek Djanibekov et.al. 2505.13541 null
2025-05-18 A Survey of Attacks on Large Language Models Wenrui Xu et.al. 2505.12567 null
2025-05-18 EVALOOP: Assessing LLM Robustness in Programming from a Self-consistency Perspective Sen Fang et.al. 2505.12185 null

2025-05-17

Publish Date Title Authors PDF Code
2025-05-17 FABLE: A Localized, Targeted Adversarial Attack on Weather Forecasting Models Yue Deng et.al. 2505.12167 null
2025-05-17 Black-box Adversaries from Latent Space: Unnoticeable Attacks on Human Pose and Shape Estimation Zhiying Li et.al. 2505.12009 null
2025-05-17 Adversarial Robustness for Unified Multi-Modal Encoders via Efficient Calibration Chih-Ting Liao et.al. 2505.11895 null

2025-05-16

Publish Date Title Authors PDF Code
2025-05-16 On the Sharp Input-Output Analysis of Nonlinear Systems under Adversarial Attacks Jihun Kim et.al. 2505.11688 null
2025-05-16 GenoArmory: A Unified Evaluation Framework for Adversarial Attacks on Genomic Foundation Models Haozheng Luo et.al. 2505.10983 **[link](https://github.com/MAGICS-LAB/GenoArmory)**

2025-05-14

Publish Date Title Authors PDF Code
2025-05-14 Revisiting Adversarial Perception Attacks and Defense Methods on Autonomous Driving Systems Cheng Chen et.al. 2505.11532 null
2025-05-14 Adversarial Attack on Large Language Models using Exponentiated Gradient Descent Sajib Biswas et.al. 2505.09820 **[link](https://github.com/sbamit/exponentiated-gradient-descent-llm-attack)**
2025-05-14 Evaluating the Robustness of Adversarial Defenses in Malware Detection Systems Mostafa Jafari et.al. 2505.09342 **[link](https://github.com/mostafa-ja/sigma-binary)**
2025-05-14 Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction Xiaowei Zhu et.al. 2505.05084 null

2025-05-13

Publish Date Title Authors PDF Code
2025-05-13 Towards Adaptive Meta-Gradient Adversarial Examples for Visual Tracking Wei-Long Tian et.al. 2505.08999 **[link](https://github.com/pgao-lab/amga)**
2025-05-13 SHAP-based Explanations are Sensitive to Feature Representation Hyunseung Hwang et.al. 2505.08345 **[link](https://github.com/aguno/shap-attack)**
2025-05-13 AI and Generative AI Transforming Disaster Management: A Survey of Damage Assessment and Response Techniques Aman Raj et.al. 2505.08202 null
2025-05-13 Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs Chetan Pathade et.al. 2505.04806 null

2025-05-12

Publish Date Title Authors PDF Code
2025-05-12 Sharp Gaussian approximations for Decentralized Federated Learning Soham Bonnerjee et.al. 2505.08125 null
2025-05-12 Dynamical Low-Rank Compression of Neural Networks with Robustness under Adversarial Attacks Steffen Schotthöfer et.al. 2505.08022 null
2025-05-12 Must Read: A Systematic Survey of Computational Persuasion Nimet Beyza Bozdag et.al. 2505.07775 **[link](https://github.com/beyzabozdag/PersuasionSurvey)**
2025-05-12 GRADA: Graph-based Reranker against Adversarial Documents Attack Jingjie Zheng et.al. 2505.07546 **[link](https://github.com/agrzheng/GRADA)**
2025-05-12 No Query, No Access Wenqiang Wang et.al. 2505.07258 null

2025-05-11

Publish Date Title Authors PDF Code
2025-05-11 IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method Mihyeon Kim et.al. 2505.06889 null
2025-05-11 DP-TRAE: A Dual-Phase Merging Transferable Reversible Adversarial Example for Image Privacy Protection Xia Du et.al. 2505.06860 null

2025-05-10

Publish Date Title Authors PDF Code
2025-05-10 Boundary-Guided Trajectory Prediction for Road Aware and Physically Feasible Autonomous Driving Ahmed Abouelazm et.al. 2505.06740 null
2025-05-10 TAROT: Towards Essentially Domain-Invariant Robustness with Theoretical Justification Dongyoon Yang et.al. 2505.06580 null

2025-05-09

Publish Date Title Authors PDF Code
2025-05-09 Remote Rowhammer Attack using Adversarial Observations on Federated Learning Clients Jinsheng Yuan et.al. 2505.06335 null
2025-05-09 Realistic Adversarial Attacks for Robustness Evaluation of Trajectory Prediction Models via Future State Perturbation Julian F. Schumann et.al. 2505.06134 **[link](https://github.com/jhagenus/general-framework-update-adversarial-jeroen)**
2025-05-09 A Taxonomy of Attacks and Defenses in Split Learning Aqsa Shabbir et.al. 2505.05872 null

2025-05-08

Publish Date Title Authors PDF Code
2025-05-08 Unpacking Robustness in Inflectional Languages: Adversarial Evaluation and Mechanistic Insights Paweł Walkowiak et.al. 2505.07856 null
2025-05-08 X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP Hanxun Huang et.al. 2505.05528 **[link](https://github.com/HanxunH/XTransferBench)**
2025-05-08 DispBench: Benchmarking Disparity Estimation to Synthetic Corruptions Shashank Agnihotri et.al. 2505.05091 **[link](https://github.com/shashankskagnihotri/benchmarking_robustness)**
2025-05-08 Uncovering the Limitations of Model Inversion Evaluation -- Benchmarks and Connection to Type-I Adversarial Attacks Sy-Tuyen Ho et.al. 2505.03519 null

2025-05-07

Publish Date Title Authors PDF Code
2025-05-07 Input-Specific and Universal Adversarial Attack Generation for Spiking Neural Networks in the Spiking Domain Spyridon Raptis et.al. 2505.06299 null
2025-05-07 Crafting Physical Adversarial Examples by Combining Differentiable and Physically Based Renders Yuqiu Liu et.al. 2505.04662 null
2025-05-07 Reliable Disentanglement Multi-view Learning Against View Adversarial Attacks Xuyang Wang et.al. 2505.04046 **[link](https://github.com/Willy1005/2025-IJCAI-RDML)**

2025-05-06

Publish Date Title Authors PDF Code
2025-05-06 ALMA: Aggregated Lipschitz Maximization Attack on Auto-encoders Chethan Krishnamurthy Ramanaik et.al. 2505.03646 null
2025-05-06 Robustness in AI-Generated Detection: Enhancing Resistance to Adversarial Attacks Sun Haoxuan et.al. 2505.03435 null
2025-05-06 Attention-aggregated Attack for Boosting the Transferability of Facial Adversarial Examples Jian-Wei Li et.al. 2505.03383 null
2025-05-06 A Chaos Driven Metric for Backdoor Attack Detection Hema Karnam Surendrababu et.al. 2505.03208 null
2025-05-06 Adversarial Sample Generation for Anomaly Detection in Industrial Control Systems Abdul Mustafa et.al. 2505.03120 null
2025-05-06 Adversarial Attacks in Multimodal Systems: A Practitioner's Survey Shashank Kapoor et.al. 2505.03084 null
2025-05-06 Adversarial Robustness of Deep Learning Models for Inland Water Body Segmentation from SAR Images Siddharth Kothari et.al. 2505.01884 **[link](https://github.com/gvcl/iwseg-sar-poison)**

2025-05-05

Publish Date Title Authors PDF Code
2025-05-05 Adversarial Robustness Analysis of Vision-Language Models in Medical Image Segmentation Anjila Budathoki et.al. 2505.02971 **[link](https://github.com/anjilab/secure-private-ai)**
2025-05-05 Bayesian Robust Aggregation for Federated Learning Aleksandr Karakulev et.al. 2505.02490 **[link](https://github.com/sciml-fl/bra-fl)**

2025-05-04

Publish Date Title Authors PDF Code
2025-05-04 A Comprehensive Analysis of Adversarial Attacks against Spam Filters Esra Hotoğlu et.al. 2505.03831 null
2025-05-04 Lightweight Defense Against Adversarial Attacks in Time Series Classification Yi Han et.al. 2505.02073 **[link](https://github.com/yi126/lightweight-defence)**

2025-05-03

Publish Date Title Authors PDF Code
2025-05-03 CAMOUFLAGE: Exploiting Misinformation Detection Systems Through LLM-driven Adversarial Claim Transformation Mazal Bethany et.al. 2505.01900 null
2025-05-03 Rogue Cell: Adversarial Attack and Defense in Untrusted O-RAN Setup Exploiting the Traffic Steering xApp Eran Aizikovich et.al. 2505.01816 null

2025-05-02

Publish Date Title Authors PDF Code
2025-05-02 Modeling Behavioral Preferences of Cyber Adversaries Using Inverse Reinforcement Learning Aditya Shinde et.al. 2505.03817 null
2025-05-02 Machine Learning for Cyber-Attack Identification from Traffic Flows Yujing Zhou et.al. 2505.01489 null
2025-05-02 Constrained Network Adversarial Attacks: Validity, Robustness, and Transferability Anass Grini et.al. 2505.01328 null
2025-05-02 Harmonizing Intra-coherence and Inter-divergence in Ensemble Attacks for Adversarial Transferability Zhaoyang Ma et.al. 2505.01168 null
2025-05-02 Transferable Adversarial Attacks on Black-Box Vision-Language Models Kai Hu et.al. 2505.01050 null
2025-05-02 Quantum Support Vector Regression for Robust Anomaly Detection Kilian Tscharke et.al. 2505.01012 null
2025-05-02 MARS: Defending Unmanned Aerial Vehicles From Attacks on Inertial Sensors with Model-based Anomaly Detection and Recovery Haocheng Meng et.al. 2505.00924 null
2025-05-02 Fast and Low-Cost Genomic Foundation Models via Outlier Removal Haozheng Luo et.al. 2505.00598 **[link](https://github.com/MAGICS-LAB/GERM)**

2025-05-01

Publish Date Title Authors PDF Code
2025-05-01 OET: Optimization-based prompt injection Evaluation Toolkit Jinsheng Pan et.al. 2505.00843 null
2025-05-01 Fully passive quantum random number generation with untrusted light KaiWei Qiu et.al. 2505.00636 null
2025-05-01 Analysis of the vulnerability of machine learning regression models to adversarial attacks using data from 5G wireless networks Leonid Legashev et.al. 2505.00487 null
2025-05-01 GAN-based Generator of Adversarial Attack on Intelligent End-to-End Autoencoder-based Communication System Jianyuan Chen et.al. 2505.00395 null

2025-04-30

Publish Date Title Authors PDF Code
2025-04-30 Stochastic Subspace Descent Accelerated via Bi-fidelity Line Search Nuojin Cheng et.al. 2505.00162 null
2025-04-30 Active Light Modulation to Counter Manipulation of Speech Visual Content Hadleigh Schwartz et.al. 2504.21846 null
2025-04-30 Generative AI in Financial Institution: A Global Survey of Opportunities, Threats, and Regulation Bikash Saha et.al. 2504.21574 null
2025-04-30 How to Backdoor the Knowledge Distillation Chen Wu et.al. 2504.21323 null
2025-04-30 Quantifying the Noise of Structural Perturbations on Graph Adversarial Attacks Junyuan Fang et.al. 2504.20869 null

2025-04-29

Publish Date Title Authors PDF Code
2025-04-29 AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security Zikui Cai et.al. 2504.20965 null
2025-04-29 Mitigating the Structural Bias in Graph Adversarial Defenses Junyuan Fang et.al. 2504.20848 null
2025-04-29 Learning and Generalization with Mixture Data Harsh Vardhan et.al. 2504.20651 null
2025-04-29 WILD: a new in-the-Wild Image Linkage Dataset for synthetic image attribution Pietro Bongini et.al. 2504.19595 null

2025-04-28

Publish Date Title Authors PDF Code
2025-04-28 AGATE: Stealthy Black-box Watermarking for Multimodal Model Copyright Protection Jianbo Gao et.al. 2504.21044 null
2025-04-28 Prefill-Based Jailbreak: A Novel Approach of Bypassing LLM Safety Boundary Yakai Li et.al. 2504.21038 null
2025-04-28 The Dark Side of Digital Twins: Adversarial Attacks on AI-Driven Water Forecasting Mohammadhossein Homaei et.al. 2504.20295 null
2025-04-28 A Case Study on the Use of Representativeness Bias as a Defense Against Adversarial Cyber Threats Briland Hitaj et.al. 2504.20245 null
2025-04-28 Evaluate-and-Purify: Fortifying Code Language Models Against Adversarial Attacks Using LLM-as-a-Judge Wenhan Mu et.al. 2504.19730 null
2025-04-28 Fooling the Decoder: An Adversarial Attack on Quantum Error Correction Jerome Lenssen et.al. 2504.19651 null
2025-04-28 FCGHunter: Towards Evaluating Robustness of Graph-Based Android Malware Detection Shiwen Song et.al. 2504.19456 null

2025-04-27

Publish Date Title Authors PDF Code
2025-04-27 Forging and Removing Latent-Noise Diffusion Watermarks Using a Single Image Anubhav Jain et.al. 2504.20111 null
2025-04-27 CapsFake: A Multimodal Capsule Network for Detecting Instruction-Guided Deepfakes Tuan Nguyen et.al. 2504.19212 null

2025-04-26

Publish Date Title Authors PDF Code
2025-04-26 Unveiling and Mitigating Adversarial Vulnerabilities in Iterative Optimizers Elad Sofer et.al. 2504.19000 null
2025-04-26 Test It Before You Trust It: Applying Software Testing for Trustworthy In-context Learning Teeradaj Racharak et.al. 2504.18827 null

2025-04-25

Publish Date Title Authors PDF Code
2025-04-25 Edge-Based Learning for Improved Classification Under Adversarial Noise Manish Kansana et.al. 2504.20077 null
2025-04-25 Adversarial Attacks on LLM-as-a-Judge Systems: Insights from Prompt Injections Narek Maloyan et.al. 2504.18333 null
2025-04-25 Generative AI for Physical-Layer Authentication Rui Meng et.al. 2504.18175 null

2025-04-24

Publish Date Title Authors PDF Code
2025-04-24 A Simple DropConnect Approach to Transfer-based Targeted Attack Tongrui Su et.al. 2504.18594 null
2025-04-24 Unsupervised Corpus Poisoning Attacks in Continuous Space for Dense Retrieval Yongkang Li et.al. 2504.17884 null
2025-04-24 On the Generalization of Adversarially Trained Quantum Classifiers Petros Georgiou et.al. 2504.17690 null
2025-04-24 Evaluating the Vulnerability of ML-Based Ethereum Phishing Detectors to Single-Feature Adversarial Perturbations Ahod Alghuried et.al. 2504.17684 null
2025-04-24 Evaluating Time Series Models for Urban Wastewater Management: Predictive Performance, Model Complexity and Resilience Vipin Singh et.al. 2504.17461 null
2025-04-24 Unveiling Hidden Vulnerabilities in Digital Human Generation via Adversarial Attacks Zhiying Li et.al. 2504.17457 null

2025-04-23

Publish Date Title Authors PDF Code
2025-04-23 Seeking Flat Minima over Diverse Surrogates for Improved Adversarial Transferability: A Theoretical Framework and Algorithmic Instantiation Meixi Zheng et.al. 2504.16474 null
2025-04-23 Property-Preserving Hashing for $\ell_1$ -Distance Predicates: Applications to Countering Adversarial Input Attacks Hassan Asghar et.al. 2504.16355 null
2025-04-23 aiXamine: Simplified LLM Safety and Security Fatih Deniz et.al. 2504.14985 null

2025-04-22

Publish Date Title Authors PDF Code
2025-04-22 Human-Imperceptible Physical Adversarial Attack for NIR Face Recognition Models Songyan Xie et.al. 2504.15823 null

2025-04-21

Publish Date Title Authors PDF Code
2025-04-21 Unifying Image Counterfactuals and Feature Attributions with Latent-Space Adversarial Attacks Jeremy Goldwasser et.al. 2504.15479 null
2025-04-21 MR. Guard: Multilingual Reasoning Guardrail using Curriculum Learning Yahan Yang et.al. 2504.15241 null

2025-04-20

Publish Date Title Authors PDF Code
2025-04-20 Towards Model Resistant to Transferable Adversarial Examples via Trigger Activation Yi Yu et.al. 2504.14541 null

2025-04-19

Publish Date Title Authors PDF Code
2025-04-19 Adversarial Attack for RGB-Event based Visual Object Tracking Qiang Chen et.al. 2504.14423 null
2025-04-19 Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models Chung-En et.al. 2504.14395 null
2025-04-19 Rethinking Target Label Conditioning in Adversarial Attacks: A 2D Tensor-Guided Generative Approach Hangyu Liu et.al. 2504.14137 null

2025-04-18

Publish Date Title Authors PDF Code
2025-04-18 Context-Awareness and Interpretability of Rare Occurrences for Discovery and Formalization of Critical Failure Modes Sridevi Polavaram et.al. 2504.16117 null
2025-04-18 VideoPASTA: 7K Preference Pairs That Matter for Video-LLM Alignment Yogesh Kulkarni et.al. 2504.14096 null
2025-04-18 Fairness and Robustness in Machine Unlearning Khoa Tran et.al. 2504.13610 null
2025-04-18 Q-FAKER: Query-free Hard Black-box Attack via Controlled Generation CheolWon Na et.al. 2504.13551 null

2025-04-17

Publish Date Title Authors PDF Code
2025-04-17 DYNAMITE: Dynamic Defense Selection for Enhancing Machine Learning-based Intrusion Detection Against Adversarial Attacks Jing Chen et.al. 2504.13301 null
2025-04-17 Quantum Computing Supported Adversarial Attack-Resilient Autonomous Vehicle Perception Module for Traffic Sign Classification Reek Majumder et.al. 2504.12644 **[link](https://github.com/reek129/QuantumAI_MultiClass_Sign_Recognition)**

2025-04-16

Publish Date Title Authors PDF Code
2025-04-16 Human Aligned Compression for Robust Models Samuel Räber et.al. 2504.12255 **[link](https://github.com/aplesner/Human-aligned-compression-for-robust-models)**
2025-04-16 SemDiff: Generating Natural Unrestricted Adversarial Examples via Semantic Attributes Optimization in Diffusion Models Zeyu Dai et.al. 2504.11923 null
2025-04-16 Support is All You Need for Certified VAE Training Changming Xu et.al. 2504.11831 null
2025-04-16 Towards Safe Synthetic Image Generation On the Web: A Multimodal Robust NSFW Defense and Million Scale Dataset Muhammad Shahid Muneer et.al. 2504.11707 **[link](https://github.com/shahidmuneer/multimodal-nsfw-defense)**

2025-04-15

Publish Date Title Authors PDF Code
2025-04-15 R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning Lijun Sheng et.al. 2504.11195 **[link](https://github.com/tomsheng21/r-tpt)**
2025-04-15 Exploring Backdoor Attack and Defense for LLM-empowered Recommendations Liangbo Ning et.al. 2504.11182 null
2025-04-15 Token-Level Constraint Boundary Search for Jailbreaking Text-to-Image Models Jiangtao Liu et.al. 2504.11106 null
2025-04-15 QAVA: Query-Agnostic Visual Attack to Large Vision-Language Models Yudong Zhang et.al. 2504.11038 **[link](https://github.com/btzyd/qava)**
2025-04-15 Defending Against Frequency-Based Attacks with Diffusion Models Fatemeh Amerehi et.al. 2504.11034 null
2025-04-15 Towards Spatially-Aware and Optimally Faithful Concept-Based Explanations Shubham Kumar et.al. 2504.10833 null
2025-04-15 The Sword of Damocles in ViTs: Computational Redundancy Amplifies Adversarial Transferability Jiani Liu et.al. 2504.10804 null

2025-04-14

Publish Date Title Authors PDF Code
2025-04-14 Investigating cybersecurity incidents using large language models in latest-generation wireless networks Leonid Legashev et.al. 2504.13196 null
2025-04-14 Demo: ViolentUTF as An Accessible Platform for Generative AI Red Teaming Tam n. Nguyen et.al. 2504.10603 null
2025-04-14 Quantifying Privacy Leakage in Split Inference via Fisher-Approximated Shannon Information Analysis Ruijun Deng et.al. 2504.10016 null
2025-04-14 An Investigation of Large Language Models and Their Vulnerabilities in Spam Detection Qiyao Tang et.al. 2504.09776 null

2025-04-13

Publish Date Title Authors PDF Code
2025-04-13 Bregman Linearized Augmented Lagrangian Method for Nonconvex Constrained Stochastic Zeroth-order Optimization Qiankun Shi et.al. 2504.09409 null

2025-04-12

Publish Date Title Authors PDF Code
2025-04-12 PapMOT: Exploring Adversarial Patch Attack against Multiple Object Tracking Jiahuan Long et.al. 2504.09361 null
2025-04-12 Classical Autoencoder Distillation of Quantum Adversarial Manipulations Amena Khatun et.al. 2504.09216 null
2025-04-12 From Visual Explanations to Counterfactual Explanations with Latent Diffusion Tung Luu et.al. 2504.09202 null

2025-04-11

Publish Date Title Authors PDF Code
2025-04-11 Robust SAM: On the Adversarial Robustness of Vision Foundation Models Jiahuan Long et.al. 2504.08906 null
2025-04-11 Toward Spiking Neural Network Local Learning Modules Resistant to Adversarial Attacks Jiaqi Lin et.al. 2504.08897 null
2025-04-11 On Transfer-based Universal Attacks in Pure Black-box Setting Mohammad A. A. K. Jalwana et.al. 2504.08866 null
2025-04-11 X-Guard: Multilingual Guard Agent for Content Moderation Bibek Upadhayay et.al. 2504.08848 null
2025-04-11 Enabling Safety for Aerial Robots: Planning and Control Architectures Kaleb Ben Naveed et.al. 2504.08601 null
2025-04-11 Toward Realistic Adversarial Attacks in IDS: A Novel Feasibility Metric for Transferability Sabrine Ennaji et.al. 2504.08480 null
2025-04-11 Adversarial Examples in Environment Perception for Automated Driving (Review) Jun Yan et.al. 2504.08414 null
2025-04-11 Adversarial Training of Reward Models Alexander Bukharin et.al. 2504.06141 null

2025-04-10

Publish Date Title Authors PDF Code
2025-04-10 Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge Riccardo Cantini et.al. 2504.07887 **[link](https://github.com/SCAlabUnical/CLEAR-Bias_LLM_benchmark)**

2025-04-09

Publish Date Title Authors PDF Code
2025-04-09 SafeMLRM: Demystifying Safety in Multi-modal Large Reasoning Models Junfeng Fang et.al. 2504.08813 null
2025-04-09 Code Generation with Small Language Models: A Deep Evaluation on Codeforces Débora Souza et.al. 2504.07343 null

2025-04-08

Publish Date Title Authors PDF Code
2025-04-08 Exploring Gradient-Guided Masked Language Model to Detect Textual Adversarial Attacks Xiaomei Zhang et.al. 2504.08798 null
2025-04-08 Exploiting Meta-Learning-based Poisoning Attacks for Graph Link Prediction Mingchen Li et.al. 2504.06492 null
2025-04-08 Towards Calibration Enhanced Network by Inverse Adversarial Attack Yupeng Cheng et.al. 2504.06358 null
2025-04-08 Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking Junxi Chen et.al. 2504.05838 **[link](https://github.com/fhdnskfbeuv/attackipa)**
2025-04-08 Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing Tianchi Liu et.al. 2504.05657 **[link](https://github.com/liu-tianchi/nes2net)**

2025-04-07

Publish Date Title Authors PDF Code
2025-04-07 SINCon: Mitigate LLM-Generated Malicious Message Injection Attack for Rumor Detection Mingqing Zhang et.al. 2504.07135 null
2025-04-07 Secure Diagnostics: Adversarial Robustness Meets Clinical Interpretability Mohammad Hossein Najafi et.al. 2504.05483 null
2025-04-07 Adversarial KA Sviatoslav Dzhenzher et.al. 2504.05255 null
2025-04-07 Security Risks in Vision-Based Beam Prediction: From Spatial Proxy Attacks to Feature Refinement Avi Deb Raha et.al. 2504.05222 null
2025-04-07 Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models Yoojin Jung et.al. 2504.04747 null
2025-04-07 On the Robustness of GUI Grounding Models Against Image Attacks Haoren Zhao et.al. 2504.04716 null
2025-04-07 Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models Zhaochen Wang et.al. 2504.01589 null

2025-04-06

Publish Date Title Authors PDF Code
2025-04-06 Systematic Literature Review on Vehicular Collaborative Perception -- A Computer Vision Perspective Lei Wan et.al. 2504.04631 null
2025-04-06 Selective Masking Adversarial Attack on Automatic Speech Recognition Systems Zheng Fang et.al. 2504.04394 null
2025-04-06 WeiDetect: Weibull Distribution-Based Defense against Poisoning Attacks in Federated Learning for Network Intrusion Detection Systems Sameera K. M. et.al. 2504.04367 null

2025-04-03

Publish Date Title Authors PDF Code
2025-04-03 SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections Prashant Kumar et.al. 2504.03089 null
2025-04-03 Moving Target Defense Against Adversarial False Data Injection Attacks In Power Grids Yexiang Chen et.al. 2504.03065 null
2025-04-03 ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization Kehua Feng et.al. 2504.02725 null
2025-04-03 Evaluating and Enhancing Segmentation Model Robustness with Metamorphic Testing Seif Mzoughi et.al. 2504.02335 null
2025-04-03 Robust Unsupervised Domain Adaptation for 3D Point Cloud Segmentation Under Source Adversarial Attacks Haosheng Li et.al. 2504.01659 null
2025-04-03 No Free Lunch with Guardrails Divyanshu Kumar et.al. 2504.00441 null

2025-04-02

Publish Date Title Authors PDF Code
2025-04-02 Watermarking for AI Content Detection: A Review on Text, Visual, and Audio Modalities Lele Cao et.al. 2504.03765 null
2025-04-02 AdPO: Enhancing the Adversarial Robustness of Large Vision-Language Models with Preference Optimization Chaohu Liu et.al. 2504.01735 null
2025-04-02 Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation Junjie Chen et.al. 2504.01668 null
2025-04-02 Representation Bending for Large Language Model Safety Ashkan Yousefpour et.al. 2504.01550 null
2025-04-02 Leveraging Generalizability of Image-to-Image Translation for Enhanced Adversarial Defense Haibo Zhang et.al. 2504.01399 null
2025-04-02 Breaking BERT: Gradient Attack on Twitter Sentiment Analysis for Targeted Misclassification Akil Raj Subedi et.al. 2504.01345 **[link](https://github.com/akil003/bert-attack)**

2025-04-01

Publish Date Title Authors PDF Code
2025-04-01 Towards Resilient Federated Learning in CyberEdge Networks: Recent Advances and Future Trends Kai Li et.al. 2504.01240 null
2025-04-01 TenAd: A Tensor-based Low-rank Black Box Adversarial Attack for Video Classification Kimia haghjooei et.al. 2504.01228 null
2025-04-01 S3C2 Summit 2024-08: Government Secure Supply Chain Summit Courtney Miller et.al. 2504.00924 null
2025-04-01 Whispering Under the Eaves: Protecting User Privacy Against Commercial and LLM-powered Automatic Speech Recognition Systems Weifei Jin et.al. 2504.00858 **[link](https://github.com/WeifeiJin/AudioShield)**
2025-04-01 Alleviating Performance Disparity in Adversarial Spatiotemporal Graph Learning Under Zero-Inflated Distribution Songran Bai et.al. 2504.00721 null
2025-04-01 Impact of Data Duplication on Deep Neural Network-Based Image Classifiers: Robust vs. Standard Models Alireza Aghabagherloo et.al. 2504.00638 null
2025-04-01 Unleashing the Power of Pre-trained Encoders for Universal Adversarial Attack Detection Yinghe Zhang et.al. 2504.00429 null
2025-04-01 A Survey on Unlearnable Data Jiahao Li et.al. 2503.23536 **[link](https://github.com/LiJiahao-Alex/Awesome-UnLearnable-Data)**

2025-03-31

Publish Date Title Authors PDF Code
2025-03-31 System Identification from Partial Observations under Adversarial Attacks Jihun Kim et.al. 2504.00244 null
2025-03-31 Towards Benchmarking and Assessing the Safety and Robustness of Autonomous Driving on Safety-critical Scenarios Jingzheng Li et.al. 2503.23708 null

2025-03-30

Publish Date Title Authors PDF Code
2025-03-30 Towards Trustworthy GUI Agents: A Survey Yucheng Shi et.al. 2503.23434 **[link](https://github.com/sycny/Awesome-Trustworthy-GUI-Agents)**

2025-03-28

Publish Date Title Authors PDF Code
2025-03-28 Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models YangTian Yan et.al. 2503.22205 **[link](https://github.com/yyt0718/Intri_Attack)**

2025-03-27

Publish Date Title Authors PDF Code
2025-03-27 Learning to Lie: Reinforcement Learning Attacks Damage Human-AI Teams and Teams of LLMs Abed Kareem Musaffar et.al. 2503.21983 null
2025-03-27 Adversarial Wear and Tear: Exploiting Natural Damage for Generating Physical-World Adversarial Examples Samra Irshad et.al. 2503.21164 null

2025-03-26

Publish Date Title Authors PDF Code
2025-03-26 Robust Deep Reinforcement Learning in Robotics via Adaptive Gradient-Masked Adversarial Attacks Zongyuan Zhang et.al. 2503.20844 null
2025-03-26 State-Aware Perturbation Optimization for Robust Deep Reinforcement Learning Zongyuan Zhang et.al. 2503.20613 null
2025-03-26 Feature Statistics with Uncertainty Help Adversarial Robustness Ran Wang et.al. 2503.20583 **[link](https://github.com/techtrekkerz/fsu)**
2025-03-26 Lipschitz Constant Meets Condition Number: Learning Robust and Compact Deep Neural Networks Yangqi Feng et.al. 2503.20454 null
2025-03-26 Enabling Heterogeneous Adversarial Transferability via Feature Permutation Attacks Tao Wu et.al. 2503.20310 null
2025-03-26 Are We There Yet? Unraveling the State-of-the-Art Graph Network Intrusion Detection Systems Chenglong Wang et.al. 2503.20281 null
2025-03-26 How Secure is Forgetting? Linking Machine Unlearning to Machine Learning Attacks Muhammed Shafi K. P. et.al. 2503.20257 null
2025-03-26 Hi-ALPS -- An Experimental Robustness Quantification of Six LiDAR-based Object Detection Systems for Autonomous Driving Alexandra Arzberger et.al. 2503.17168 null

2025-03-25

Publish Date Title Authors PDF Code
2025-03-25 Bitstream Collisions in Neural Image Compression via Adversarial Perturbations Jordan Madden et.al. 2503.19817 **[link](https://github.com/neddamj/nicsec)**
2025-03-25 SITA: Structurally Imperceptible and Transferable Adversarial Attacks for Stylized Image Generation Jingdan Kang et.al. 2503.19791 **[link](https://github.com/a-raniy-day/sita)**
2025-03-25 Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization Weifei Jin et.al. 2503.19591 null
2025-03-25 Towards Imperceptible Adversarial Attacks for Time Series Classification with Local Perturbations and Frequency Analysis Wenwei Gu et.al. 2503.19519 null
2025-03-25 Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent Philip Doldo et.al. 2503.19347 null
2025-03-25 Efficient Adversarial Detection Frameworks for Vehicle-to-Microgrid Services in Edge Computing Ahmed Omara et.al. 2503.19318 null
2025-03-25 Robustness of Proof of Team Sprint (PoTS) Against Attacks: A Simulation-Based Analysis Naoki Yonezawa et.al. 2503.19293 null

2025-03-24

Publish Date Title Authors PDF Code
2025-03-24 CEFW: A Comprehensive Evaluation Framework for Watermark in Large Language Models Shuhao Zhang et.al. 2503.20802 **[link](https://github.com/drankxs/balancedwatermark)**

2025-03-22

Publish Date Title Authors PDF Code
2025-03-22 Safe RLHF-V: Safe Reinforcement Learning from Human Feedback in Multimodal Large Language Models Jiaming Ji et.al. 2503.17682 null

2025-03-21

Publish Date Title Authors PDF Code
2025-03-21 Debugging and Runtime Analysis of Neural Networks with VLMs (A Case Study) Boyue Caroline Hu et.al. 2503.17416 null
2025-03-21 Robustness of deep learning classification to adversarial input on GPUs: asynchronous parallel accumulation is a source of vulnerability Sanjif Shanmugavelu et.al. 2503.17173 null
2025-03-21 EasyRobust: A Comprehensive and Easy-to-use Toolkit for Robust and Generalized Vision Xiaofeng Mao et.al. 2503.16975 null

2025-03-20

Publish Date Title Authors PDF Code
2025-03-20 REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models Jie Zhang et.al. 2503.16566 null
2025-03-20 Narrowing Class-Wise Robustness Gaps in Adversarial Training Fatemeh Amerehi et.al. 2503.16179 null
2025-03-20 SAUCE: Selective Concept Unlearning in Vision-Language Models with Sparse Autoencoders Qing Li et.al. 2503.14530 null

2025-03-19

Publish Date Title Authors PDF Code
2025-03-19 Improving Adversarial Transferability on Vision Transformers via Forward Propagation Refinement Yuchen Ren et.al. 2503.15404 **[link](https://github.com/ryc-98/fpr)**
2025-03-19 Improving Generalization of Universal Adversarial Perturbation via Dynamic Maximin Optimization Yechao Zhang et.al. 2503.12793 **[link](https://github.com/yechao-zhang/dm-uap)**

2025-03-18

Publish Date Title Authors PDF Code
2025-03-18 Unveiling the Role of Randomization in Multiclass Adversarial Classification: Insights from Graph Theory Lucas Gnecco-Heredia et.al. 2503.14299 null
2025-03-18 Survey of Adversarial Robustness in Multimodal Large Language Models Chengze Jiang et.al. 2503.13962 null
2025-03-18 Make the Most of Everything: Further Considerations on Disrupting Diffusion-based Customization Long Tang et.al. 2503.13945 null
2025-03-18 Evolution-based Region Adversarial Prompt Learning for Robustness Enhancement in Vision-Language Models Xiaojun Jia et.al. 2503.12874 null
2025-03-18 GSBA $^K$: $top$-$K$ Geometric Score-based Black-box Attack Md Farhamdur Reza et.al. 2503.12827 null

2025-03-17

Publish Date Title Authors PDF Code
2025-03-17 Securing Virtual Reality Experiences: Unveiling and Tackling Cybersickness Attacks with Explainable AI Ripan Kumar Kundu et.al. 2503.13419 null
2025-03-17 How Good is my Histopathology Vision-Language Foundation Model? A Holistic Benchmark Roba Al Majzoub et.al. 2503.12990 **[link](https://github.com/musk007/Histopathology_Benchmark)**

2025-03-16

Publish Date Title Authors PDF Code
2025-03-16 Towards Privacy-Preserving Data-Driven Education: The Potential of Federated Learning Mohammad Khalil et.al. 2503.13550 null
2025-03-16 Algebraic Adversarial Attacks on Explainability Models Lachlan Simpson et.al. 2503.12683 null
2025-03-16 GAN-Based Single-Stage Defense for Traffic Sign Classification Under Adversarial Patch Attack Abyad Enan et.al. 2503.12567 null
2025-03-16 Augmented Adversarial Trigger Learning Zhe Wang et.al. 2503.12339 null

2025-03-15

Publish Date Title Authors PDF Code
2025-03-15 Robust Dataset Distillation by Matching Adversarial Trajectories Wei Lai et.al. 2503.12069 null

Poisoning attacks

2025-06-26

Publish Date Title Authors PDF Code
2025-06-26 E-FreeM2: Efficient Training-Free Multi-Scale and Cross-Modal News Verification via MLLMs Van-Hoang Phan et.al. 2506.20944 null
2025-06-26 SPA: Towards More Stealth and Persistent Backdoor Attacks in Federated Learning Chengcheng Zhu et.al. 2506.20931 null

2025-06-25

Publish Date Title Authors PDF Code
2025-06-25 Client Clustering Meets Knowledge Sharing: Enhancing Privacy and Robustness in Personalized Peer-to-Peer Learning Mohammad Mahdi Maheri et.al. 2506.20413 null
2025-06-25 Don't Hash Me Like That: Exposing and Mitigating Hash-Induced Unfairness in Local Differential Privacy Berkay Kemal Balioglu et.al. 2506.20290 null
2025-06-25 Screen Hijack: Visual Poisoning of VLM Agents in Mobile Environments Xuan Wang et.al. 2506.13205 null

2025-06-24

Publish Date Title Authors PDF Code
2025-06-24 Identifying Physically Realizable Triggers for Backdoored Face Recognition Networks Ankita Raj et.al. 2506.19533 null

2025-06-22

Publish Date Title Authors PDF Code
2025-06-22 Generalization under Byzantine & Poisoning Attacks: Tight Stability Bounds in Robust Distributed Learning Thomas Boudou et.al. 2506.18020 null

2025-06-20

Publish Date Title Authors PDF Code
2025-06-20 CUBA: Controlled Untargeted Backdoor Attack against Deep Neural Networks Yinghao Wu et.al. 2506.17350 null

2025-06-19

Publish Date Title Authors PDF Code
2025-06-19 SecureFed: A Two-Phase Framework for Detecting Malicious Clients in Federated Learning Likhitha Annapurna Kavuri et.al. 2506.16458 null
2025-06-19 Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models Biao Yi et.al. 2506.16447 null

2025-06-18

Publish Date Title Authors PDF Code
2025-06-18 PDLRecover: Privacy-preserving Decentralized Model Recovery with Machine Unlearning Xiangman Li et.al. 2506.15112 null

2025-06-17

Publish Date Title Authors PDF Code
2025-06-17 Winter Soldier: Backdooring Language Models at Pre-Training with Indirect Data Poisoning Wassim Bouaziz et.al. 2506.14913 null

2025-06-16

Publish Date Title Authors PDF Code
2025-06-16 EBS-CFL: Efficient and Byzantine-robust Secure Clustered Federated Learning Zhiqiang Li et.al. 2506.13612 **[link](https://github.com/lee-va/ebs-cfl)**
2025-06-16 Unlearning-Enhanced Website Fingerprinting Attack: Against Backdoor Poisoning in Anonymous Networks Yali Yuan et.al. 2506.13563 null
2025-06-16 Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs Houcheng Jiang et.al. 2506.13285 null
2025-06-16 Detecting Hard-Coded Credentials in Software Repositories via LLMs Chidera Biringa et.al. 2506.13090 null
2025-06-16 Data Shifts Hurt CoT: A Theoretical Study Lang Yin et.al. 2506.10647 null

2025-06-15

Publish Date Title Authors PDF Code
2025-06-15 TrojanTO: Action-Level Backdoor Attacks against Trajectory Optimization Models Yang Dai et.al. 2506.12815 null

2025-06-14

Publish Date Title Authors PDF Code
2025-06-14 When Forgetting Triggers Backdoors: A Clean Unlearning Attack Marco Arazzi et.al. 2506.12522 null
2025-06-14 InverTune: Removing Backdoors from Multimodal Contrastive Learning Models via Trigger Inversion and Activation Tuning Mengyuan Sun et.al. 2506.12411 null

2025-06-13

Publish Date Title Authors PDF Code
2025-06-13 Investigating Vulnerabilities and Defenses Against Audio-Visual Attacks: A Comprehensive Survey Emphasizing Multimodal Models Jinming Wen et.al. 2506.11521 null
2025-06-13 Bias Amplification in RAG: Poisoning Knowledge Retrieval to Steer LLMs Linlin Wang et.al. 2506.11415 null

2025-06-12

Publish Date Title Authors PDF Code
2025-06-12 Collapsing Sequence-Level Data-Policy Coverage via Poisoning Attack in Offline Reinforcement Learning Xue Zhou et.al. 2506.11172 null
2025-06-12 ME: Trigger Element Combination Backdoor Attack on Copyright Infringement Feiyu Yang et.al. 2506.10776 null
2025-06-12 TED-LaST: Towards Robust Backdoor Defense Against Adaptive Attacks Xiaoxing Mo et.al. 2506.10722 null
2025-06-12 TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning Songze Li et.al. 2506.09562 null

2025-06-11

Publish Date Title Authors PDF Code
2025-06-11 FedMLAC: Mutual Learning Driven Heterogeneous Federated Audio Classification Jun Bai et.al. 2506.10207 null
2025-06-11 Devil's Hand: Data Poisoning Attacks to Locally Private Graph Learning Protocols Longzhu He et.al. 2506.09803 null
2025-06-11 Evasion Attacks Against Bayesian Predictive Models Pablo G. Arce et.al. 2506.09640 **[link](https://github.com/pablogarciarce/advreg)**
2025-06-11 Your Agent Can Defend Itself against Backdoor Attacks Li Changjiang et.al. 2506.08336 null

2025-06-10

Publish Date Title Authors PDF Code
2025-06-10 WGLE:Backdoor-free and Multi-bit Black-box Watermarking for Graph Neural Networks Tingzhi Li et.al. 2506.08602 null
2025-06-10 Single-Node Trigger Backdoor Attacks in Graph-Based Recommendation Systems Runze Li et.al. 2506.08401 null
2025-06-10 SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models Wenhan Yao et.al. 2506.08346 null

2025-06-09

Publish Date Title Authors PDF Code
2025-06-09 Circumventing Backdoor Space via Weight Symmetry Jie Peng et.al. 2506.07467 **[link](https://github.com/jiepeng104/tsc)**

2025-06-08

Publish Date Title Authors PDF Code
2025-06-08 Backdoor Attack on Vision Language Models with Stealthy Semantic Manipulation Zhiyuan Zhong et.al. 2506.07214 null

2025-06-07

Publish Date Title Authors PDF Code
2025-06-07 Can In-Context Reinforcement Learning Recover From Reward Poisoning Attacks? Paulius Sasnauskas et.al. 2506.06891 null
2025-06-07 Rescaled Influence Functions: Accurate Data Attribution in High Dimension Ittai Rubinstein et.al. 2506.06656 null

2025-06-06

Publish Date Title Authors PDF Code
2025-06-06 Securing Traffic Sign Recognition Systems in Autonomous Vehicles Thushari Hapuarachchi et.al. 2506.06563 null
2025-06-06 A Systematic Review of Poisoning Attacks Against Large Language Models Neil Fendley et.al. 2506.06518 null
2025-06-06 Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems Haowei Wang et.al. 2506.06151 null
2025-06-06 SATversary: Adversarial Attacks on Satellite Fingerprinting Joshua Smailes et.al. 2506.06119 null
2025-06-06 What Really is a Member? Discrediting Membership Inference via Poisoning Neal Mangaokar et.al. 2506.06003 null

2025-06-05

Publish Date Title Authors PDF Code
2025-06-05 Invisible Backdoor Triggers in Image Editing Model via Deep Watermarking Yu-Feng Chen et.al. 2506.04879 null
2025-06-05 SRD: Reinforcement-Learned Semantic Perturbation for Backdoor Defense in VLMs Shuhan Xu et.al. 2506.04743 null
2025-06-05 Beyond the Protocol: Unveiling Attack Vectors in the Model Context Protocol Ecosystem Hao Song et.al. 2506.02040 null

2025-06-04

Publish Date Title Authors PDF Code
2025-06-04 Robust Anti-Backdoor Instruction Tuning in LVLMs Yuan Xun et.al. 2506.05401 null
2025-06-04 VLMs Can Aggregate Scattered Training Patches Zhanhui Zhou et.al. 2506.03614 **[link](https://github.com/zhziszz/visual-stitching)**
2025-06-04 DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors Yize Cheng et.al. 2505.23001 **[link](https://github.com/chengez/DyePack)**

2025-06-03

Publish Date Title Authors PDF Code
2025-06-03 BadReward: Clean-Label Poisoning of Reward Models in Text-to-Image RLHF Kaiwen Duan et.al. 2506.03234 null
2025-06-03 Agnostic Learning under Targeted Poisoning: Optimal Rates and the Role of Randomness Bogdan Chornomaz et.al. 2506.03075 null

2025-06-02

Publish Date Title Authors PDF Code
2025-06-02 Mitigating Data Poisoning Attacks to Local Differential Privacy Xiaolin Li et.al. 2506.02156 null
2025-06-02 Which Factors Make Code LLMs More Vulnerable to Backdoor Attacks? A Systematic Study Chenyu Wang et.al. 2506.01825 null
2025-06-02 Variance-Based Defense Against Blended Backdoor Attacks Sujeevan Aseervatham et.al. 2506.01444 null

2025-05-31

Publish Date Title Authors PDF Code
2025-05-31 Security Concerns for Large Language Models: A Survey Miles Q. Li et.al. 2505.18889 null

2025-05-30

Publish Date Title Authors PDF Code
2025-05-30 Adversarial Threat Vectors and Risk Mitigation for Retrieval-Augmented Generation Systems Chris M. Ward et.al. 2506.00281 null
2025-05-30 Heterogeneous Graph Backdoor Attack Jiawei Chen et.al. 2506.00191 null
2025-05-30 Cascading Adversarial Bias from Injection to Distillation in Language Models Harsh Chaudhari et.al. 2505.24842 null

2025-05-29

Publish Date Title Authors PDF Code
2025-05-29 Distributed Federated Learning for Vehicular Network Security: Anomaly Detection Benefits and Multi-Domain Attack Threats Utku Demir et.al. 2505.23706 null
2025-05-29 Merge Hijacking: Backdoor Attacks to Model Merging of Large Language Models Zenghui Yuan et.al. 2505.23561 null
2025-05-29 Performance Guaranteed Poisoning Attacks in Federated Learning: A Sliding Mode Approach Huazi Pan et.al. 2505.16403 null

2025-05-28

Publish Date Title Authors PDF Code
2025-05-28 Spa-VLM: Stealthy Poisoning Attacks on RAG-based VLM Lei Yu et.al. 2505.23828 null
2025-05-28 Wolf Hidden in Sheep's Conversations: Toward Harmless Data-Based Backdoor Attacks for Jailbreaking Large Language Models Jiawei Kong et.al. 2505.17601 null

2025-05-27

Publish Date Title Authors PDF Code
2025-05-27 GUARD:Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation Naizhu Jin et.al. 2505.21425 null
2025-05-27 HeteroBA: A Structure-Manipulating Backdoor Attack on Heterogeneous Graphs Honglin Gao et.al. 2505.21140 null
2025-05-27 Multi-level Certified Defense Against Poisoning Attacks in Offline Reinforcement Learning Shijie Liu et.al. 2505.20621 null
2025-05-27 Backdoors in DRL: Four Environments Focusing on In-distribution Triggers Chace Ashcraft et.al. 2505.17248 null

2025-05-26

Publish Date Title Authors PDF Code
2025-05-26 CPA-RAG:Covert Poisoning Attacks on Retrieval-Augmented Generation in Large Language Models Chunyang Li et.al. 2505.19864 null
2025-05-26 Poison in the Well: Feature Embedding Disruption in Backdoor Attacks Zhou Feng et.al. 2505.19821 null
2025-05-26 Fox in the Henhouse: Supply-Chain Backdoor Attacks Against Reinforcement Learning Shijie Liu et.al. 2505.19532 null
2025-05-26 Are Time-Series Foundation Models Deployment-Ready? A Systematic Study of Adversarial Robustness Across Domains Jiawen Zhang et.al. 2505.19397 null

2025-05-24

Publish Date Title Authors PDF Code
2025-05-24 Benchmarking Poisoning Attacks against Retrieval-Augmented Generation Baolei Zhang et.al. 2505.18543 null

2025-05-23

Publish Date Title Authors PDF Code
2025-05-23 Ranking Free RAG: Replacing Re-ranking with Selection in RAG for Sensitive Domains Yash Saxena et.al. 2505.16014 null
2025-05-23 A Linear Approach to Data Poisoning Diego Granziol et.al. 2505.15175 null
2025-05-23 Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-Powered Mobile GUI Agents Pengzhou Cheng et.al. 2505.14418 null

2025-05-22

Publish Date Title Authors PDF Code
2025-05-22 BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization Xueyang Zhou et.al. 2505.16640 null
2025-05-22 Chain-of-Thought Poisoning Attacks against R1-based Retrieval-Augmented Generation Systems Hongru Song et.al. 2505.16367 null
2025-05-22 BadDepth: Backdoor Attacks Against Monocular Depth Estimation in the Physical World Ji Guo et.al. 2505.16154 null
2025-05-22 PoisonArena: Uncovering Competing Poisoning Attacks in Retrieval-Augmented Generation Liuji Chen et.al. 2505.12574 **[link](https://github.com/yxf203/poisonarena)**

2025-05-21

Publish Date Title Authors PDF Code
2025-05-21 BadSR: Stealthy Label Backdoor Attacks on Image Super-Resolution Ji Guo et.al. 2505.15308 null

2025-05-20

Publish Date Title Authors PDF Code
2025-05-20 SifterNet: A Generalized and Model-Agnostic Trigger Purification Approach Shaoye Luo et.al. 2505.14531 null
2025-05-20 Capturing the Effects of Quantization on Trojans in Code LLMs Aftab Hussain et.al. 2505.14200 null
2025-05-20 SVAFD: A Secure and Verifiable Co-Aggregation Protocol for Federated Distillation Tian Wen et.al. 2505.13319 null
2025-05-20 One Shot Dominance: Knowledge Poisoning Attack on Retrieval-Augmented Generation Systems Zhiyuan Chang et.al. 2505.11548 null

2025-05-19

Publish Date Title Authors PDF Code
2025-05-19 Does Low Rank Adaptation Lead to Lower Robustness against Training-Time Attacks? Zi Liang et.al. 2505.12871 **[link](https://github.com/liangzid/lora-ssecurity)**

2025-05-17

Publish Date Title Authors PDF Code
2025-05-17 FIGhost: Fluorescent Ink-based Stealthy and Flexible Backdoor Attacks on Physical Traffic Sign Recognition Shuai Yuan et.al. 2505.12045 null
2025-05-17 FL-PLAS: Federated Learning with Partial Layer Aggregation for Backdoor Defense Against High-Ratio Malicious Clients Jianyi Zhang et.al. 2505.12019 **[link](https://github.com/besticsp/fl-plas)**

2025-05-16

Publish Date Title Authors PDF Code
2025-05-16 PeerGuard: Defending Multi-Agent Systems Against Backdoor Attacks Through Mutual Reasoning Falong Fan et.al. 2505.11642 **[link](https://github.com/anonymoususertech/defensecot)**
2025-05-16 The Ripple Effect: On Unforeseen Complications of Backdoor Attacks Rui Zhang et.al. 2505.11586 **[link](https://github.com/zhangrui4041/backdoor_complications)**
2025-05-16 Towards Robust Spiking Neural Networks:Mitigating Heterogeneous Training Vulnerability via Dominant Eigencomponent Projection Desong Zhang et.al. 2505.11134 null
2025-05-16 Task Reconstruction and Extrapolation for $π_0$ using Text Latent Quanyi Li et.al. 2505.03500 null

2025-05-15

Publish Date Title Authors PDF Code
2025-05-15 Defending the Edge: Representative-Attention for Mitigating Backdoor Attacks in Federated Learning Chibueze Peace Obioma et.al. 2505.10297 null
2025-05-15 Sybil-based Virtual Data Poisoning Attacks in Federated Learning Changxun Zhu et.al. 2505.09983 null
2025-05-15 Model-Targeted Data Poisoning Attacks against ITS Applications with Provable Convergence Xin Wang et.al. 2505.03966 null

2025-05-14

Publish Date Title Authors PDF Code
2025-05-14 Toward Malicious Clients Detection in Federated Learning Zhihao Dou et.al. 2505.09110 null

2025-05-13

Publish Date Title Authors PDF Code
2025-05-13 MUBox: A Critical Evaluation Framework of Deep Machine Unlearning Xiang Li et.al. 2505.08576 null

2025-05-12

Publish Date Title Authors PDF Code
2025-05-12 MixBridge: Heterogeneous Image-to-Image Backdoor Attack through Mixture of Schrödinger Bridges Shixi Qin et.al. 2505.08809 **[link](https://github.com/qsx830/mixbridge)**

2025-05-10

Publish Date Title Authors PDF Code
2025-05-10 POISONCRAFT: Practical Poisoning of Retrieval-Augmented Generation for Large Language Models Yangguang Shao et.al. 2505.06579 **[link](https://github.com/andyshaw01/poisoncraft)**

2025-05-09

Publish Date Title Authors PDF Code
2025-05-09 Natural Reflection Backdoor Attack on Vision Language Model for Autonomous Driving Ming Liu et.al. 2505.06413 null
2025-05-09 Sparsification Under Siege: Defending Against Poisoning Attacks in Communication-Efficient Federated Learning Zhiyong Jin et.al. 2505.01454 null

2025-05-08

Publish Date Title Authors PDF Code
2025-05-08 KPI Poisoning: An Attack in Open RAN Near Real-Time Control Loop Hamed Alimohammadi et.al. 2505.05537 null
2025-05-08 MTL-UE: Learning to Learn Nothing for Multi-Task Learning Yi Yu et.al. 2505.05279 null
2025-05-08 Stealthy LLM-Driven Data Poisoning Attacks Against Embedding-Based Retrieval-Augmented Recommender Systems Fatemeh Nazary et.al. 2505.05196 null

2025-05-06

Publish Date Title Authors PDF Code
2025-05-06 BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models Zihan Wang et.al. 2505.03501 null
2025-05-06 Mitigating Backdoor Triggered and Targeted Data Poisoning Attacks in Voice Authentication Systems Alireza Mohammadi et.al. 2505.03455 null
2025-05-06 Framework GNN-AID: Graph Neural Network Analysis Interpretation and Defense Kirill Lukyanov et.al. 2505.03424 **[link](https://github.com/ispras/gnn-aid)**
2025-05-06 A Chaos Driven Metric for Backdoor Attack Detection Hema Karnam Surendrababu et.al. 2505.03208 null
2025-05-06 Adversarial Robustness of Deep Learning Models for Inland Water Body Segmentation from SAR Images Siddharth Kothari et.al. 2505.01884 **[link](https://github.com/gvcl/iwseg-sar-poison)**

2025-05-04

Publish Date Title Authors PDF Code
2025-05-04 Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents Christian Schroeder de Witt et.al. 2505.02077 null

2025-05-03

Publish Date Title Authors PDF Code
2025-05-03 Backdoor Attacks Against Patch-based Mixture of Experts Cedric Chan et.al. 2505.01811 **[link](https://github.com/geefmegeld/pmoe-backdoor)**

2025-05-02

Publish Date Title Authors PDF Code
2025-05-02 Explainable AI Based Diagnosis of Poisoning Attacks in Evolutionary Swarms Mehrdad Asadi et.al. 2505.01181 null

2025-05-01

Publish Date Title Authors PDF Code
2025-05-01 Protocol-agnostic and Data-free Backdoor Attacks on Pre-trained Models in RF Fingerprinting Tianya Zhao et.al. 2505.00881 **[link](https://github.com/Tianyaz97/rf_backdoor)**

2025-04-30

Publish Date Title Authors PDF Code
2025-04-30 Cert-SSB: Toward Certified Sample-Specific Backdoor Defense Ting Qiao et.al. 2504.21730 **[link](https://github.com/ncepuqiaoting/cert-ssb)**
2025-04-30 Traceback of Poisoning Attacks to Retrieval-Augmented Generation Baolei Zhang et.al. 2504.21668 null
2025-04-30 How to Backdoor the Knowledge Distillation Chen Wu et.al. 2504.21323 null

2025-04-29

Publish Date Title Authors PDF Code
2025-04-29 Erased but Not Forgotten: How Backdoors Compromise Concept Erasure Jonas Henry Grebe et.al. 2504.21072 null
2025-04-29 FFCBA: Feature-based Full-target Clean-label Backdoor Attacks Yangxu Yin et.al. 2504.21054 null
2025-04-29 SFIBA: Spatial-based Full-target Invisible Backdoor Attacks Yangxu Yin et.al. 2504.21052 null
2025-04-29 GaussTrap: Stealthy Poisoning Attacks on 3D Gaussian Splatting for Targeted Scene Confusion Jiaxin Hong et.al. 2504.20829 null
2025-04-29 Dynamic Attention Analysis for Backdoor Detection in Text-to-Image Diffusion Models Zhongqi Wang et.al. 2504.20518 null
2025-04-29 BadMoE: Backdooring Mixture-of-Experts LLMs via Optimizing Routing Triggers and Infecting Dormant Experts Qingyue Wang et.al. 2504.18598 null

2025-04-28

Publish Date Title Authors PDF Code
2025-04-28 What's Pulling the Strings? Evaluating Integrity and Attribution in AI Training and Inference through Concept Shift Jiamin Chang et.al. 2504.21042 null

2025-04-25

Publish Date Title Authors PDF Code
2025-04-25 Intelligent Attacks and Defense Methods in Federated Learning-enabled Energy-Efficient Wireless Networks Han Zhang et.al. 2504.18519 null

2025-04-24

Publish Date Title Authors PDF Code
2025-04-24 Unsupervised Corpus Poisoning Attacks in Continuous Space for Dense Retrieval Yongkang Li et.al. 2504.17884 null
2025-04-24 GRANITE : a Byzantine-Resilient Dynamic Gossip Learning Framework Yacine Belal et.al. 2504.17471 null
2025-04-24 The Ultimate Cookbook for Invisible Poison: Crafting Subtle Clean-Label Text Backdoors with Style Attributes Wencong You et.al. 2504.17300 null

2025-04-23

Publish Date Title Authors PDF Code
2025-04-23 Robo-Troj: Attacking LLM-based Task Planners Mohaiminul Al Nahian et.al. 2504.17070 null
2025-04-23 BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation Ruotong Wang et.al. 2504.16907 null

2025-04-22

Publish Date Title Authors PDF Code
2025-04-22 A Geometric Approach to Problems in Optimization and Data Science Naren Sarayu Manoj et.al. 2504.16270 null
2025-04-22 OPUS-VFL: Incentivizing Optimal Privacy-Utility Tradeoffs in Vertical Federated Learning Sindhuja Madabushi et.al. 2504.15995 null
2025-04-22 TrojanDam: Detection-Free Backdoor Defense in Federated Learning through Proactive Model Robustification utilizing OOD Data Yanbo Dai et.al. 2504.15674 null

2025-04-21

Publish Date Title Authors PDF Code
2025-04-21 Backdoor Defense in Diffusion Models via Spatial Attention Unlearning Abha Jha et.al. 2504.18563 null
2025-04-21 BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models Zhengxian Wu et.al. 2504.13775 null
2025-04-21 A Client-level Assessment of Collaborative Backdoor Poisoning in Non-IID Federated Learning Phung Lai et.al. 2504.12875 null

2025-04-20

Publish Date Title Authors PDF Code
2025-04-20 REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models Chongye Guo et.al. 2504.14554 null

2025-04-19

Publish Date Title Authors PDF Code
2025-04-19 WeiDetect: Weibull Distribution-Based Defense against Poisoning Attacks in Federated Learning for Network Intrusion Detection Systems Sameera K. M. et.al. 2504.04367 null

2025-04-17

Publish Date Title Authors PDF Code
2025-04-17 Strategic Planning of Stealthy Backdoor Attacks in Markov Decision Processes Xinyi Wei et.al. 2504.13276 null
2025-04-17 ControlNET: A Firewall for RAG-based LLM System Hongwei Yao et.al. 2504.09593 null

2025-04-16

Publish Date Title Authors PDF Code
2025-04-16 Secure Transfer Learning: Training Clean Models Against Backdoor in (Both) Pre-trained Encoders and Downstream Datasets Yechao Zhang et.al. 2504.11990 null

2025-04-15

Publish Date Title Authors PDF Code
2025-04-15 Propaganda via AI? A Study on Semantic Backdoors in Large Language Models Nay Myat Min et.al. 2504.12344 **[link](https://github.com/naymyatmin/raven)**
2025-04-15 Exploring Backdoor Attack and Defense for LLM-empowered Recommendations Liangbo Ning et.al. 2504.11182 null

2025-04-14

Publish Date Title Authors PDF Code
2025-04-14 Investigating cybersecurity incidents using large language models in latest-generation wireless networks Leonid Legashev et.al. 2504.13196 null
2025-04-14 An Investigation of Large Language Models and Their Vulnerabilities in Spam Detection Qiyao Tang et.al. 2504.09776 null

2025-04-10

Publish Date Title Authors PDF Code
2025-04-10 Augmented Shuffle Protocols for Accurate and Robust Frequency Estimation under Differential Privacy Takao Murakami et.al. 2504.07362 null

2025-04-09

Publish Date Title Authors PDF Code
2025-04-09 Bridging the Gap Between Preference Alignment and Machine Unlearning Xiaohua Feng et.al. 2504.06659 null
2025-04-09 Diversity-aware Dual-promotion Poisoning Attack on Sequential Recommendation Yuchuan Zhao et.al. 2504.06586 null

2025-04-08

Publish Date Title Authors PDF Code
2025-04-08 Exploiting Meta-Learning-based Poisoning Attacks for Graph Link Prediction Mingchen Li et.al. 2504.06492 **[link](https://github.com/mingchenli/VGAE_Attack_Meta)**
2025-04-08 Defending Deep Neural Networks against Backdoor Attacks via Module Switching Weijun Li et.al. 2504.05902 null
2025-04-08 Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models Jiahao Chen et.al. 2504.05815 null
2025-04-08 ShadowCoT: Cognitive Hijacking for Stealthy Reasoning Backdoors in LLMs Gejian Zhao et.al. 2504.05605 null

2025-04-04

Publish Date Title Authors PDF Code
2025-04-04 Practical Poisoning Attacks against Retrieval-Augmented Generation Baolei Zhang et.al. 2504.03957 null
2025-04-04 PPFPL: Cross-silo Privacy-preserving Federated Prototype Learning Against Data Poisoning Attacks on Non-IID Data Hongliang Zhang et.al. 2504.03173 null

2025-04-02

Publish Date Title Authors PDF Code
2025-04-02 One Pic is All it Takes: Poisoning Visual Document Retrieval Augmented Generation with a Single Image Ezzeldin Shereen et.al. 2504.02132 null
2025-04-02 Sky of Unlearning (SoUL): Rewiring Federated Machine Unlearning via Selective Pruning Md Mahabub Uz Zaman et.al. 2504.01705 null

2025-03-31

Publish Date Title Authors PDF Code
2025-03-31 Backdoor Detection through Replicated Execution of Outsourced Training Hengrui Jia et.al. 2504.00170 null
2025-03-31 A Channel-Triggered Backdoor Attack on Wireless Semantic Image Reconstruction Jialin Wan et.al. 2503.23866 null

2025-03-30

Publish Date Title Authors PDF Code
2025-03-30 Buffer is All You Need: Defending Federated Learning against Backdoor Attacks under Non-iids via Buffering Xingyu Lyu et.al. 2503.23511 null
2025-03-30 Two Heads Are Better than One: Model-Weight and Latent-Space Analysis for Federated Learning on Non-iid Data against Poisoning Attacks Xingyu Lyu et.al. 2503.23288 null

2025-03-27

Publish Date Title Authors PDF Code
2025-03-27 Data Poisoning in Deep Learning: A Survey Pinlong Zhao et.al. 2503.22759 **[link](https://github.com/pinlong-zhao/data-poisoning)**
2025-03-27 Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack Cheng Wang et.al. 2503.21315 null
2025-03-27 DeBackdoor: A Deductive Framework for Detecting Backdoor Attacks on Deep Models with Limited Data Dorde Popovic et.al. 2503.21305 null
2025-03-27 Clean Image May be Dangerous: Data Poisoning Attacks Against Deep Hashing Shuai Li et.al. 2503.21236 null

2025-03-26

Publish Date Title Authors PDF Code
2025-03-26 Robust Federated Learning Against Poisoning Attacks: A GAN-Based Defense Framework Usama Zafar et.al. 2503.20884 **[link](https://github.com/SciML-FL/gan-filter)**
2025-03-26 How Secure is Forgetting? Linking Machine Unlearning to Machine Learning Attacks Muhammed Shafi K. P. et.al. 2503.20257 null

2025-03-24

Publish Date Title Authors PDF Code
2025-03-24 Deterministic Certification of Graph Neural Networks against Graph Poisoning Attacks with Arbitrary Perturbations Jiate Li et.al. 2503.18503 null

2025-03-22

Publish Date Title Authors PDF Code
2025-03-22 Towards Invisible Backdoor Attack on Text-to-Image Diffusion Model Jie Zhang et.al. 2503.17724 **[link](https://github.com/robin-wzq/iba)**

2025-03-21

Publish Date Title Authors PDF Code
2025-03-21 Large Language Models Can Verbatim Reproduce Long Malicious Sequences Sharon Lin et.al. 2503.17578 null

2025-03-20

Publish Date Title Authors PDF Code
2025-03-20 BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models Zenghui Yuan et.al. 2503.16023 null

2025-03-19

Publish Date Title Authors PDF Code
2025-03-19 Cyber Threats in Financial Transactions -- Addressing the Dual Challenge of AI and Quantum Computing Ahmed M. Elmisery et.al. 2503.15678 null
2025-03-19 Test-Time Backdoor Detection for Object Detection Models Hangtao Zhang et.al. 2503.15293 null
2025-03-19 OFL: Opportunistic Federated Learning for Resource-Heterogeneous and Privacy-Aware Devices Yunlong Mao et.al. 2503.15015 null
2025-03-19 A Semantic and Clean-label Backdoor Attack against Graph Convolutional Networks Jiazhu Dai et.al. 2503.14922 null

2025-03-18

Publish Date Title Authors PDF Code
2025-03-18 Entente: Cross-silo Intrusion Detection on Network Log Graphs with Federated Learning Jiacen Xu et.al. 2503.14284 null
2025-03-18 XOXO: Stealthy Cross-Origin Context Poisoning Attacks against AI Coding Assistants Adam Štorek et.al. 2503.14281 null

2025-03-17

Publish Date Title Authors PDF Code
2025-03-17 Optimizing ML Training with Metagradient Descent Logan Engstrom et.al. 2503.13751 null

2025-03-16

Publish Date Title Authors PDF Code
2025-03-16 One Goal, Many Challenges: Robust Preference Optimization Amid Content-Aware and Multi-Source Noise Amirabbas Afzali et.al. 2503.12301 null

2025-03-15

Publish Date Title Authors PDF Code
2025-03-15 Revisiting Training-Inference Trigger Intensity in Backdoor Attacks Chenhao Lin et.al. 2503.12058 **[link](https://github.com/cv12ha0/TITIM)**
2025-03-15 Revisiting Backdoor Attacks on Time Series Classification in the Frequency Domain Yuanmin Huang et.al. 2503.09712 null

2025-03-14

Publish Date Title Authors PDF Code
2025-03-14 Trust Under Siege: Label Spoofing Attacks against Machine Learning for Android Malware Detection Tianwei Lan et.al. 2503.11841 null

2025-03-13

Publish Date Title Authors PDF Code
2025-03-13 Targeted Data Poisoning for Black-Box Audio Datasets Ownership Verification Wassim Bouaziz et.al. 2503.10269 null
2025-03-13 Policy Teaching via Data Poisoning in Learning from Human Preferences Andi Nika et.al. 2503.10228 null

2025-03-12

Publish Date Title Authors PDF Code
2025-03-12 Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models Sangwon Jang et.al. 2503.09669 null
2025-03-12 Stealthy Patch-Wise Backdoor Attack in 3D Point Cloud via Curvature Awareness Yu Feng et.al. 2503.09336 null
2025-03-12 Detecting and Preventing Data Poisoning Attacks on AI Models Halima I. Kure et.al. 2503.09302 null
2025-03-12 C^2 ATTACK: Towards Representation Backdoor on CLIP via Concept Confusion Lijie Hu et.al. 2503.09095 null
2025-03-12 Adaptive Backdoor Attacks with Reasonable Constraints on Graph Neural Networks Xuewen Dong et.al. 2503.09049 null
2025-03-12 Not All Edges are Equally Robust: Evaluating the Robustness of Ranking-Based Federated Learning Zirui Gong et.al. 2503.08976 null

Generative models safety

2025-06-18

Publish Date Title Authors PDF Code
2025-06-18 LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning Gabrel J. Perin et.al. 2506.15606 **[link](https://github.com/vita-group/lox)**

2025-06-17

Publish Date Title Authors PDF Code
2025-06-17 Safe-Child-LLM: A Developmental Benchmark for Evaluating LLM Safety in Child-LLM Interactions Junfeng Jiao et.al. 2506.13510 **[link](https://github.com/the-responsible-ai-initiative/safe_child_llm_benchmark)**

2025-06-16

Publish Date Title Authors PDF Code
2025-06-16 We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems Junfeng Fang et.al. 2506.13666 **[link](https://github.com/littlelittlenine/safemcp)**

2025-06-15

Publish Date Title Authors PDF Code
2025-06-15 SecurityLingua: Efficient Defense of LLM Jailbreak Attacks via Security-Aware Prompt Compression Yucheng Li et.al. 2506.12707 null

2025-06-14

Publish Date Title Authors PDF Code
2025-06-14 QGuard:Question-based Zero-shot Guard for Multi-modal LLM Safety Taegyeong Lee et.al. 2506.12299 null
2025-06-14 Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors Chen Yueh-Han et.al. 2506.10949 **[link](https://github.com/yuehhanchen/monitoring-decomposition-attack)**

2025-06-09

Publish Date Title Authors PDF Code
2025-06-09 When Style Breaks Safety: Defending Language Models Against Superficial Style Alignment Yuxin Xiao et.al. 2506.07452 **[link](https://github.com/xiaoyuxin1002/SafeStyle)**
2025-06-09 Beyond Jailbreaks: Revealing Stealthier and Broader LLM Security Risks Stemming from Alignment Failures Yukai Zhou et.al. 2506.07402 null
2025-06-09 Refusal-Feature-guided Teacher for Safe Finetuning via Data Filtering and Alignment Distillation Seokil Ham et.al. 2506.07356 null

2025-06-08

Publish Date Title Authors PDF Code
2025-06-08 Quality-Diversity Red-Teaming: Automated Generation of High-Quality and Diverse Attackers for Large Language Models Ren-Jian Wang et.al. 2506.07121 null
2025-06-08 AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint Leheng Sheng et.al. 2506.07022 **[link](https://github.com/alphalab-ustc/alphasteer)**

2025-06-07

Publish Date Title Authors PDF Code
2025-06-07 SafeLawBench: Towards Safe Alignment of Large Language Models Chuxue Cao et.al. 2506.06636 null

2025-06-06

Publish Date Title Authors PDF Code
2025-06-06 The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs Songyang Liu et.al. 2506.11094 null
2025-06-06 Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance Ruizhong Qiu et.al. 2506.06444 **[link](https://github.com/q-rz/saffron)**

2025-06-05

Publish Date Title Authors PDF Code
2025-06-05 Interpretation Meets Safety: A Survey on Interpretation Methods and Tools for Improving LLM Safety Seongmin Lee et.al. 2506.05451 null
2025-06-05 Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets Lei Hsiung et.al. 2506.05346 null
2025-06-05 Evaluating Prompt-Driven Chinese Large Language Models: The Influence of Persona Assignment on Stereotypes and Safeguards Geng Liu et.al. 2506.04975 null

2025-06-04

Publish Date Title Authors PDF Code
2025-06-04 Should LLM Safety Be More Than Refusing Harmful Instructions? Utsav Maskey et.al. 2506.02442 null

2025-06-02

Publish Date Title Authors PDF Code
2025-06-02 ReGA: Representation-Guided Abstraction for Model-based Safeguarding of LLMs Zeming Wei et.al. 2506.01770 **[link](https://github.com/weizeming/rega)**

2025-05-31

Publish Date Title Authors PDF Code
2025-05-31 SafeTuneBed: A Toolkit for Benchmarking LLM Safety Alignment in Fine-Tuning Saad Hossain et.al. 2506.00676 null

2025-05-30

Publish Date Title Authors PDF Code
2025-05-30 Learning Safety Constraints for Large Language Models Xin Chen et.al. 2505.24445 **[link](https://github.com/lasgroup/safetypolytope)**
2025-05-30 The State of Multilingual LLM Safety Research: From Measuring the Language Gap to Mitigating It Zheng-Xin Yong et.al. 2505.24119 null

2025-05-26

Publish Date Title Authors PDF Code
2025-05-26 Safety Through Reasoning: An Empirical Study of Reasoning Guardrail Models Makesh Narsimhan Sreedhar et.al. 2505.20087 null
2025-05-26 PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks Guobin Shen et.al. 2505.13862 **[link](https://github.com/beijing-aisi/panda-guard)**

2025-05-25

Publish Date Title Authors PDF Code
2025-05-25 Benign Samples Matter! Fine-tuning On Outlier Benign Samples Severely Breaks Safety Zihan Guan et.al. 2505.06843 **[link](https://github.com/guanzihan/benign-samples-matter)**

2025-05-24

Publish Date Title Authors PDF Code
2025-05-24 Exploring the Vulnerability of the Content Moderation Guardrail in Large Language Models via Intent Manipulation Jun Zhuang et.al. 2505.18556 null

2025-05-23

Publish Date Title Authors PDF Code
2025-05-23 Chain-of-Lure: A Synthetic Narrative-Driven Approach to Compromise Large Language Models Wenhan Chang et.al. 2505.17519 null

2025-05-22

Publish Date Title Authors PDF Code
2025-05-22 Shape it Up! Restoring LLM Safety during Finetuning ShengYun Peng et.al. 2505.17196 null
2025-05-22 Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models Junjie Xiong et.al. 2505.16957 null
2025-05-22 MixAT: Combining Continuous and Discrete Adversarial Training for LLMs Csaba Dékány et.al. 2505.16947 **[link](https://github.com/insait-institute/mixat)**

2025-05-21

Publish Date Title Authors PDF Code
2025-05-21 Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering Hwan Chang et.al. 2505.15805 **[link](https://github.com/hwanchang00/CoPriva)**
2025-05-21 Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval Taiye Chen et.al. 2505.15753 null

2025-05-20

Publish Date Title Authors PDF Code
2025-05-20 Evaluating the efficacy of LLM Safety Solutions : The Palit Benchmark Dataset Sayon Palit et.al. 2505.13028 null
2025-05-20 MrGuard: A Multilingual Reasoning Guardrail for Universal LLM Safety Yahan Yang et.al. 2504.15241 null

2025-05-19

Publish Date Title Authors PDF Code
2025-05-19 Concept-Level Explainability for Auditing & Steering LLM Responses Kenza Amara et.al. 2505.07610 **[link](https://github.com/k-amara/ConceptX)**
2025-05-19 A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment Kun Wang et.al. 2504.15585 null

2025-05-18

Publish Date Title Authors PDF Code
2025-05-18 Logic Jailbreak: Efficiently Unlocking LLM Safety Restrictions Through Formal Logical Expression Jingyu Peng et.al. 2505.13527 null

2025-05-16

Publish Date Title Authors PDF Code
2025-05-16 CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs Sijia Chen et.al. 2505.11413 null
2025-05-16 PIG: Privacy Jailbreak Attack on LLMs via Gradient-based Iterative In-Context Optimization Yidan Wang et.al. 2505.09921 **[link](https://github.com/redwyd/privacyjailbreak)**

2025-05-14

Publish Date Title Authors PDF Code
2025-05-14 Understanding Gen Alpha Digital Language: Evaluation of LLM Safety Systems for Content Moderation Manisha Mehta et.al. 2505.10588 null

2025-05-03

Publish Date Title Authors PDF Code
2025-05-03 Cannot See the Forest for the Trees: Invoking Heuristics and Biases to Elicit Irrational Choices of LLMs Haoming Yang et.al. 2505.02862 null

2025-04-30

Publish Date Title Authors PDF Code
2025-04-30 Hoist with His Own Petard: Inducing Guardrails to Facilitate Denial-of-Service Attacks on Retrieval-Augmented Generation of LLMs Pan Suo et.al. 2504.21680 null

2025-04-28

Publish Date Title Authors PDF Code
2025-04-28 Prefill-Based Jailbreak: A Novel Approach of Bypassing LLM Safety Boundary Yakai Li et.al. 2504.21038 **[link](https://github.com/star5o/Prefill-based-Jailbreak)**
2025-04-28 $\texttt{SAGE}$ : A Generic Framework for LLM Safety Evaluation Madhur Jindal et.al. 2504.19674 **[link](https://github.com/Madhur-1/SageLLMSafetyEval)**

2025-04-26

Publish Date Title Authors PDF Code
2025-04-26 Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search Andy Zhou et.al. 2503.10619 null

2025-04-23

Publish Date Title Authors PDF Code
2025-04-23 Safety Pretraining: Toward the Next Generation of Safe AI Pratyush Maini et.al. 2504.16980 null
2025-04-23 aiXamine: Simplified LLM Safety and Security Fatih Deniz et.al. 2504.14985 null

2025-04-22

Publish Date Title Authors PDF Code
2025-04-22 Red Team Diffuser: Exposing Toxic Continuation Vulnerabilities in Vision-Language Models via Reinforcement Learning Ruofan Wang et.al. 2503.06223 null

2025-04-21

Publish Date Title Authors PDF Code
2025-04-21 RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search Quy-Anh Dang et.al. 2504.15047 **[link](https://github.com/knoveleng/rainbowplus)**

2025-04-17

Publish Date Title Authors PDF Code
2025-04-17 GraphAttack: Exploiting Representational Blindspots in LLM Safety Mechanisms Sinan He et.al. 2504.13052 null

2025-04-13

Publish Date Title Authors PDF Code
2025-04-13 SaRO: Enhancing LLM Safety through Reasoning-based Alignment Yutao Mou et.al. 2504.09420 null

2025-04-11

Publish Date Title Authors PDF Code
2025-04-11 SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs Aashiq Muhamed et.al. 2504.08192 null

2025-03-31

Publish Date Title Authors PDF Code
2025-03-31 $\textit{Agents Under Siege}$ : Breaking Pragmatic Multi-Agent LLM Systems with Optimized Prompt Attacks Rana Muhammad Shahroz Khan et.al. 2504.00218 null
2025-03-31 Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms Shuoming Zhang et.al. 2503.24191 null

2025-03-14

Publish Date Title Authors PDF Code
2025-03-14 LLMs for Translation: Historical, Low-Resourced Languages and Contemporary AI Models Merve Tekgurler et.al. 2503.11898 null
2025-03-14 Align in Depth: Defending Jailbreak Attacks via Progressive Answer Detoxification Yingjie Zhang et.al. 2503.11185 null

2025-03-12

Publish Date Title Authors PDF Code
2025-03-12 How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation Ruohao Guo et.al. 2503.09598 **[link](https://github.com/octaviaguo/EchoMist)**

2025-03-10

Publish Date Title Authors PDF Code
2025-03-10 Safety Guardrails for LLM-Enabled Robots Zachary Ravichandran et.al. 2503.07885 null
2025-03-10 Graphormer-Guided Task Planning: Beyond Static Rules with LLM Safety Perception Wanjing Huang et.al. 2503.06866 **[link](https://github.com/hwj20/ggtp)**

2025-03-06

Publish Date Title Authors PDF Code
2025-03-06 Safety is Not Only About Refusal: Reasoning-Enhanced Fine-tuning for Interpretable LLM Safety Yuyou Zhang et.al. 2503.05021 null
2025-03-06 One-Shot is Enough: Consolidating Multi-Turn Attacks into Efficient Single-Turn Prompts for LLMs Junwoo Ha et.al. 2503.04856 null
2025-03-06 Know Thy Judge: On the Robustness Meta-Evaluation of LLM Safety Judges Francisco Eiras et.al. 2503.04474 null

2025-03-05

Publish Date Title Authors PDF Code
2025-03-05 Improving LLM Safety Alignment with Dual-Objective Optimization Xuandong Zhao et.al. 2503.03710 **[link](https://github.com/wicai24/door-alignment)**

2025-03-04

Publish Date Title Authors PDF Code
2025-03-04 LLM-Safety Evaluations Lack Robustness Tim Beyer et.al. 2503.02574 null
2025-03-04 Adversarial Tokenization Renato Lui Geh et.al. 2503.02174 null

2025-03-02

Publish Date Title Authors PDF Code
2025-03-02 Output Length Effect on DeepSeek-R1's Safety in Forced Thinking Xuying Li et.al. 2503.01923 null

2025-02-26

Publish Date Title Authors PDF Code
2025-02-26 JailBench: A Comprehensive Chinese Security Assessment Benchmark for Large Language Models Shuyi Liu et.al. 2502.18935 null

2025-02-24

Publish Date Title Authors PDF Code
2025-02-24 LongSafety: Evaluating Long-Context Safety of Large Language Models Yida Lu et.al. 2502.16971 **[link](https://github.com/thu-coai/longsafety)**

2025-02-21

Publish Date Title Authors PDF Code
2025-02-21 SafeInt: Shielding Large Language Models from Jailbreak Attacks via Safety-Aware Representation Intervention Jiaqi Wu et.al. 2502.15594 null
2025-02-21 Attention Eclipse: Manipulating Attention to Bypass LLM Safety-Alignment Pedram Zaree et.al. 2502.15334 null

2025-02-20

Publish Date Title Authors PDF Code
2025-02-20 Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models Yeonjun In et.al. 2502.15086 **[link](https://github.com/yeonjun-in/u-safebench)**

2025-02-19

Publish Date Title Authors PDF Code
2025-02-19 Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region Chak Tou Leong et.al. 2502.13946 null
2025-02-19 Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts Maiya Goloburda et.al. 2502.13640 null

Data privacy

2025-06-25

Publish Date Title Authors PDF Code
2025-06-25 WallStreetFeds: Client-Specific Tokens as Investment Vehicles in Federated Learning Arno Geimer et.al. 2506.20518 null
2025-06-25 Client Clustering Meets Knowledge Sharing: Enhancing Privacy and Robustness in Personalized Peer-to-Peer Learning Mohammad Mahdi Maheri et.al. 2506.20413 null
2025-06-25 FedBKD: Distilled Federated Learning to Embrace Gerneralization and Personalization on Non-IID Data Yushan Zhao et.al. 2506.20245 null
2025-06-25 Personalized Mental State Evaluation in Human-Robot Interaction using Federated Learning Andrea Bussolan et.al. 2506.20212 null

2025-06-24

Publish Date Title Authors PDF Code
2025-06-24 Progressive Size-Adaptive Federated Learning: A Comprehensive Framework for Heterogeneous Multi-Modal Data Systems Sajid Hussain et.al. 2506.20685 null
2025-06-24 ReBoot: Encrypted Training of Deep Neural Networks with CKKS Bootstrapping Alberto Pirillo et.al. 2506.19693 null
2025-06-24 Automated Detection of Pre-training Text in Black-box LLMs Ruihan Hu et.al. 2506.19399 null
2025-06-24 Behavioral Anomaly Detection in Distributed Systems via Federated Contrastive Learning Renzi Meng et.al. 2506.19246 null
2025-06-24 FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models Xinting Liao et.al. 2506.16218 null

2025-06-23

Publish Date Title Authors PDF Code
2025-06-23 SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications Jinyang Li et.al. 2506.18951 null
2025-06-23 Federated Learning from Molecules to Processes: A Perspective Jan G. Rittig et.al. 2506.18525 null

2025-06-22

Publish Date Title Authors PDF Code
2025-06-22 Federated Learning-Based Data Collaboration Method for Enhancing Edge Cloud AI System Security Using Large Language Models Huaiying Luo et.al. 2506.18087 null

2025-06-21

Publish Date Title Authors PDF Code
2025-06-21 Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs Yiwei Chen et.al. 2506.14003 **[link](https://github.com/optml-group/unlearn-trace)**

2025-06-20

Publish Date Title Authors PDF Code
2025-06-20 AI based Content Creation and Product Recommendation Applications in E-commerce: An Ethical overview Aditi Madhusudan Jain et.al. 2506.17370 null

2025-06-19

Publish Date Title Authors PDF Code
2025-06-19 SecureFed: A Two-Phase Framework for Detecting Malicious Clients in Federated Learning Likhitha Annapurna Kavuri et.al. 2506.16458 null
2025-06-19 Leveraging Optimal Transport for Distributed Two-Sample Testing: An Integrated Transportation Distance-based Framework Zhengqi Lin et.al. 2506.16047 null

2025-06-18

Publish Date Title Authors PDF Code
2025-06-18 Tracking GPTs Third Party Service: Automation, Analysis, and Insights Chuan Yan et.al. 2506.17315 null
2025-06-18 PNCS:Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning Liangyan Li et.al. 2506.15923 null
2025-06-18 Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for Alzheimer's and Dementia Caregivers Jiayue Melissa Shi et.al. 2506.15047 null

2025-06-17

Publish Date Title Authors PDF Code
2025-06-17 SoK: Privacy-Enhancing Technologies in Artificial Intelligence Nouha Oualha et.al. 2506.14576 null
2025-06-17 CausalDiffTab: Mixed-Type Causal-Aware Diffusion for Tabular Data Generation Jia-Chen Zhang et.al. 2506.14206 null

2025-06-16

Publish Date Title Authors PDF Code
2025-06-16 ReinDSplit: Reinforced Dynamic Split Learning for Pest Recognition in Precision Agriculture Vishesh Kumar Tanwar et.al. 2506.13935 null

2025-06-15

Publish Date Title Authors PDF Code
2025-06-15 Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption Nina Cai et.al. 2506.12846 null

2025-06-14

Publish Date Title Authors PDF Code
2025-06-14 Real-Time, Low-Latency Surveillance Using Entropy-Based Adaptive Buffering and MobileNetV2 on Edge Devices Poojashree Chandrashekar Pankaj M Sajjanar et.al. 2506.14833 null
2025-06-14 OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics Vineeth Dorna et.al. 2506.12618 **[link](https://github.com/locuslab/open-unlearning)**

2025-06-13

Publish Date Title Authors PDF Code
2025-06-13 MRI-CORE: A Foundation Model for Magnetic Resonance Imaging Haoyu Dong et.al. 2506.12186 null
2025-06-13 AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction Syeda Kisaa Fatima et.al. 2506.11475 null

2025-06-12

Publish Date Title Authors PDF Code
2025-06-12 Federated Learning within Global Energy Budget over Heterogeneous Edge Accelerators Roopkatha Banerjee et.al. 2506.10413 null

2025-06-11

Publish Date Title Authors PDF Code
2025-06-11 Knockoffs Inference under Privacy Constraints Zhanrui Cai et.al. 2506.09690 null
2025-06-11 Wavelet Scattering Transform and Fourier Representation for Offline Detection of Malicious Clients in Federated Learning Alessandro Licciardi et.al. 2506.09674 null
2025-06-11 A Survey on the Role of Artificial Intelligence and Machine Learning in 6G-V2X Applications Donglin Wang et.al. 2506.09512 null

2025-06-10

Publish Date Title Authors PDF Code
2025-06-10 DIsoN: Decentralized Isolation Networks for Out-of-Distribution Detection in Medical Imaging Felix Wagner et.al. 2506.09024 null
2025-06-10 A Privacy-Preserving Federated Learning Framework for Generalizable CBCT to Synthetic CT Translation in Head and Neck Ciro Benito Raggio et.al. 2506.08654 null

2025-06-09

Publish Date Title Authors PDF Code
2025-06-09 Private Memorization Editing: Turning Memorization into a Defense to Strengthen Data Privacy in Large Language Models Elena Sofia Ruzzetti et.al. 2506.10024 **[link](https://github.com/elenasofia98/pme)**
2025-06-09 FedGA-Tree: Federated Decision Tree using Genetic Algorithm Anh V Nguyen et.al. 2506.08176 null
2025-06-09 A Systematic Literature Review on Continuous Integration and Deployment (CI/CD) for Secure Cloud Computing Sabbir M. Saleh et.al. 2506.08055 null
2025-06-09 Understanding the Error Sensitivity of Privacy-Aware Computing Matías Mazzanti et.al. 2506.07957 null
2025-06-09 Federated In-Context Learning: Iterative Refinement for Improved Answer Quality Ruhan Wang et.al. 2506.07440 null

2025-06-08

Publish Date Title Authors PDF Code
2025-06-08 Patient Similarity Computation for Clinical Decision Support: An Efficient Use of Data Transformation, Combining Static and Time Series Data Joydeb Kumar Sana et.al. 2506.07092 null

2025-06-07

Publish Date Title Authors PDF Code
2025-06-07 In-Sensor Motion Recognition with Memristive System and Light Sensing Surfaces Hritom Das et.al. 2506.06829 null
2025-06-07 LADSG: Label-Anonymized Distillation and Similar Gradient Substitution for Label Privacy in Vertical Federated Learning Zeyu Yan et.al. 2506.06742 null
2025-06-07 Fuse and Federate: Enhancing EV Charging Station Security with Multimodal Fusion and Federated Learning Rabah Rahal et.al. 2506.06730 null

2025-06-06

Publish Date Title Authors PDF Code
2025-06-06 Privacy Perspectives and Practices of Chinese Smart Home Product Teams Shijing He et.al. 2506.06591 null
2025-06-06 Reinforcement Learning for Autonomous Warehouse Orchestration in SAP Logistics Execution: Redefining Supply Chain Agility Sumanth Pillella et.al. 2506.06523 null
2025-06-06 A Certified Unlearning Approach without Access to Source Data Umit Yigit Basaran et.al. 2506.06486 null
2025-06-06 Direct Behavior Optimization: Unlocking the Potential of Lightweight LLMs Hongming Yang et.al. 2506.06401 null
2025-06-06 Towards Lifecycle Unlearning Commitment Management: Measuring Sample-level Unlearning Completeness Cheng-Long Wang et.al. 2506.06112 **[link](https://github.com/happy2git/unlearning_inference_iam)**
2025-06-06 Simple Yet Effective: Extracting Private Data Across Clients in Federated Fine-Tuning of Large Language Models Yingqi Hu et.al. 2506.06060 null
2025-06-06 Mitigating Catastrophic Forgetting with Adaptive Transformer Block Expansion in Federated Fine-Tuning Yujia Huo et.al. 2506.05977 null
2025-06-06 Small Models, Big Support: A Local LLM Framework for Teacher-Centric Content Creation and Assessment using RAG and CAG Zarreen Reza et.al. 2506.05925 null
2025-06-06 When Better Features Mean Greater Risks: The Performance-Privacy Trade-Off in Contrastive Learning Ruining Sun et.al. 2506.05743 null
2025-06-06 FedShield-LLM: A Secure and Scalable Federated Fine-Tuned Large Language Model Md Jueal Mia et.al. 2506.05640 null
2025-06-06 Rethinking Machine Unlearning in Image Generation Models Renyang Liu et.al. 2506.02761 **[link](https://github.com/ryliu68/igmu)**
2025-06-06 Federated Foundation Model for GI Endoscopy Images Alina Devkota et.al. 2505.24108 null

2025-06-05

Publish Date Title Authors PDF Code
2025-06-05 Federated Isolation Forest for Efficient Anomaly Detection on Edge IoT Systems Pavle Vasiljevic et.al. 2506.05138 null
2025-06-05 Software Bill of Materials in Software Supply Chain Security A Systematic Literature Review Eric O'Donoghue et.al. 2506.03507 null

2025-06-04

Publish Date Title Authors PDF Code
2025-06-04 Privacy and Security Threat for OpenAI GPTs Wei Wenying et.al. 2506.04036 null
2025-06-04 PC-MoE: Memory-Efficient and Privacy-Preserving Collaborative Training for Mixture-of-Experts LLMs Ze Yu Zhang et.al. 2506.02965 null

2025-06-03

Publish Date Title Authors PDF Code
2025-06-03 An Algorithmic Pipeline for GDPR-Compliant Healthcare Data Anonymisation: Moving Toward Standardisation Hamza Khan et.al. 2506.02942 null
2025-06-03 ThinkTank: A Framework for Generalizing Domain-Specific AI Agent Systems into Universal Collaborative Intelligence Platforms Praneet Sai Madhu Surabhi et.al. 2506.02931 **[link](https://github.com/taugroup/thinktank)**
2025-06-03 CLONE: Customizing LLMs for Efficient Latency-Aware Inference at the Edge Chunlin Tian et.al. 2506.02847 null
2025-06-03 Decentralized COVID-19 Health System Leveraging Blockchain Lingsheng Chen et.al. 2506.02674 null
2025-06-03 From Prompts to Protection: Large Language Model-Enabled In-Context Learning for Smart Public Safety UAV Yousef Emami et.al. 2506.02649 null
2025-06-03 State Similarity in Modular Superconducting Quantum Processors with Classical Communications Bujiao Wu et.al. 2506.01657 null

2025-06-02

Publish Date Title Authors PDF Code
2025-06-02 Fingerprinting Deep Learning Models via Network Traffic Patterns in Federated Learning Md Nahid Hasan Shuvo et.al. 2506.03207 null

2025-06-01

Publish Date Title Authors PDF Code
2025-06-01 Addressing the Collaboration Dilemma in Low-Data Federated Learning via Transient Sparsity Qiao Xiao et.al. 2506.00932 null

2025-05-31

Publish Date Title Authors PDF Code
2025-05-31 Blockchain Powered Edge Intelligence for U-Healthcare in Privacy Critical and Time Sensitive Environment Anum Nawaz et.al. 2506.02038 null
2025-05-31 PSI-PFL: Population Stability Index for Client Selection in non-IID Personalized Federated Learning Daniel-M. Jimenez-Gutierrez et.al. 2506.00440 null
2025-05-31 Hybrid Cloud Security: Balancing Performance, Cost, and Compliance in Multi-Cloud Deployments Anjani kumar Polinati et.al. 2506.00426 null
2025-05-31 SHARE: An SLM-based Hierarchical Action CorREction Assistant for Text-to-SQL Ge Qu et.al. 2506.00391 null

2025-05-30

Publish Date Title Authors PDF Code
2025-05-30 Structuring Radiology Reports: Challenging LLMs with Lightweight Models Johannes Moll et.al. 2506.00200 null
2025-05-30 Position: Federated Foundation Language Model Post-Training Should Focus on Open-Source Models Nikita Agrawal et.al. 2505.23593 null

2025-05-29

Publish Date Title Authors PDF Code
2025-05-29 Adaptive Deadline and Batch Layered Synchronized Federated Learning Asaf Goren et.al. 2505.23973 null
2025-05-29 Does Machine Unlearning Truly Remove Model Knowledge? A Framework for Auditing Unlearning in LLMs Haokun Chen et.al. 2505.23270 null
2025-05-29 Loss-Guided Model Sharing and Local Learning Correction in Decentralized Federated Learning for Crop Disease Classification Denis Mamba Kabala et.al. 2505.23063 null
2025-05-29 Deep Modeling and Optimization of Medical Image Classification Yihang Wu et.al. 2505.23040 **[link](https://github.com/aipmlab/skincancersimulation)**
2025-05-29 EL4NER: Ensemble Learning for Named Entity Recognition via Multiple Small-Parameter Large Language Models Yuzhen Xiao et.al. 2505.23038 null

2025-05-28

Publish Date Title Authors PDF Code
2025-05-28 TensorShield: Safeguarding On-Device Inference by Shielding Critical DNN Tensors with TEE Tong Sun et.al. 2505.22735 null
2025-05-28 Evolution of repositories and privacy laws: commit activities in the GDPR and CCPA era Georgia M. Kapitsaki et.al. 2505.22234 null

2025-05-27

Publish Date Title Authors PDF Code
2025-05-27 DP-RTFL: Differentially Private Resilient Temporal Federated Learning for Trustworthy AI in Regulated Industries Abhijit Talluri et.al. 2505.23813 null
2025-05-27 MedOrchestra: A Hybrid Cloud-Local LLM Approach for Clinical Data Interpretation Sihyeon Lee et.al. 2505.23806 null
2025-05-27 StreamLink: Large-Language-Model Driven Distributed Data Engineering System Dawei Feng et.al. 2505.21575 null
2025-05-27 Federated Instrumental Variable Analysis via Federated Generalized Method of Moments Geetika et.al. 2505.21012 null
2025-05-27 Facial Attribute Based Text Guided Face Anonymization Mustafa İzzet Muştu et.al. 2505.21002 null
2025-05-27 Generalized and Personalized Federated Learning with Foundation Models via Orthogonal Transformations Eun Gyung Kong et.al. 2505.19888 null

2025-05-26

Publish Date Title Authors PDF Code
2025-05-26 SEMFED: Semantic-Aware Resource-Efficient Federated Learning for Heterogeneous NLP Tasks Sajid Hussain et.al. 2505.23801 null
2025-05-26 LAPA-based Dynamic Privacy Optimization for Wireless Federated Learning in Heterogeneous Environments Pengcheng Sun et.al. 2505.19823 null
2025-05-26 Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments Junming Liu et.al. 2505.19699 null

2025-05-25

Publish Date Title Authors PDF Code
2025-05-25 Cellular Traffic Prediction via Byzantine-robust Asynchronous Federated Learning Hui Ma et.al. 2505.19263 **[link](https://github.com/maggiemh/bafdp)**

2025-05-24

Publish Date Title Authors PDF Code
2025-05-24 Understanding the Relationship Between Personal Data Privacy Literacy and Data Privacy Information Sharing by University Students Brady D. Lund et.al. 2505.18870 null
2025-05-24 Anonymity-washing Szivia Lestyán et.al. 2505.18627 null

2025-05-23

Publish Date Title Authors PDF Code
2025-05-23 Temporal Restoration and Spatial Rewiring for Source-Free Multivariate Time Series Domain Adaptation Peiliang Gong et.al. 2505.21525 null
2025-05-23 Retrieval Augmented Generation-based Large Language Models for Bridging Transportation Cybersecurity Legal Knowledge Gaps Khandakar Ashrafi Akbar et.al. 2505.18426 null
2025-05-23 RedactOR: An LLM-Powered Framework for Automatic Clinical Data De-Identification Praphul Singh et.al. 2505.18380 null
2025-05-23 WiNGPT-3.0 Technical Report Boqin Zhuang et.al. 2505.17387 null

2025-05-22

Publish Date Title Authors PDF Code
2025-05-22 LLM Access Shield: Domain-Specific LLM Framework for Privacy Policy Compliance Yu Wang et.al. 2505.17145 null
2025-05-22 Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks Hongyuan Tao et.al. 2505.16901 null
2025-05-22 ATR-Bench: A Federated Learning Benchmark for Adaptation, Trust, and Reasoning Tajamul Ashraf et.al. 2505.16850 **[link](https://github.com/tajamul21/atr-bench)**
2025-05-22 From Local Patterns to Global Understanding: Cross-Stock Trend Integration for Enhanced Predictive Modeling Yi Hu et.al. 2505.16573 null
2025-05-22 A Two-Stage Data Selection Framework for Data-Efficient Model Training on Edge Devices Chen Gong et.al. 2505.16563 null
2025-05-22 Interpretable Anomaly Detection in Encrypted Traffic Using SHAP with Machine Learning Models Kalindi Singh et.al. 2505.16261 null
2025-05-22 Enhancing Federated Survival Analysis through Peer-Driven Client Reputation in Healthcare Navid Seidi et.al. 2505.16190 null

2025-05-21

Publish Date Title Authors PDF Code
2025-05-21 Are LLMs reliable? An exploration of the reliability of large language models in clinical note generation Kristine Ann M. Carandang et.al. 2505.17095 null
2025-05-21 Federated Learning-Enhanced Blockchain Framework for Privacy-Preserving Intrusion Detection in Industrial IoT Anas Ali et.al. 2505.15376 null
2025-05-21 EC-LDA : Label Distribution Inference Attack against Federated Graph Learning with Embedding Compression Tong Cheng et.al. 2505.15140 null
2025-05-21 A Survey On Secure Machine Learning Taobo Liao et.al. 2505.15124 null

2025-05-20

Publish Date Title Authors PDF Code
2025-05-20 Listen, Analyze, and Adapt to Learn New Attacks: An Exemplar-Free Class Incremental Learning Method for Audio Deepfake Source Tracing Yang Xiao et.al. 2505.14601 null
2025-05-20 Federated prediction for scalable and privacy-preserved knowledge-based planning in radiotherapy Jingyun Chen et.al. 2505.14507 null
2025-05-20 CE-LSLM: Efficient Large-Small Language Model Inference and Communication via Cloud-Edge Collaboration Pengyan Zhu et.al. 2505.14085 null
2025-05-20 FedGraM: Defending Against Untargeted Attacks in Federated Learning via Embedding Gram Matrix Di Wu et.al. 2505.14024 null
2025-05-20 Zk-SNARK for String Match Taoran Li et.al. 2505.13964 **[link](https://github.com/taobol2/CS407_Project)**

2025-05-19

Publish Date Title Authors PDF Code
2025-05-19 Advancing Software Quality: A Standards-Focused Review of LLM-Based Assurance Techniques Avinash Patil et.al. 2505.13766 null
2025-05-19 FedCTTA: A Collaborative Approach to Continual Test-Time Adaptation in Federated Learning Rakibul Hasan Rajib et.al. 2505.13643 null
2025-05-19 Exploring Federated Pruning for Large Language Models Pengxin Guo et.al. 2505.13547 **[link](https://github.com/pengxin-guo/fedprllm)**
2025-05-19 DynaNoise: Dynamic Probabilistic Noise Injection for Defending Against Membership Inference Attacks Javad Forough et.al. 2505.13362 null
2025-05-19 Cross-Cloud Data Privacy Protection: Optimizing Collaborative Mechanisms of AI Systems by Integrating Federated Learning and LLMs Huaiying Luo et.al. 2505.13292 null
2025-05-19 Unlearning for Federated Online Learning to Rank: A Reproducibility Study Yiling Tao et.al. 2505.12791 **[link](https://github.com/iris1026/unlearning-for-foltr)**

2025-05-18

Publish Date Title Authors PDF Code
2025-05-18 A Comprehensive Review of Techniques, Algorithms, Advancements, Challenges, and Clinical Applications of Multi-modal Medical Image Fusion for Improved Diagnosis Muhammad Zubair et.al. 2505.14715 null
2025-05-18 PoLO: Proof-of-Learning and Proof-of-Ownership at Once with Chained Watermarking Haiyu Deng et.al. 2505.12296 null
2025-05-18 ACU: Analytic Continual Unlearning for Efficient and Exact Forgetting with Privacy Preservation Jianheng Tang et.al. 2505.12239 null
2025-05-18 Enhancing the Performance of Global Model by Improving the Adaptability of Local Models in Federated Learning Wujun Zhou et.al. 2505.10125 null

2025-05-17

Publish Date Title Authors PDF Code
2025-05-17 Federated Deep Reinforcement Learning for Privacy-Preserving Robotic-Assisted Surgery Sana Hafeez et.al. 2505.12153 null
2025-05-17 FedHQ: Hybrid Runtime Quantization for Federated Learning Zihao Zheng et.al. 2505.11982 null

2025-05-16

Publish Date Title Authors PDF Code
2025-05-16 Federated Low-Rank Adaptation for Foundation Models: A Survey Yiyuan Yang et.al. 2505.13502 null
2025-05-16 Verifiably Forgotten? Gradient Differences Still Enable Data Reconstruction in Federated Unlearning Fuyao Zhang et.al. 2505.11097 null
2025-05-16 Nosy Layers, Noisy Fixes: Tackling DRAs in Federated Learning Systems using Explainable AI Meghali Nandi et.al. 2505.10942 null
2025-05-16 Privacy-Aware Lifelong Learning Ozan Özdenizci et.al. 2505.10941 null
2025-05-16 Convergence Analysis of the Last Iterate in Distributed Stochastic Gradient Descent with Momentum Difei Cheng et.al. 2505.10889 null

2025-05-15

Publish Date Title Authors PDF Code
2025-05-15 Locally Differentially Private Frequency Estimation via Joint Randomized Response Ye Zheng et.al. 2505.10349 **[link](https://github.com/ZhengYeah/JRR)**
2025-05-15 Private Transformer Inference in MLaaS: A Survey Yang Li et.al. 2505.10315 null
2025-05-15 Cutting Through Privacy: A Hyperplane-Based Data Reconstruction Attack in Federated Learning Francesco Diana et.al. 2505.10264 null
2025-05-15 Robust Federated Learning on Edge Devices with Domain Heterogeneity Huy Q. Le et.al. 2505.10128 null

2025-05-14

Publish Date Title Authors PDF Code
2025-05-14 Robust Federated Learning with Confidence-Weighted Filtering and GAN-Based Completion under Noisy and Incomplete Data Alpaslan Gokcen et.al. 2505.09733 null
2025-05-14 FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization Xiaoyang Yu et.al. 2505.09385 null

2025-05-13

Publish Date Title Authors PDF Code
2025-05-13 AI and Generative AI Transforming Disaster Management: A Survey of Damage Assessment and Response Techniques Aman Raj et.al. 2505.08202 null

2025-05-12

Publish Date Title Authors PDF Code
2025-05-12 A Federated Random Forest Solution for Secure Distributed Machine Learning Alexandre Cotorobai et.al. 2505.08085 **[link](https://github.com/ieeta-pt/fed_rf)**
2025-05-12 Privacy-Preserving Real-Time Vietnamese-English Translation on iOS using Edge AI Cong Le et.al. 2505.07583 null
2025-05-12 FedIFL: A federated cross-domain diagnostic framework for motor-driven systems with inconsistent fault modes Zexiao Wang et.al. 2505.07315 null
2025-05-12 Empowering the Grid: Collaborative Edge Artificial Intelligence for Decentralized Energy Systems Eddie de Paula Jr et.al. 2505.07170 null

2025-05-11

Publish Date Title Authors PDF Code
2025-05-11 Source Anonymity for Private Random Walk Decentralized Learning Maximilian Egger et.al. 2505.07011 null

2025-05-10

Publish Date Title Authors PDF Code
2025-05-10 A Contrastive Federated Semi-Supervised Learning Intrusion Detection Framework for Internet of Robotic Things Yifan Zeng et.al. 2505.06636 null

2025-05-09

Publish Date Title Authors PDF Code
2025-05-09 Remote Rowhammer Attack using Adversarial Observations on Federated Learning Clients Jinsheng Yuan et.al. 2505.06335 null
2025-05-09 Self-Supervised Federated GNSS Spoofing Detection with Opportunistic Data Wenjie Liu et.al. 2505.06171 null
2025-05-09 Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation Stefan Vasilev et.al. 2505.06027 null
2025-05-09 Efficient Full-Stack Private Federated Deep Learning with Post-Quantum Security Yiwei Zhang et.al. 2505.05751 null

2025-05-08

Publish Date Title Authors PDF Code
2025-05-08 Optimal Regret of Bernoulli Bandits under Global Differential Privacy Achraf Azize et.al. 2505.05613 null
2025-05-08 Adaptive Biased User Scheduling for Heterogeneous Wireless Federate Learning Network Changxiang Wu et.al. 2505.05231 null
2025-05-08 FedTDP: A Privacy-Preserving and Unified Framework for Trajectory Data Preparation via Federated Learning Zhihao Zeng et.al. 2505.05155 null
2025-05-08 CacheFL: Efficient Federated Cache Model Fine-Tuning for Vision-Language Models Mengjun Yi et.al. 2505.05130 null
2025-05-08 Balancing Client Participation in Federated Learning Using AoI Alireza Javani et.al. 2505.05099 null
2025-05-08 Federated Learning for Cyber Physical Systems: A Comprehensive Survey Minh K. Quan et.al. 2505.04873 null

2025-05-07

Publish Date Title Authors PDF Code
2025-05-07 Privacy-preserving neutral atom-based quantum classifier towards real healthcare applications Ettore Canonici et.al. 2505.04570 null
2025-05-07 RDPP-TD: Reputation and Data Privacy-Preserving based Truth Discovery Scheme in Mobile Crowdsensing Lijian Wu et.al. 2505.04361 null
2025-05-07 A Framework to Prevent Biometric Data Leakage in the Immersive Technologies Domain Keshav Sood et.al. 2505.04123 null

2025-05-06

Publish Date Title Authors PDF Code
2025-05-06 An Overview of the Prospects and Challenges of Using Artificial Intelligence for Energy Management Systems in Microgrids Noor ul Misbah Khanum et.al. 2505.05498 null
2025-05-06 AI-Driven Security in Cloud Computing: Enhancing Threat Detection, Automated Response, and Cyber Resilience Shamnad Mohamed Shaffi et.al. 2505.03945 null
2025-05-06 Event-Triggered GAT-LSTM Framework for Attack Detection in Heating, Ventilation, and Air Conditioning Systems Zhenan Feng et.al. 2505.03559 null
2025-05-06 SKALD: Scalable K-Anonymisation for Large Datasets Kailash Reddy et.al. 2505.03529 null
2025-05-06 SemSpaceFL: A Collaborative Hierarchical Federated Learning Framework for Semantic Communication in 6G LEO Satellites Loc X. Nguyen et.al. 2505.00966 null

2025-05-05

Publish Date Title Authors PDF Code
2025-05-05 Memorization or Interpolation ? Detecting LLM Memorization through Input Perturbation Analysis Albérick Euraste Djiré et.al. 2505.03019 null
2025-05-05 Navigating Privacy and Trust: AI Assistants as Social Support for Older Adults Karina LaRubbio et.al. 2505.02975 null
2025-05-05 Unlearning vs. Obfuscation: Are We Truly Removing Knowledge? Guangzhi Sun et.al. 2505.02884 null
2025-05-05 Encrypted Federated Search Using Homomorphic Encryption Om Rathod et.al. 2505.02409 null
2025-05-05 Quantitative Analysis of Performance Drop in DeepSeek Model Quantization Enbo Zhao et.al. 2505.02390 **[link](https://github.com/unicomai/deepseek-eval)**

2025-05-04

Publish Date Title Authors PDF Code
2025-05-04 Student Perspectives on the Benefits and Risks of AI in Education Griffin Pitts et.al. 2505.02198 null

2025-05-03

Publish Date Title Authors PDF Code
2025-05-03 Towards Trustworthy Federated Learning with Untrusted Participants Youssef Allouah et.al. 2505.01874 null
2025-05-03 PQS-BFL: A Post-Quantum Secure Blockchain-based Federated Learning Framework Daniel Commey et.al. 2505.01866 null
2025-05-03 Privacy Preserving Machine Learning Model Personalization through Federated Personalized Learning Md. Tanzib Hosain et.al. 2505.01788 null
2025-05-03 Enhanced Flexibility Aggregation Using LinDistFlow Model with Loss Compensation Yanlin Jiang et.al. 2505.01715 null

2025-05-02

Publish Date Title Authors PDF Code
2025-05-02 Securing the Future of IVR: AI-Driven Innovation with Agile Security, Data Regulation, and Ethical AI Integration Khushbu Mehboob Shaikh et.al. 2505.01514 null

2025-05-01

Publish Date Title Authors PDF Code
2025-05-01 Distributed Retrieval-Augmented Generation Chenhao Xu et.al. 2505.00443 null

2025-04-30

Publish Date Title Authors PDF Code
2025-04-30 Sparsification Under Siege: Defending Against Poisoning Attacks in Communication-Efficient Federated Learning Zhiyong Jin et.al. 2505.01454 null
2025-04-30 VDDP: Verifiable Distributed Differential Privacy under the Client-Server-Verifier Setup Haochen Sun et.al. 2504.21752 null
2025-04-30 Bilateral Differentially Private Vertical Federated Boosted Decision Trees Bokang Zhang et.al. 2504.21739 null

2025-04-29

Publish Date Title Authors PDF Code
2025-04-29 FedHERO: A Federated Learning Approach for Node Classification Task on Heterophilic Graphs Zihan Chen et.al. 2504.21206 null
2025-04-29 Federated One-Shot Learning with Data Privacy and Objective-Hiding Maximilian Egger et.al. 2504.21182 null
2025-04-29 A Brief Review for Compression and Transfer Learning Techniques in DeepFake Detection Andreas Karathanasis et.al. 2504.21066 null
2025-04-29 Bipartite Randomized Response Mechanism for Local Differential Privacy Shun Zhang et.al. 2504.20926 null
2025-04-29 ReCIT: Reconstructing Full Private Data from Gradient in Parameter-Efficient Fine-Tuning of Large Language Models Jin Xie et.al. 2504.20570 null
2025-04-29 Clustering-Based Evolutionary Federated Multiobjective Optimization and Learning Chengui Xiao et.al. 2504.20346 null

2025-04-28

Publish Date Title Authors PDF Code
2025-04-28 FedCCL: Federated Clustered Continual Learning Framework for Privacy-focused Energy Forecasting Michael A. Helcig et.al. 2504.20282 null
2025-04-28 Financial Data Analysis with Robust Federated Logistic Regression Kun Yang et.al. 2504.20250 null
2025-04-28 Federated Out-of-Distribution Generalization: A Causal Augmentation View Runhui Zhang et.al. 2504.19882 null
2025-04-28 Dynamic Tsetlin Machine Accelerators for On-Chip Training at the Edge using FPGAs Gang Mao et.al. 2504.19797 null

2025-04-27

Publish Date Title Authors PDF Code
2025-04-27 Privacy-Preserving Federated Embedding Learning for Localized Retrieval-Augmented Generation Qianren Mao et.al. 2504.19101 null

2025-04-26

Publish Date Title Authors PDF Code
2025-04-26 SONNI: Secure Oblivious Neural Network Inference Luke Sperling et.al. 2504.18974 null

2025-04-25

Publish Date Title Authors PDF Code
2025-04-25 The Rise of Small Language Models in Healthcare: A Comprehensive Survey Muskan Garg et.al. 2504.17119 null

2025-04-24

Publish Date Title Authors PDF Code
2025-04-24 Federated Learning: A Survey on Privacy-Preserving Collaborative Intelligence Edward Collins et.al. 2504.17703 null
2025-04-24 From Randomized Response to Randomized Index: Answering Subset Counting Queries with Local Differential Privacy Qingqing Ye et.al. 2504.17523 null
2025-04-24 PatientDx: Merging Large Language Models for Protecting Data-Privacy in Healthcare Jose G. Moreno et.al. 2504.17360 null

2025-04-23

Publish Date Title Authors PDF Code
2025-04-23 AI for Accessible Education: Personalized Audio-Based Learning for Blind Students Crystal Yang et.al. 2504.17117 null
2025-04-23 Simplified Swarm Learning Framework for Robust and Scalable Diagnostic Services in Cancer Histopathology Yanjie Wu et.al. 2504.16732 null
2025-04-23 DP2FL: Dual Prompt Personalized Federated Learning in Foundation Models Ying Chang et.al. 2504.16357 null
2025-04-23 aiXamine: Simplified LLM Safety and Security Fatih Deniz et.al. 2504.14985 null

2025-04-22

Publish Date Title Authors PDF Code
2025-04-22 Towards a Distributed Federated Learning Aggregation Placement using Particle Swarm Intelligence Amir Ali-Pour et.al. 2504.16227 null
2025-04-22 LLMs meet Federated Learning for Scalable and Secure IoT Management Yazan Otoum et.al. 2504.16032 null
2025-04-22 Federated Latent Factor Learning for Recovering Wireless Sensor Networks Signal with Privacy-Preserving Chengjun Yu et.al. 2504.15525 null

2025-04-21

Publish Date Title Authors PDF Code
2025-04-21 From Reviews to Dialogues: Active Synthesis for Zero-Shot LLM-based Conversational Recommender System Rohan Surana et.al. 2504.15476 null
2025-04-21 Federated Latent Factor Model for Bias-Aware Recommendation with Privacy-Preserving Junxiang Gao et.al. 2504.15090 null
2025-04-21 Aligning Beam with Imbalanced Multi-modality: A Generative Federated Learning Approach Jiahui Liang et.al. 2504.14835 null

2025-04-19

Publish Date Title Authors PDF Code
2025-04-19 WeiDetect: Weibull Distribution-Based Defense against Poisoning Attacks in Federated Learning for Network Intrusion Detection Systems Sameera K. M. et.al. 2504.04367 null

2025-04-17

Publish Date Title Authors PDF Code
2025-04-17 SHA256 at SemEval-2025 Task 4: Selective Amnesia -- Constrained Unlearning for Large Language Models via Knowledge Isolation Saransh Agrawal et.al. 2504.12996 **[link](https://github.com/LAB-FLAIR/Constrained-Unlearning-for-LLM)**

2025-04-16

Publish Date Title Authors PDF Code
2025-04-16 Enhanced Battery Capacity Estimation in Data-Limited Scenarios through Swarm Learning Jiawei Zhang et.al. 2504.12444 null
2025-04-16 Federated Spectral Graph Transformers Meet Neural Ordinary Differential Equations for Non-IID Graphs Kishan Gurumurthy et.al. 2504.11808 **[link](https://github.com/springwiz11/fed-gnodeformer)**

2025-04-15

Publish Date Title Authors PDF Code
2025-04-15 Never Start from Scratch: Expediting On-Device LLM Personalization via Explainable Model Selection Haoming Wang et.al. 2504.13938 null
2025-04-15 FLSSM: A Federated Learning Storage Security Model with Homomorphic Encryption Yang Li et.al. 2504.11088 null
2025-04-15 Collaborative Bayesian Optimization via Wasserstein Barycenters Donglin Zhan et.al. 2504.10770 null

2025-04-14

Publish Date Title Authors PDF Code
2025-04-14 Optimising Intrusion Detection Systems in Cloud-Edge Continuum with Knowledge Distillation for Privacy-Preserving and Efficient Communication Soad Almabdy et.al. 2504.10698 null
2025-04-14 VAE-based Feature Disentanglement for Data Augmentation and Compression in Generalized GNSS Interference Classification Lucas Heublein et.al. 2504.10556 null
2025-04-14 Privacy-Preserving Distributed Link Predictions Among Peers in Online Classrooms Using Federated Learning Anurata Prabha Hridi et.al. 2504.10456 null
2025-04-14 Understanding the Impact of Data Domain Extraction on Synthetic Data Privacy Georgi Ganev et.al. 2504.08254 null

2025-04-12

Publish Date Title Authors PDF Code
2025-04-12 Deploying Large AI Models on Resource-Limited Devices with Split Federated Learning Xianke Qiang et.al. 2504.09114 null
2025-04-12 Large Language Models integration in Smart Grids Seyyedreza Madani et.al. 2504.09059 null

2025-04-11

Publish Date Title Authors PDF Code
2025-04-11 DataMap: A Portable Application for Visualizing High-Dimensional Data Xijin Ge et.al. 2504.08875 **[link](https://github.com/gexijin/datamap)**
2025-04-11 Personalizing Federated Learning for Hierarchical Edge Networks with Non-IID Data Seunghyun Lee et.al. 2504.08872 null

2025-04-10

Publish Date Title Authors PDF Code
2025-04-10 Traversal Learning Coordination For Lossless And Efficient Distributed Learning Erdenebileg Batbaatar et.al. 2504.07471 null
2025-04-10 FAST: Federated Active Learning with Foundation Models for Communication-efficient Sampling and Training Haoyuan Li et.al. 2504.03783 null

2025-04-05

Publish Date Title Authors PDF Code
2025-04-05 Unmasking the Reality of PII Masking Models: Performance Gaps and the Call for Accountability Devansh Singh et.al. 2504.12308 null
2025-04-05 Multi-Agent Reinforcement Learning for Graph Discovery in D2D-Enabled Federated Learning Satyavrat Wagle et.al. 2503.23218 null

2025-04-04

Publish Date Title Authors PDF Code
2025-04-04 Secure Federated XGBoost with CUDA-accelerated Homomorphic Encryption via NVIDIA FLARE Ziyue Xu et.al. 2504.03909 null
2025-04-04 An Intelligent and Privacy-Preserving Digital Twin Model for Aging-in-Place Yongjie Wang et.al. 2504.03798 null
2025-04-04 Hierarchical Knowledge Structuring for Effective Federated Learning in Heterogeneous Environments Wai Fong Tam et.al. 2504.03505 null

2025-04-03

Publish Date Title Authors PDF Code
2025-04-03 Enhancing Air Quality Monitoring: A Brief Review of Federated Learning Advances Sara Yarham et.al. 2504.02909 null
2025-04-03 Web3DB: Web 3.0 RDBMS for Individual Data Ownership Shankha Shubhra Mukherjee et.al. 2504.02713 null

2025-04-02

Publish Date Title Authors PDF Code
2025-04-02 Efficient Federated Learning Tiny Language Models for Mobile Network Feature Prediction Daniel Becking et.al. 2504.01947 null
2025-04-02 CO-DEFEND: Continuous Decentralized Federated Learning for Secure DoH-Based Threat Detection Diego Cajaraville-Aboy et.al. 2504.01882 **[link](https://gitlab.com/compromise3/co-defend)**
2025-04-02 A Two-Timescale Approach for Wireless Federated Learning with Parameter Freezing and Power Control Jinhao Ouyang et.al. 2504.01752 null
2025-04-02 Sky of Unlearning (SoUL): Rewiring Federated Machine Unlearning via Selective Pruning Md Mahabub Uz Zaman et.al. 2504.01705 null
2025-04-02 Split Federated Learning for UAV-Enabled Integrated Sensing, Computation, and Communication Xiangwang Hou et.al. 2504.01443 null
2025-04-02 TeleAntiFraud-28k: An Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection Zhiming Ma et.al. 2503.24115 **[link](https://github.com/jimmyma99/teleantifraud)**
2025-04-02 SimDC: A High-Fidelity Device Simulation Platform for Device-Cloud Collaborative Computing Ruiguang Pei et.al. 2503.22288 **[link](https://github.com/opas-lab/olearning-sim)**

2025-04-01

Publish Date Title Authors PDF Code
2025-04-01 AI Regulation and Capitalist Growth: Balancing Innovation, Ethics, and Global Governance Vikram Kulothungan et.al. 2504.02000 null
2025-04-01 Benchmarking Federated Machine Unlearning methods for Tabular Data Chenguang Xiao et.al. 2504.00921 null
2025-04-01 A Survey on Unlearnable Data Jiahao Li et.al. 2503.23536 **[link](https://github.com/LiJiahao-Alex/Awesome-UnLearnable-Data)**

2025-03-31

Publish Date Title Authors PDF Code
2025-03-31 Federated Learning for Cross-Domain Data Privacy: A Distributed Approach to Secure Collaboration Yiwei Zhang et.al. 2504.00282 null
2025-03-31 Communication-Efficient and Personalized Federated Foundation Model Fine-Tuning via Tri-Matrix Adaptation Yongle Li et.al. 2503.23869 null
2025-03-31 VIDEX: A Disaggregated and Extensible Virtual Index for the Cloud and AI Era Rong Kang et.al. 2503.23776 **[link](https://github.com/bytedance/videx)**

2025-03-29

Publish Date Title Authors PDF Code
2025-03-29 Towards Secure Semantic Communications in the Presence of Intelligent Eavesdroppers Shunpu Tang et.al. 2503.23103 null
2025-03-29 Disentangled Source-Free Personalization for Facial Expression Recognition with Neutral Target Data Masoumeh Sharafi et.al. 2503.20771 **[link](https://github.com/MasoumehSharafi/DSFDA-for-Pain-Assessment)**

2025-03-28

Publish Date Title Authors PDF Code
2025-03-28 Efficient Verified Machine Unlearning For Distillation Yijun Quan et.al. 2503.22539 null

2025-03-27

Publish Date Title Authors PDF Code
2025-03-27 Adaptive Clipping for Privacy-Preserving Few-Shot Learning: Enhancing Generalization with Limited Data Kanishka Ranaweera et.al. 2503.22749 null
2025-03-27 Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation Reza Qorbani et.al. 2503.21780 **[link](https://github.com/rezaqorbani/semla)**
2025-03-27 Federated Intelligence: When Large AI Models Meet Federated Fine-Tuning and Collaborative Reasoning at the Network Edge Wanli Ni et.al. 2503.21412 null
2025-03-27 Improving $(α, f)$ -Byzantine Resilience in Federated Learning via layerwise aggregation and cosine distance Mario García-Márquez et.al. 2503.21244 **[link](https://github.com/ari-dasci/S-layerwise_cosine_aggregation)**
2025-03-27 Federated Learning with Differential Privacy: An Utility-Enhanced Approach Kanishka Ranaweera et.al. 2503.21154 null

2025-03-26

Publish Date Title Authors PDF Code
2025-03-26 Privacy in Immersive Extended Reality: Exploring User Perceptions, Concerns, and Coping Strategies Hilda Hadan et.al. 2503.21010 null
2025-03-26 MedSegNet10: A Publicly Accessible Network Repository for Split Federated Medical Image Segmentation Chamani Shiranthika et.al. 2503.20830 null
2025-03-26 Continual learning via probabilistic exchangeable sequence modelling Hanwen Xing et.al. 2503.20725 null
2025-03-26 How Secure is Forgetting? Linking Machine Unlearning to Machine Learning Attacks Muhammed Shafi K. P. et.al. 2503.20257 null

2025-03-25

Publish Date Title Authors PDF Code
2025-03-25 Federated Learning: A new frontier in the exploration of multi-institutional medical imaging data Dominika Ciupek et.al. 2503.20107 null
2025-03-25 Distributed Stochastic Zeroth-Order Optimization with Compressed Communication Youqing Hua et.al. 2503.17429 null

2025-03-24

Publish Date Title Authors PDF Code
2025-03-24 Two Types of Data Privacy Controls Eman Alashwali et.al. 2503.18729 null
2025-03-24 The Role of Artificial Intelligence in Enhancing Insulin Recommendations and Therapy Outcomes Maria Panagiotou et.al. 2503.18592 null
2025-03-24 Surgical Action Planning with Large Language Models Mengya Xu et.al. 2503.18296 null
2025-03-24 Zero-Knowledge Federated Learning: A New Trustworthy and Privacy-Preserving Distributed Learning Paradigm Yuxin Jin et.al. 2503.15550 null

2025-03-23

Publish Date Title Authors PDF Code
2025-03-23 FROG: Fair Removal on Graphs Ziheng Chen et.al. 2503.18197 null
2025-03-23 Active Inference for Energy Control and Planning in Smart Buildings and Communities Seyyed Danial Nazemi et.al. 2503.18161 null
2025-03-23 Dynamic Gradient Sparse Update for Edge Training I-Hsuan Li et.al. 2503.17959 null

2025-03-22

Publish Date Title Authors PDF Code
2025-03-22 Feather-SQL: A Lightweight NL2SQL Framework with Dual-Model Collaboration Paradigm for Small Language Models Wenqi Pei et.al. 2503.17811 null
2025-03-22 Decentralized Federated Dataset Dictionary Learning for Multi-Source Domain Adaptation Rebecca Clain et.al. 2503.17683 null
2025-03-22 A Qualitative Study of User Perception of M365 AI Copilot Muneera Bano et.al. 2503.17661 null

2025-03-21

Publish Date Title Authors PDF Code
2025-03-21 Collaborative Value Function Estimation Under Model Mismatch: A Federated Temporal Difference Analysis Ali Beikmohammadi et.al. 2503.17454 **[link](https://github.com/AliBeikmohammadi/FedRL)**
2025-03-21 On-Device Federated Continual Learning on RISC-V-based Ultra-Low-Power SoC for Intelligent Nano-Drone Swarms Lars Kröger et.al. 2503.17436 null
2025-03-21 A Thorough Assessment of the Non-IID Data Impact in Federated Learning Daniel M. Jimenez-Gutierrez et.al. 2503.17070 null

2025-03-20

Publish Date Title Authors PDF Code
2025-03-20 Empirical Analysis of Privacy-Fairness-Accuracy Trade-offs in Federated Learning: A Step Towards Responsible AI Dawood Wasif et.al. 2503.16233 null
2025-03-20 Privacy-Preserving Utilization of Distribution System Flexibility for Enhanced TSO-DSO Interoperability: A Novel Machine Learning-Based Optimal Power Flow Approach Burak Dindar et.al. 2503.15966 null

2025-03-19

Publish Date Title Authors PDF Code
2025-03-19 ChatGPT or A Silent Everywhere Helper: A Survey of Large Language Models Azim Akhtarshenas et.al. 2503.17403 null
2025-03-19 AEJIM: A Real-Time AI Framework for Crowdsourced, Transparent, and Ethical Environmental Hazard Detection and Reporting Torsten Tiltack et.al. 2503.17401 null
2025-03-19 Distributed Generalized Linear Models: A Privacy-Preserving Approach Daniel Tinoco et.al. 2503.15287 **[link](https://github.com/dbtnc/distributed_glm)**
2025-03-19 Online federated learning framework for classification Wenxing Guo et.al. 2503.15210 null
2025-03-19 Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices Ziyao Wang et.al. 2503.14932 null

2025-03-18

Publish Date Title Authors PDF Code
2025-03-18 Advanced Relay-Based Collaborative Framework for Optimizing Synchronization in Split Federated Learning over Wireless Networks Haoran Gao et.al. 2503.15559 null
2025-03-18 Anomaly-Flow: A Multi-domain Federated Generative Adversarial Network for Distributed Denial-of-Service Detection Leonardo Henrique de Melo et.al. 2503.14618 **[link](https://github.com/c2dc/anomaly-flow)**
2025-03-18 Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels Yujia Tong et.al. 2503.13917 null
2025-03-18 Automatic MILP Model Construction for Multi-Robot Task Allocation and Scheduling Based on Large Language Models Mingming Peng et.al. 2503.13813 null

2025-03-17

Publish Date Title Authors PDF Code
2025-03-17 The Impact of Artificial Intelligence on Emergency Medicine: A Review of Recent Advances Gustavo Correia et.al. 2503.14546 null
2025-03-17 Regulating Ai In Financial Services: Legal Frameworks And Compliance Challenges Shahmar Mirishli et.al. 2503.14541 null
2025-03-17 SDFLMQ: A Semi-Decentralized Federated Learning Framework over MQTT Amir Ali-Pour et.al. 2503.13624 null
2025-03-17 How Good is my Histopathology Vision-Language Foundation Model? A Holistic Benchmark Roba Al Majzoub et.al. 2503.12990 **[link](https://github.com/musk007/Histopathology_Benchmark)**

2025-03-16

Publish Date Title Authors PDF Code
2025-03-16 Synthetic Data for Robust AI Model Development in Regulated Enterprises Aditi Godbole et.al. 2503.12353 null
2025-03-16 Synthesizing Privacy-Preserving Text Data via Finetuning without Finetuning Billion-Scale LLMs Bowen Tan et.al. 2503.12347 null

2025-03-15

Publish Date Title Authors PDF Code
2025-03-15 From Laboratory to Real World: A New Benchmark Towards Privacy-Preserved Visible-Infrared Person Re-Identification Yan Jiang et.al. 2503.12232 null
2025-03-15 Efficient and Privacy-Preserved Link Prediction via Condensed Graphs Yunbo Long et.al. 2503.12156 null
2025-03-15 A Survey on Federated Fine-tuning of Large Language Models Yebo Wu et.al. 2503.12016 **[link](https://github.com/clin0212/awesome-federated-llm-learning)**

Model Privacy

2025-06-03

Publish Date Title Authors PDF Code
2025-06-03 Privacy Leaks by Adversaries: Adversarial Iterations for Membership Inference Attack Jing Xue et.al. 2506.02711 null

2025-05-15

Publish Date Title Authors PDF Code
2025-05-15 Private Transformer Inference in MLaaS: A Survey Yang Li et.al. 2505.10315 null

2025-05-09

Publish Date Title Authors PDF Code
2025-05-09 From Models to Network Topologies: A Topology Inference Attack in Decentralized Federated Learning Chao Feng et.al. 2501.03119 null

2025-05-05

Publish Date Title Authors PDF Code
2025-05-05 Bayes-Nash Generative Privacy Against Membership Inference Attacks Tao Zhang et.al. 2410.07414 null

2025-04-18

Publish Date Title Authors PDF Code
2025-04-18 Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification Yue Li et.al. 2504.11793 null

2025-04-15

Publish Date Title Authors PDF Code
2025-04-15 FLSSM: A Federated Learning Storage Security Model with Homomorphic Encryption Yang Li et.al. 2504.11088 null

2025-04-12

Publish Date Title Authors PDF Code
2025-04-12 Footprints of Data in a Classifier: Understanding the Privacy Risks and Solution Strategies Payel Sadhukhan et.al. 2407.02268 null

2025-04-07

Publish Date Title Authors PDF Code
2025-04-07 Enhancing Trust in AI Marketplaces: Evaluating On-Chain Verification of Personalized AI models using zk-SNARKs Nishant Jagannath et.al. 2504.04794 null

2025-03-26

Publish Date Title Authors PDF Code
2025-03-26 Modelling Privacy Compliance in Cross-border Data Transfers with Bigraphs Ebtihal Althubiti et.al. 2503.20464 null

2025-03-19

Publish Date Title Authors PDF Code
2025-03-19 Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices Ziyao Wang et.al. 2503.14932 null

2025-03-11

Publish Date Title Authors PDF Code
2025-03-11 Multi-P $^2$ A: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models Jie Zhang et.al. 2412.19496 **[link](https://github.com/Xiangkui-Cao/Multi-P2A)**

2025-02-27

Publish Date Title Authors PDF Code
2025-02-27 GOD model: Privacy Preserved AI School for Personal Assistant PIN AI Team et.al. 2502.18527 **[link](https://github.com/pin-ai/god-model)**

2025-02-21

Publish Date Title Authors PDF Code
2025-02-21 Model Privacy: A Unified Framework to Understand Model Stealing Attacks and Defenses Ganghua Wang et.al. 2502.15567 null

2025-02-14

Publish Date Title Authors PDF Code
2025-02-14 An Interactive Framework for Implementing Privacy-Preserving Federated Learning: Experiments on Large Language Models Kasra Ahmadi et.al. 2502.08008 **[link](https://github.com/KasraAhmadi/FL-Privacy-LLM)**

2025-02-08

Publish Date Title Authors PDF Code
2025-02-08 Dual Defense: Enhancing Privacy and Mitigating Poisoning Attacks in Federated Learning Runhua Xu et.al. 2502.05547 **[link](https://github.com/irxyzzz/DualDefense)**

2025-01-06

Publish Date Title Authors PDF Code
2025-01-06 Pathway to Secure and Trustworthy ZSM for LLMs: Attacks, Defense, and Opportunities Sunder Ali Khowaja et.al. 2408.00722 null

2025-01-04

Publish Date Title Authors PDF Code
2025-01-04 AdaMixup: A Dynamic Defense Framework for Membership Inference Attack Mitigation Ying Chen et.al. 2501.02182 null

2024-12-13

Publish Date Title Authors PDF Code
2024-12-13 ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression Kai Yao et.al. 2412.09812 null

2024-10-29

Publish Date Title Authors PDF Code
2024-10-29 PrivCirNet: Efficient Private Inference via Block Circulant Transformation Tianshi Xu et.al. 2405.14569 **[link](https://github.com/tianshi-xu/privcirnet)**

2024-10-24

Publish Date Title Authors PDF Code
2024-10-24 Does Differential Privacy Impact Bias in Pretrained NLP Models? Md. Khairul Islam et.al. 2410.18749 **[link](https://github.com/khairulislam/dp-on-nlp-bias)**

2024-10-01

Publish Date Title Authors PDF Code
2024-10-01 PrivTuner with Homomorphic Encryption and LoRA: A P3EFT Scheme for Privacy-Preserving Parameter-Efficient Fine-Tuning of AI Foundation Models Yang Li et.al. 2410.00433 null

2024-09-10

Publish Date Title Authors PDF Code
2024-09-10 A Pervasive, Efficient and Private Future: Realizing Privacy-Preserving Machine Learning Through Hybrid Homomorphic Encryption Khoa Nguyen et.al. 2409.06422 **[link](https://github.com/khoaguin/pockethhe)**

2024-07-23

Publish Date Title Authors PDF Code
2024-07-23 Representation Magnitude has a Liability to Privacy Vulnerability Xingli Fang et.al. 2407.16164 **[link](https://github.com/jekimlab/aies2024_srcm)**

2024-06-26

Publish Date Title Authors PDF Code
2024-06-26 Natural Language but Omitted? On the Ineffectiveness of Large Language Models' privacy policy from End-users' Perspective Shuning Zhang et.al. 2406.18100 null

2024-06-21

Publish Date Title Authors PDF Code
2024-06-21 A Survey on Intelligent Internet of Things: Applications, Security, Privacy, and Future Directions Ons Aouedi et.al. 2406.03820 null

2024-06-16

Publish Date Title Authors PDF Code
2024-06-16 Promoting Data and Model Privacy in Federated Learning through Quantized LoRA JianHao Zhu et.al. 2406.10976 null

2024-04-17

Publish Date Title Authors PDF Code
2024-04-17 OmniLytics+: A Secure, Efficient, and Affordable Blockchain Data Market for Machine Learning through Off-Chain Processing Songze Li et.al. 2406.06477 null

Forensics

2025-06-26

Publish Date Title Authors PDF Code
2025-06-26 Post-training for Deepfake Speech Detection Wanying Ge et.al. 2506.21090 null
2025-06-26 IndieFake Dataset: A Benchmark Dataset for Audio Deepfake Detection Abhay Kumar et.al. 2506.19014 null

2025-06-25

Publish Date Title Authors PDF Code
2025-06-25 Pay Less Attention to Deceptive Artifacts: Robust Detection of Compressed Deepfakes on Online Social Networks Manyi Li et.al. 2506.20548 null

2025-06-21

Publish Date Title Authors PDF Code
2025-06-21 SELFI: Selective Fusion of Identity for Generalizable Deepfake Detection Younghun Kim et.al. 2506.17592 null

2025-06-20

Publish Date Title Authors PDF Code
2025-06-20 Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection Yuchu Jiang et.al. 2506.16819 **[link](https://github.com/kamichanw/loupe)**

2025-06-18

Publish Date Title Authors PDF Code
2025-06-18 I Know Which LLM Wrote Your Code Last Summer: LLM generated Code Stylometry for Authorship Attribution Tamas Bisztray et.al. 2506.17323 null

2025-06-17

Publish Date Title Authors PDF Code
2025-06-17 A Comparative Study on Proactive and Passive Detection of Deepfake Speech Chia-Hua Wu et.al. 2506.14398 **[link](https://github.com/nii-yamagishilab/antispoofing-watermark)**
2025-06-17 Manipulated Regions Localization For Partially Deepfake Audio: A Survey Jiayi He et.al. 2506.14396 null

2025-06-15

Publish Date Title Authors PDF Code
2025-06-15 Governments Should Mandate Tiered Anonymity on Social-Media Platforms to Counter Deepfakes and LLM-Driven Mass Misinformation David Khachaturov et.al. 2506.12814 null

2025-06-14

Publish Date Title Authors PDF Code
2025-06-14 Towards Neural Audio Codec Source Parsing Orchid Chetia Phukan et.al. 2506.12627 null

2025-06-13

Publish Date Title Authors PDF Code
2025-06-13 From Sharpness to Better Generalization for Speech Deepfake Detection Wen Huang et.al. 2506.11532 **[link](https://github.com/nii-yamagishilab/SAM-AntiSpoofing)**
2025-06-13 FAME: A Lightweight Spatio-Temporal Network for Model Attribution of Face-Swap Deepfakes Wasim Ahmad et.al. 2506.11477 **[link](https://github.com/wasim004/FAME)**
2025-06-13 A Watermark for Auto-Regressive Image Generation Models Yihan Wu et.al. 2506.11371 null

2025-06-12

Publish Date Title Authors PDF Code
2025-06-12 Enhancing Deepfake Detection using SE Block Attention with CNN Subhram Dasgupta et.al. 2506.10683 null
2025-06-12 LLMs Are Not Yet Ready for Deepfake Image Detection Shahroz Tariq et.al. 2506.10474 null
2025-06-12 TikTok's Research API: Problems Without Explanations Carlos Entrena-Serrano et.al. 2506.09746 null

2025-06-11

Publish Date Title Authors PDF Code
2025-06-11 Unmasking real-world audio deepfakes: A data-centric approach David Combei et.al. 2506.09606 **[link](https://github.com/davidcombei/ai4t)**
2025-06-11 TADA: Training-free Attribution and Out-of-Domain Detection of Audio Deepfakes Adriana Stan et.al. 2506.05802 **[link](https://github.com/adrianastan/tada)**

2025-06-10

Publish Date Title Authors PDF Code
2025-06-10 Risks & Benefits of LLMs & GenAI for Platform Integrity, Healthcare Diagnostics, Cybersecurity, Privacy & AI Safety: A Comprehensive Survey, Roadmap & Implementation Blueprint Kiarash Ahi et.al. 2506.12088 null
2025-06-10 Context-aware TFL: A Universal Context-aware Contrastive Learning Framework for Temporal Forgery Localization Qilin Yin et.al. 2506.08493 null
2025-06-10 Multimodal Zero-Shot Framework for Deepfake Hate Speech Detection in Low-Resource Languages Rishabh Ranjan et.al. 2506.08372 null
2025-06-10 Towards Generalized Source Tracing for Codec-Based Deepfake Speech Xuanjun Chen et.al. 2506.07294 null

2025-06-09

Publish Date Title Authors PDF Code
2025-06-09 Lightweight Joint Audio-Visual Deepfake Detection via Single-Stream Multi-Modal Learning Framework Kuiyuan Zhang et.al. 2506.07358 null

2025-06-07

Publish Date Title Authors PDF Code
2025-06-07 Identity Deepfake Threats to Biometric Authentication Systems: Public and Expert Perspectives Shijing He et.al. 2506.06825 null
2025-06-07 SynHate: Detecting Hate Speech in Synthetic Deepfake Audio Rishabh Ranjan et.al. 2506.06772 null

2025-06-06

Publish Date Title Authors PDF Code
2025-06-06 DeepFake Doctor: Diagnosing and Treating Audio-Video Fake Detection Marcel Klemt et.al. 2506.05851 null

2025-06-05

Publish Date Title Authors PDF Code
2025-06-05 SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms Arnesh Batra et.al. 2506.05538 null
2025-06-05 Practical Manipulation Model for Robust Deepfake Detection Benedikt Hopf et.al. 2506.05119 null
2025-06-05 STOPA: A Database of Systematic VariaTion Of DeePfake Audio for Open-Set Source Tracing and Attribution Anton Firc et.al. 2505.19644 null

2025-06-04

Publish Date Title Authors PDF Code
2025-06-04 AuthGuard: Generalizable Deepfake Detection via Language Guidance Guangyu Shen et.al. 2506.04501 null

2025-06-03

Publish Date Title Authors PDF Code
2025-06-03 A Data-Driven Diffusion-based Approach for Audio Deepfake Explanations Petr Grinberg et.al. 2506.03425 null
2025-06-03 Towards Source Attribution of Singing Voice Deepfake with Multimodal Foundation Models Orchid Chetia Phukan et.al. 2506.03364 null
2025-06-03 DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models Jiarui Wang et.al. 2506.03007 null
2025-06-03 PartialEdit: Identifying Partial Deepfakes in the Era of Neural Speech Editing You Zhang et.al. 2506.02958 null
2025-06-03 Enhancing Abnormality Identification: Robust Out-of-Distribution Strategies for Deepfake Detection Luca Maiano et.al. 2506.02857 null
2025-06-03 Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection Jiaxin Liu et.al. 2505.16512 null

2025-06-02

Publish Date Title Authors PDF Code
2025-06-02 Unveiling Audio Deepfake Origins: A Deep Metric learning And Conformer Network Approach With Ensemble Fusion Ajinkya Kulkarni et.al. 2506.02085 null

2025-06-01

Publish Date Title Authors PDF Code
2025-06-01 Multiverse Through Deepfakes: The MultiFakeVerse Dataset of Person-Centric Visual and Conceptual Manipulations Parul Gupta et.al. 2506.00868 **[link](https://github.com/parul-gupta/multifakeverse)**
2025-06-01 Replay Attacks Against Audio Deepfake Detection Nicolas Müller et.al. 2505.14862 null

2025-05-31

Publish Date Title Authors PDF Code
2025-05-31 XMAD-Bench: Cross-Domain Multilingual Audio Deepfake Benchmark Ioan-Paul Ciobanu et.al. 2506.00462 **[link](https://github.com/ristea/xmad-bench)**
2025-05-31 RPRA-ADD: Forgery Trace Enhancement-Driven Audio Deepfake Detection Ruibo Fu et.al. 2506.00375 null

2025-05-30

Publish Date Title Authors PDF Code
2025-05-30 TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection Xinqi Xiong et.al. 2505.24866 null
2025-05-30 Rehearsal with Auxiliary-Informed Sampling for Audio Deepfake Detection Falih Gozi Febrinanto et.al. 2505.24486 null
2025-05-30 Benchmarking Foundation Models for Zero-Shot Biometric Tasks Redwan Sony et.al. 2505.24214 null

2025-05-29

Publish Date Title Authors PDF Code
2025-05-29 Few-Shot Speech Deepfake Detection Adaptation with Gaussian Processes Neta Glazer et.al. 2505.23619 **[link](https://github.com/NetaGlazer/ADD-GP)**

2025-05-28

Publish Date Title Authors PDF Code
2025-05-28 Speaking images. A novel framework for the automated self-description of artworks Valentine Bernasconi et.al. 2506.05368 null
2025-05-28 Tell me Habibi, is it Real or Fake? Kartik Kuckreja et.al. 2505.22581 null

2025-05-27

Publish Date Title Authors PDF Code
2025-05-27 RoGA: Towards Generalizable Deepfake Detection through Robust Gradient Alignment Lingyu Qiu et.al. 2505.20653 null

2025-05-26

Publish Date Title Authors PDF Code
2025-05-26 ArVoice: A Multi-Speaker Dataset for Arabic Speech Synthesis Hawau Olamide Toyin et.al. 2505.20506 null
2025-05-26 Guard Me If You Know Me: Protecting Specific Face-Identity from Deepfakes Kaiqing Lin et.al. 2505.19582 null

2025-05-25

Publish Date Title Authors PDF Code
2025-05-25 EnvSDD: Benchmarking Environmental Sound Deepfake Detection Han Yin et.al. 2505.19203 null

2025-05-24

Publish Date Title Authors PDF Code
2025-05-24 Think Twice before Adaptation: Improving Adaptability of DeepFake Detection via Online Test-Time Adaptation Hong-Hanh Nguyen-Le et.al. 2505.18787 **[link](https://github.com/honghanh2104/t2a-think-twice-before-adaptation)**
2025-05-24 HyperFake: Hyperspectral Reconstruction and Attention-Guided Analysis for Advanced Deepfake Detection Pavan C Shekar et.al. 2505.18587 null
2025-05-24 Preserving AUC Fairness in Learning with Noisy Protected Groups Mingyang Wu et.al. 2505.18532 **[link](https://github.com/purdue-m2/auc_fairness_with_noisy_groups)**

2025-05-23

Publish Date Title Authors PDF Code
2025-05-23 CAMME: Adaptive Deepfake Image Detection with Multi-Modal Cross-Attention Naseem Khan et.al. 2505.18035 **[link](https://github.com/magnet300/camme)**
2025-05-23 What You Read Isn't What You Hear: Linguistic Sensitivity in Deepfake Speech Detection Binh Nguyen et.al. 2505.17513 null

2025-05-22

Publish Date Title Authors PDF Code
2025-05-22 Do DeepFake Attribution Models Generalize? Spiros Baxavanakis et.al. 2505.21520 null

2025-05-21

Publish Date Title Authors PDF Code
2025-05-21 My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping Hon Ming Yam et.al. 2505.15336 null
2025-05-21 CAD: A General Multimodal Framework for Video Deepfake Detection via Cross-Modal Alignment and Distillation Yuxuan Du et.al. 2505.15233 null
2025-05-21 BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation Haiquan Wen et.al. 2505.12620 **[link](https://github.com/l8cv/busterx)**

2025-05-20

Publish Date Title Authors PDF Code
2025-05-20 Listen, Analyze, and Adapt to Learn New Attacks: An Exemplar-Free Class Incremental Learning Method for Audio Deepfake Source Tracing Yang Xiao et.al. 2505.14601 null
2025-05-20 Source Verification for Speech Deepfakes Viola Negroni et.al. 2505.14188 null
2025-05-20 Naturalness-Aware Curriculum Learning with Dynamic Temperature for Speech Deepfake Detection Taewoo Kim et.al. 2505.13976 null
2025-05-20 BiCrossMamba-ST: Speech Deepfake Detection with Bidirectional Mamba Spectro-Temporal Cross-Attention Yassine El Kheir et.al. 2505.13930 null
2025-05-20 Forensic deepfake audio detection using segmental speech features Tianle Yang et.al. 2505.13847 null
2025-05-20 Benchmarking Unified Face Attack Detection via Hierarchical Prompt Tuning Ajian Liu et.al. 2505.13327 null

2025-05-19

Publish Date Title Authors PDF Code
2025-05-19 Codec-Based Deepfake Source Tracing via Neural Audio Codec Taxonomy Xuanjun Chen et.al. 2505.12994 null
2025-05-19 Multiscale Adaptive Conflict-Balancing Model For Multimedia Deepfake Detection Zihan Xiong et.al. 2505.12966 null

2025-05-18

Publish Date Title Authors PDF Code
2025-05-18 Towards Open-world Generalized Deepfake Detection: General Feature Extraction via Unsupervised Domain Adaptation Midou Guo et.al. 2505.12339 null
2025-05-18 Is Artificial Intelligence Generated Image Detection a Solved Problem? Ziqiang Li et.al. 2505.12335 **[link](https://github.com/horizontel/aigibench)**

2025-05-16

Publish Date Title Authors PDF Code
2025-05-16 X-Edit: Detecting and Localizing Edits in Images Altered by Text-Guided Diffusion Models Valentina Bazyleva et.al. 2505.11753 null
2025-05-16 Deepfake Forensic Analysis: Source Dataset Attribution and Legal Implications of Synthetic Media Manipulation Massimiliano Cassia et.al. 2505.11110 null
2025-05-16 MAVOS-DD: Multilingual Audio-Video Open-Set Deepfake Detection Benchmark Florinel-Alin Croitoru et.al. 2505.11109 null
2025-05-16 $\mathcal{A}LLM4ADD$ : Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection Hao Gu et.al. 2505.11079 null
2025-05-16 ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization Bo Du et.al. 2505.11003 **[link](https://github.com/scu-zjz/forensichub)**
2025-05-16 BanglaFake: Constructing and Evaluating a Specialized Bengali Deepfake Audio Dataset Istiaq Ahmed Fahad et.al. 2505.10885 **[link](https://github.com/KamruzzamanAsif/BanglaFake)**
2025-05-16 Visual Watermarking in the Era of Diffusion Models: Advances and Challenges Junxian Duan et.al. 2505.08197 null

2025-05-15

Publish Date Title Authors PDF Code
2025-05-15 Characterizing AI-Generated Misinformation on Social Media Chiara Drolsbach et.al. 2505.10266 null

2025-05-14

Publish Date Title Authors PDF Code
2025-05-14 WaveGuard: Robust Deepfake Detection and Source Tracing via Dual-Tree Complex Wavelet and Graph Neural Networks Ziyuan He et.al. 2505.08614 **[link](https://github.com/vpsg-research/waveguard)**

2025-05-13

Publish Date Title Authors PDF Code
2025-05-13 DFA-CON: A Contrastive Learning Approach for Detecting Copyright Infringement in DeepFake Art Haroon Wahab et.al. 2505.08552 null
2025-05-13 TT-DF: A Large-Scale Diffusion-Based Dataset and Benchmark for Human Body Forgery Detection Wenkui Yang et.al. 2505.08437 **[link](https://github.com/hashtag00002/tt-df)**
2025-05-13 FauForensics: Boosting Audio-Visual Deepfake Detection with Facial Action Units Jian Wang et.al. 2505.08294 null
2025-05-13 Where the Devil Hides: Deepfake Detectors Can No Longer Be Trusted Shuaiwei Yuan et.al. 2505.08255 null

2025-05-11

Publish Date Title Authors PDF Code
2025-05-11 Multimodal Fake News Detection: MFND Dataset and Shallow-Deep Multitask Learning Ye Zhu et.al. 2505.06796 **[link](https://github.com/yunan-wang33/sdml)**

2025-05-10

Publish Date Title Authors PDF Code
2025-05-10 Beyond Identity: A Generalizable Approach for Deepfake Audio Detection Yasaman Ahmadiadli et.al. 2505.06766 null
2025-05-10 Unmasking Deep Fakes: Leveraging Deep Learning for Video Authenticity Detection Mahmudul Hasan et.al. 2505.06528 null

2025-05-08

Publish Date Title Authors PDF Code
2025-05-08 Cross-Branch Orthogonality for Improved Generalization in Face Deepfake Detection Tharindu Fernando et.al. 2505.04888 null

2025-05-07

Publish Date Title Authors PDF Code
2025-05-07 Perpetuating Misogyny with Generative AI: How Model Personalization Normalizes Gendered Harm Laura Wagner et.al. 2505.04600 null
2025-05-07 Learning Real Facial Concepts for Independent Deepfake Detection Ming-Hui Liu et.al. 2505.04460 null
2025-05-07 DATA: Multi-Disentanglement based Contrastive Learning for Open-World Semi-Supervised Deepfake Attribution Ming-Hui Liu et.al. 2505.04384 null
2025-05-07 From Incidents to Insights: Patterns of Responsibility following AI Harms Isabel Richards et.al. 2505.04291 null

2025-05-06

Publish Date Title Authors PDF Code
2025-05-06 Deepfakes on Demand: the rise of accessible non-consensual deepfake image generators Will Hawkins et.al. 2505.03859 **[link](https://github.com/WillHawkins3/deepfakesondemand)**

2025-05-04

Publish Date Title Authors PDF Code
2025-05-04 Robust AI-Generated Face Detection with Imbalanced Data Yamini Sri Krubha et.al. 2505.02182 **[link](https://github.com/purdue-m2/sp_cup)**
2025-05-04 MLLM-Enhanced Face Forgery Detection: A Vision-Language Fusion Solution Siran Peng et.al. 2505.02013 null

2025-05-03

Publish Date Title Authors PDF Code
2025-05-03 Detecting Musical Deepfakes Nick Sunday et.al. 2505.09633 **[link](https://github.com/nicksunday/deepfake-music-detector)**

2025-05-01

Publish Date Title Authors PDF Code
2025-05-01 AWARE-NET: Adaptive Weighted Averaging for Robust Ensemble Network in Deepfake Detection Muhammad Salman et.al. 2505.00312 **[link](https://github.com/recluzegeek/AWARE-NET)**

2025-04-30

Publish Date Title Authors PDF Code
2025-04-30 Generative AI in Financial Institution: A Global Survey of Opportunities, Threats, and Regulation Bikash Saha et.al. 2504.21574 null
2025-04-30 End-to-end Audio Deepfake Detection from RAW Waveforms: a RawNet-Based Approach with Cross-Dataset Evaluation Andrea Di Pierno et.al. 2504.20923 null

2025-04-29

Publish Date Title Authors PDF Code
2025-04-29 A Brief Review for Compression and Transfer Learning Techniques in DeepFake Detection Andreas Karathanasis et.al. 2504.21066 null
2025-04-29 TriniMark: A Robust Generative Speech Watermarking Method for Trinity-Level Attribution Yue Li et.al. 2504.20532 null

2025-04-27

Publish Date Title Authors PDF Code
2025-04-27 Balancing Creativity and Automation: The Influence of AI on Modern Film Production and Dissemination Yiren Xu et.al. 2504.19275 null
2025-04-27 CapsFake: A Multimodal Capsule Network for Detecting Instruction-Guided Deepfakes Tuan Nguyen et.al. 2504.19212 null

2025-04-24

Publish Date Title Authors PDF Code
2025-04-24 Towards Generalizable Deepfake Detection with Spatial-Frequency Collaborative Learning and Hierarchical Cross-Modal Fusion Mengyu Qiao et.al. 2504.17223 null

2025-04-19

Publish Date Title Authors PDF Code
2025-04-19 BMRL: Bi-Modal Guided Multi-Perspective Representation Learning for Zero-Shot Deepfake Attribution Yaning Zhang et.al. 2504.14129 null

2025-04-18

Publish Date Title Authors PDF Code
2025-04-18 MLEP: Multi-granularity Local Entropy Patterns for Universal AI-generated Image Detection Lin Yuan et.al. 2504.13726 null

2025-04-16

Publish Date Title Authors PDF Code
2025-04-16 Benchmarking Audio Deepfake Detection Robustness in Real-world Communication Scenarios Haohan Shi et.al. 2504.12423 null

2025-04-15

Publish Date Title Authors PDF Code
2025-04-15 Big Brother is Watching: Proactive Deepfake Detection via Learnable Hidden Face Hongbo Li et.al. 2504.11309 null
2025-04-15 Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy Botao Zhao et.al. 2504.10819 null

2025-04-14

Publish Date Title Authors PDF Code
2025-04-14 SafeSpeech: Robust and Universal Voice Protection Against Malicious Speech Synthesis Zhisheng Zhang et.al. 2504.09839 **[link](https://github.com/wxzyd123/safespeech)**

2025-04-13

Publish Date Title Authors PDF Code
2025-04-13 FractalForensics: Proactive Deepfake Detection and Localization via Fractal Watermarks Tianyi Wang et.al. 2504.09451 null
2025-04-13 Detecting Localized Deepfake Manipulations Using Action Unit-Guided Video Representations Tharun Anand et.al. 2503.22121 null

2025-04-10

Publish Date Title Authors PDF Code
2025-04-10 LoRAX: LoRA eXpandable Networks for Continual Synthetic Image Attribution Danielle Sullivan-Pao et.al. 2504.08149 **[link](https://github.com/mit-ll/lorax_cil)**

2025-04-09

Publish Date Title Authors PDF Code
2025-04-09 Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning Ashutosh Chaubey et.al. 2504.07198 null
2025-04-09 Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception Yuankun Xie et.al. 2504.06753 **[link](https://github.com/xieyuankun/all-type-add)**

2025-04-08

Publish Date Title Authors PDF Code
2025-04-08 Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing Tianchi Liu et.al. 2504.05657 **[link](https://github.com/liu-tianchi/nes2net)**

2025-04-07

Publish Date Title Authors PDF Code
2025-04-07 From Specificity to Generality: Revisiting Generalizable Artifacts in Detecting Face Deepfakes Long Ma et.al. 2504.04827 null
2025-04-07 SUEDE:Shared Unified Experts for Physical-Digital Face Attack Detection Enhancement Zuying Xie et.al. 2504.04818 null

2025-04-04

Publish Date Title Authors PDF Code
2025-04-04 Three Forensic Cues for JPEG AI Images Sandra Bergmann et.al. 2504.03191 null

2025-04-03

Publish Date Title Authors PDF Code
2025-04-03 Comparative Analysis of Deepfake Detection Models: New Approaches and Perspectives Matheus Martins Batista et.al. 2504.02900 null

2025-04-02

Publish Date Title Authors PDF Code
2025-04-02 Robust AI-Synthesized Image Detection via Multi-feature Frequency-aware Learning Hongfei Cai et.al. 2504.02879 null
2025-04-02 Detecting Lip-Syncing Deepfakes: Vision Temporal Transformer for Analyzing Mouth Inconsistencies Soumyya Kanti Datta et.al. 2504.01470 **[link](https://github.com/skrantidatta/lipinc-v2)**

2025-04-01

Publish Date Title Authors PDF Code
2025-04-01 FA^{3}-CLIP: Frequency-Aware Cues Fusion and Attack-Agnostic Prompt Learning for Unified Face Attack Detection Yongze Li et.al. 2504.00454 null

2025-03-29

Publish Date Title Authors PDF Code
2025-03-29 Synthetic Art Generation and DeepFake Detection A Study on Jamini Roy Inspired Dataset Kushal Agrawal et.al. 2503.23226 null
2025-03-29 Can Multi-modal (reasoning) LLMs work as deepfake detectors? Simiao Ren et.al. 2503.20084 null

2025-03-26

Publish Date Title Authors PDF Code
2025-03-26 MVFNet: Multipurpose Video Forensics Network using Multiple Forms of Forensic Evidence Tai D. Nguyen et.al. 2503.20991 null
2025-03-26 Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector Xiao Guo et.al. 2503.20188 **[link](https://github.com/chelsea234/m2f2_det)**
2025-03-26 Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection Andrii Yermakov et.al. 2503.19683 **[link](https://github.com/yermandy/deepfake-detection)**
2025-03-26 InnerSelf: Designing Self-Deepfaked Voice for Emotional Well-being Guang Dai et.al. 2503.14257 null

2025-03-24

Publish Date Title Authors PDF Code
2025-03-24 SCVI: Bridging Social and Cyber Dimensions for Comprehensive Vulnerability Assessment Shutonu Mitra et.al. 2503.20806 null
2025-03-24 NullSwap: Proactive Identity Cloaking Against Deepfake Face Swapping Tianyi Wang et.al. 2503.18678 null
2025-03-24 Deepfake-Eval-2024: A Multi-Modal In-the-Wild Benchmark of Deepfakes Circulated in 2024 Nuria Alina Chandra et.al. 2503.02857 **[link](https://github.com/nuriachandra/deepfake-eval-2024)**

2025-03-23

Publish Date Title Authors PDF Code
2025-03-23 Anomaly Detection and Localization for Speech Deepfakes via Feature Pyramid Matching Emma Coletta et.al. 2503.18032 null

2025-03-21

Publish Date Title Authors PDF Code
2025-03-21 Measuring the Robustness of Audio Deepfake Detectors Xiang Li et.al. 2503.17577 **[link](https://github.com/Jessegator/Audio_robustness_evaluation)**
2025-03-21 D2Fusion: Dual-domain Fusion with Feature Superposition for Deepfake Detection Xueqi Qiu et.al. 2503.17184 null

2025-03-20

Publish Date Title Authors PDF Code
2025-03-20 TruthLens: Explainable DeepFake Detection for Face Manipulated and Fully Synthetic Data Rohit Kundu et.al. 2503.15867 null

2025-03-19

Publish Date Title Authors PDF Code
2025-03-19 Cyber Threats in Financial Transactions -- Addressing the Dual Challenge of AI and Quantum Computing Ahmed M. Elmisery et.al. 2503.15678 null
2025-03-19 TruthLens:A Training-Free Paradigm for DeepFake Detection Ritabrata Chakraborty et.al. 2503.15342 null
2025-03-19 Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation Siwei Wen et.al. 2503.14905 null
2025-03-19 Unlocking the Capabilities of Vision-Language Models for Generalizable and Explainable Deepfake Detection Peipeng Yu et.al. 2503.14853 null

2025-03-18

Publish Date Title Authors PDF Code
2025-03-18 ExDDV: A New Dataset for Explainable Deepfake Detection in Video Vlad Hondru et.al. 2503.14421 **[link](https://github.com/vladhondru25/exddv)**

2025-03-16

Publish Date Title Authors PDF Code
2025-03-16 Deepfake Detection with Optimized Hybrid Model: EAR Biometric Descriptor via Improved RCNN Ruchika Sharma et.al. 2503.12381 null

2025-03-14

Publish Date Title Authors PDF Code
2025-03-14 Deepfake Detection of Face Images based on a Convolutional Neural Network Lukas Kroiß et.al. 2503.11389 null

2025-03-11

Publish Date Title Authors PDF Code
2025-03-11 Unmasking the Unknown: Facial Deepfake Detection in the Open-Set Paradigm Nadarasar Bahavan et.al. 2503.08055 null

2025-03-10

Publish Date Title Authors PDF Code
2025-03-10 VoD: Learning Volume of Differences for Video-Based Deepfake Detection Ying Xu et.al. 2503.07607 **[link](https://github.com/xuyingzhongguo/vod)**

2025-03-09

Publish Date Title Authors PDF Code
2025-03-09 Chameleon: On the Scene Diversity and Domain Variety of AI-Generated Videos Detection Meiyu Zeng et.al. 2503.06624 null

2025-03-08

Publish Date Title Authors PDF Code
2025-03-08 VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models Xinan He et.al. 2503.06142 null

2025-03-06

Publish Date Title Authors PDF Code
2025-03-06 Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems Jooyoung Lee et.al. 2503.04945 null

2025-03-05

Publish Date Title Authors PDF Code
2025-03-05 Reduced Spatial Dependency for More General Video-level Deepfake Detection Beilin Chu et.al. 2503.03270 null

2025-03-04

Publish Date Title Authors PDF Code
2025-03-04 Deepfake Detection via Knowledge Injection Tonghui Li et.al. 2503.02503 null

2025-02-28

Publish Date Title Authors PDF Code
2025-02-28 Two-Stream Spatial-Temporal Transformer Framework for Person Identification via Natural Conversational Keypoints Masoumeh Chapariniya et.al. 2502.20803 null

2025-02-27

Publish Date Title Authors PDF Code
2025-02-27 DIN-CTS: Low-Complexity Depthwise-Inception Neural Network with Contrastive Training Strategy for Deepfake Speech Detection Lam Pham et.al. 2502.20225 null

AIGC

2025-06-26

Publish Date Title Authors PDF Code
2025-06-26 Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test Ziyue Li et.al. 2506.21551 null
2025-06-26 mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale Xiaona Zhou et.al. 2506.21550 null
2025-06-26 Exploring the Design Space of 3D MLLMs for CT Report Generation Mohammed Baharoon et.al. 2506.21535 null
2025-06-26 "What's Up, Doc?": Analyzing How Users Seek Health Information in Large-Scale Conversational AI Datasets Akshay Paruchuri et.al. 2506.21532 null
2025-06-26 Potemkin Understanding in Large Language Models Marina Mancoridis et.al. 2506.21521 null
2025-06-26 Enhancing User Engagement in Socially-Driven Dialogue through Interactive LLM Alignments Jiashuo Wang et.al. 2506.21497 null
2025-06-26 Bridging Offline and Online Reinforcement Learning for LLMs Jack Lanchantin et.al. 2506.21495 null
2025-06-26 Controllable 3D Placement of Objects with Scene-Aware Diffusion Models Mohamed Omran et.al. 2506.21446 null
2025-06-26 Text2Cypher Across Languages: Evaluating Foundational Models Beyond English Makbule Gulcin Ozsoy et.al. 2506.21445 null
2025-06-26 Domain Knowledge-Enhanced LLMs for Fraud and Concept Drift Detection Ali Şenol et.al. 2506.21443 null
2025-06-26 Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning Prajwal Koirala et.al. 2506.21427 null
2025-06-26 XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation Bowen Chen et.al. 2506.21416 null
2025-06-26 Scalable Bayesian Low-Rank Adaptation of Large Language Models via Stochastic Variational Subspace Inference Colin Samplawski et.al. 2506.21408 null
2025-06-26 Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation Guanting Dong et.al. 2506.21384 null
2025-06-26 Structuralist Approach to AI Literary Criticism: Leveraging Greimas Semiotic Square for Large Language Models Fangzhou Dong et.al. 2506.21360 null
2025-06-26 CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations Julian Lorenz et.al. 2506.21357 null
2025-06-26 DynamicBench: Evaluating Real-Time Report Generation in Large Language Models Jingyao Li et.al. 2506.21343 null
2025-06-26 Latent Prototype Routing: Achieving Near-Perfect Load Balancing in Mixture-of-Experts Jiajie Yang et.al. 2506.21328 null
2025-06-26 Multimodal LLMs for Visualization Reconstruction and Understanding Can Liu et.al. 2506.21319 null
2025-06-26 Exploring Adapter Design Tradeoffs for Low Resource Music Generation Atharva Mehta et.al. 2506.21298 null

2025-06-25

Publish Date Title Authors PDF Code
2025-06-25 Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs Sonia K. Murthy et.al. 2506.20666 null
2025-06-25 The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind Andrei Lupu et.al. 2506.20664 null
2025-06-25 Memento: Note-Taking for Your Future Self Chao Wan et.al. 2506.20642 null
2025-06-25 Telegrapher's Generative Model via Kac Flows Richard Duong et.al. 2506.20641 null
2025-06-25 AI Assistants to Enhance and Exploit the PETSc Knowledge Base Barry Smith et.al. 2506.20608 null
2025-06-25 Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm Baixiang Huang et.al. 2506.20606 null
2025-06-25 Video Perception Models for 3D Scene Synthesis Rui Huang et.al. 2506.20601 null
2025-06-25 SFNet: Fusion of Spatial and Frequency-Domain Features for Remote Sensing Image Forgery Detection Ji Qi et.al. 2506.20599 null
2025-06-25 Fine-Tuning and Prompt Engineering of LLMs, for the Creation of Multi-Agent AI for Addressing Sustainable Protein Production Challenges Alexander D. Kalian et.al. 2506.20598 null
2025-06-25 AI in the Writing Process: How Purposeful AI Support Fosters Student Writing Momin N. Siddiqui et.al. 2506.20595 null
2025-06-25 CCISolver: End-to-End Detection and Repair of Method-Level Code-Comment Inconsistency Renyi Zhong et.al. 2506.20558 null
2025-06-25 Large Language Model-Driven Code Compliance Checking in Building Information Modeling Soumya Madireddy et.al. 2506.20551 null
2025-06-25 When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs Ammar Khairi et.al. 2506.20544 null
2025-06-25 WattsOnAI: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI Workloads Hongzhen Huang et.al. 2506.20535 null
2025-06-25 Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios Wenbin Gan et.al. 2506.20531 null
2025-06-25 Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards Charles Arnal et.al. 2506.20520 null
2025-06-25 BotHash: Efficient and Training-Free Bot Detection Through Approximate Nearest Neighbor Edoardo Di Paolo et.al. 2506.20503 null
2025-06-25 ReCode: Updating Code API Knowledge with Reinforcement Learning Haoze Wu et.al. 2506.20495 null
2025-06-25 Generative AI for Vulnerability Detection in 6G Wireless Networks: Advances, Case Study, and Future Directions Shuo Yang et.al. 2506.20488 null
2025-06-25 GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching Guinan Su et.al. 2506.20480 null

2025-06-24

Publish Date Title Authors PDF Code
2025-06-24 JoyAgents-R1: Joint Evolution Dynamics for Versatile Multi-LLM Agents with Reinforcement Learning Ai Han et.al. 2506.19846 null
2025-06-24 MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration Yucheng Zhou et.al. 2506.19835 null
2025-06-24 A standard transformer and attention with linear biases for molecular conformer generation Viatcheslav Gurev et.al. 2506.19834 null
2025-06-24 ProxelGen: Generating Proteins as 3D Densities Felix Faltings et.al. 2506.19820 null
2025-06-24 KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality Baochang Ren et.al. 2506.19807 null
2025-06-24 LLM-Based Social Simulations Require a Boundary Zengqing Wu et.al. 2506.19806 null
2025-06-24 KnowML: Improving Generalization of ML-NIDS with Attack Knowledge Graphs Xin Fan Guo et.al. 2506.19802 null
2025-06-24 Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study Yuqi Zhu et.al. 2506.19794 null
2025-06-24 Line ratio identification of external photoevaporation Tyger Peake et.al. 2506.19788 null
2025-06-24 SAGE: Strategy-Adaptive Generation Engine for Query Rewriting Teng Wang et.al. 2506.19783 null
2025-06-24 Alleviating User-Sensitive bias with Fair Generative Sequential Recommendation Model Yang Liu et.al. 2506.19777 null
2025-06-24 Canary in the Mine: An LLM Augmented Survey of Disciplinary Complaints to the Ordre des ingénieurs du Québec (OIQ) Tammy Mackenzie et.al. 2506.19775 null
2025-06-24 Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation Jun Wang et.al. 2506.19774 null
2025-06-24 Automatic Prompt Optimization for Knowledge Graph Construction: Insights from an Empirical Study Nandana Mihindukulasooriya et.al. 2506.19773 null
2025-06-24 A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects Shulan Ruan et.al. 2506.19769 null
2025-06-24 SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning Yuqian Fu et.al. 2506.19767 null
2025-06-24 Arabic Dialect Classification using RNNs, Transformers, and Large Language Models: A Comparative Analysis Omar A. Essameldin et.al. 2506.19753 null
2025-06-24 Noise Consistency Training: A Native Approach for One-Step Generator in Learning Additional Controls Yihong Luo et.al. 2506.19741 null
2025-06-24 Breaking Barriers: Do Reinforcement Post Training Gains Transfer To Unseen Domains? Chuxuan Hu et.al. 2506.19733 null
2025-06-24 Who Does What in Deep Learning? Multidimensional Game-Theoretic Attribution of Function of Neural Units Shrey Dixit et.al. 2506.19732 null

2025-06-23

Publish Date Title Authors PDF Code
2025-06-23 FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation Kaiyi Huang et.al. 2506.18899 null
2025-06-23 Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations Jiaming Han et.al. 2506.18898 null
2025-06-23 MinD: Unified Visual Imagination and Control via Hierarchical World Models Xiaowei Chi et.al. 2506.18897 null
2025-06-23 ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs Jiaru Zou et.al. 2506.18896 null
2025-06-23 Steering Conceptual Bias via Transformer Latent-Subspace Activation Vansh Sharma et.al. 2506.18887 null
2025-06-23 Let Your Video Listen to Your Music! Xinyu Zhang et.al. 2506.18881 null
2025-06-23 OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization Yiyou Sun et.al. 2506.18880 null
2025-06-23 CommVQ: Commutative Vector Quantization for KV Cache Compression Junyan Li et.al. 2506.18879 null
2025-06-23 OmniGen2: Exploration to Advanced Multimodal Generation Chenyuan Wu et.al. 2506.18871 null
2025-06-23 OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation Qijun Gan et.al. 2506.18866 null
2025-06-23 LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning Yuhao Wu et.al. 2506.18841 null
2025-06-23 Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories Islem Bouzenia et.al. 2506.18824 null
2025-06-23 RWESummary: A Framework and Test for Choosing Large Language Models to Summarize Real-World Evidence (RWE) Studies Arjun Mukerji et.al. 2506.18819 null
2025-06-23 FORGE: An LLM-driven Framework for Large-Scale Smart Contract Vulnerability Dataset Construction Jiachi Chen et.al. 2506.18795 null
2025-06-23 3D Arena: An Open Platform for Generative 3D Evaluation Dylan Ebert et.al. 2506.18787 null
2025-06-23 TRIZ Agents: A Multi-Agent LLM Approach for TRIZ-Based Innovation Kamil Szczepanik et.al. 2506.18783 null
2025-06-23 Existing LLMs Are Not Self-Consistent For Simple Tasks Zhenru Lin et.al. 2506.18781 null
2025-06-23 DefFusionNet: Learning Multimodal Goal Shapes for Deformable Object Manipulation via a Diffusion-based Probabilistic Model Bao Thach et.al. 2506.18779 null
2025-06-23 Programming by Backprop: LLMs Acquire Reusable Algorithmic Abstractions During Code Training Jonathan Cook et.al. 2506.18777 null
2025-06-23 ContinualFlow: Learning and Unlearning with Neural Flow Matching Lorenzo Simone et.al. 2506.18747 null

2025-06-20

Publish Date Title Authors PDF Code
2025-06-20 No Free Lunch: Rethinking Internal Feedback for LLM Reasoning Yanzhi Zhang et.al. 2506.17219 null
2025-06-20 Fine-Tuning Lowers Safety and Disrupts Evaluation Consistency Kathleen C. Fraser et.al. 2506.17209 null
2025-06-20 Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems Matias Martinez et.al. 2506.17208 null
2025-06-20 Confidence Scoring for LLM-Generated SQL in Supply Chain Data Extraction Jiekai Ma et.al. 2506.17203 null
2025-06-20 Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation Jianglong Ye et.al. 2506.17198 null
2025-06-20 Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres Samuel Howard et.al. 2506.17197 null
2025-06-20 Detecting LLM-Generated Short Answers and Effects on Learner Performance Shambhavi Bhushan et.al. 2506.17196 null
2025-06-20 Towards AI Search Paradigm Yuchen Li et.al. 2506.17188 null
2025-06-20 Deep generative models as the probability transformation functions Vitalii Bondar et.al. 2506.17171 null
2025-06-20 The MedPerturb Dataset: What Non-Content Perturbations Reveal About Human and Clinical LLM Decision Making Abinitha Gourabathina et.al. 2506.17163 null
2025-06-20 Do We Need Large VLMs for Spotting Soccer Actions? Ritabrata Chakraborty et.al. 2506.17144 null
2025-06-20 MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification David Jacob Drexlin et.al. 2506.17140 null
2025-06-20 Large Language Model Unlearning for Source Code Xue Jiang et.al. 2506.17125 null
2025-06-20 When Can Model-Free Reinforcement Learning be Enough for Thinking? Josiah P. Hanna et.al. 2506.17124 null
2025-06-20 Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving Chuxue Cao et.al. 2506.17104 null
2025-06-20 Chain-of-Thought Prompting Obscures Hallucination Cues in Large Language Models: An Empirical Evaluation Jiahao Cheng et.al. 2506.17088 null
2025-06-20 Tower+: Bridging Generality and Translation Specialization in Multilingual LLMs Ricardo Rei et.al. 2506.17080 null
2025-06-20 Simultaneous Translation with Offline Speech and LLM Models in CUNI Submission to IWSLT 2025 Dominik Macháček et.al. 2506.17077 null
2025-06-20 LLM-Based Bot Broadens the Range of Arguments in Online Discussions, Even When Transparently Disclosed as AI Valeria Vuk et.al. 2506.17073 null
2025-06-20 Empowering Near-Field Communications in Low-Altitude Economy with LLM: Fundamentals, Potentials, Solutions, and Future Directions Zhuo Xu et.al. 2506.17067 null

2025-06-18

Publish Date Title Authors PDF Code
2025-06-18 Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards Qingming Liu et.al. 2506.15684 null
2025-06-18 PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning Yuhui Shi et.al. 2506.15683 null
2025-06-18 Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model Anirud Aggarwal et.al. 2506.15682 **[link](https://github.com/aniaggarwal/ecad)**
2025-06-18 GenRecal: Generation after Recalibration from Large to Small Vision-Language Models Byung-Kwan Lee et.al. 2506.15681 null
2025-06-18 CC-LEARN: Cohort-based Consistency Learning Xiao Ye et.al. 2506.15662 null
2025-06-18 PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection Wenhao Li et.al. 2506.15656 null
2025-06-18 deepSURF: Detecting Memory Safety Vulnerabilities in Rust Through Fuzzing LLM-Augmented Harnesses Georgios Androutsopoulos et.al. 2506.15648 null
2025-06-18 Revisiting Compositional Generalization Capability of Large Language Models Considering Instruction Following Ability Yusuke Sakai et.al. 2506.15629 null
2025-06-18 The Effect of State Representation on LLM Agent Behavior in Dynamic Routing Games Lyle Goodyear et.al. 2506.15624 null
2025-06-18 LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning Gabrel J. Perin et.al. 2506.15606 **[link](https://github.com/vita-group/lox)**
2025-06-18 From Model to Classroom: Evaluating Generated MCQs for Portuguese with Narrative and Difficulty Concerns Bernardo Leite et.al. 2506.15598 null
2025-06-18 LiteGD: Lightweight and dynamic GPU Dispatching for Large-scale Heterogeneous Clusters Kunming Zhang et.al. 2506.15595 null
2025-06-18 One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution Yujing Sun et.al. 2506.15591 **[link](https://github.com/yjsunnn/dloral)**
2025-06-18 Gender Inclusivity Fairness Index (GIFI): A Multilevel Framework for Evaluating Gender Diversity in Large Language Models Zhengyang Shan et.al. 2506.15568 **[link](https://github.com/zhengyangshan/gifi)**
2025-06-18 Managing Complex Failure Analysis Workflows with LLM-based Reasoning and Acting Agents Aline Dobrovsky et.al. 2506.15567 null
2025-06-18 PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction Shufan Li et.al. 2506.15556 null
2025-06-18 Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models Teysir Baoueb et.al. 2506.15530 null
2025-06-18 Lessons from Training Grounded LLMs with Verifiable Rewards Shang Hong Sim et.al. 2506.15522 null
2025-06-18 RePCS: Diagnosing Data Memorization in LLM-Powered Retrieval-Augmented Generation Le Vu Anh et.al. 2506.15513 null
2025-06-18 Optimizing Web-Based AI Query Retrieval with GPT Integration in LangChain A CoT-Enhanced Prompt Engineering Approach Wenqi Guan et.al. 2506.15512 null

2025-06-17

Publish Date Title Authors PDF Code
2025-06-17 Large Language Models -- the Future of Fundamental Physics? Caroline Heneka et.al. 2506.14757 null
2025-06-17 Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs Ring Team et.al. 2506.14731 null
2025-06-17 AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes Jiahao Qiu et.al. 2506.14728 null
2025-06-17 Adaptive Accompaniment with ReaLchords Yusong Wu et.al. 2506.14723 null
2025-06-17 Unified Software Engineering agent as AI Software Engineer Leonhard Applis et.al. 2506.14683 null
2025-06-17 AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models Ads Dawson et.al. 2506.14682 null
2025-06-17 Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality Yuto Harada et.al. 2506.14681 null
2025-06-17 Issue Retrieval and Verification Enhanced Supplementary Code Comment Generation Yanzhen Zou et.al. 2506.14649 null
2025-06-17 Passing the Turing Test in Political Discourse: Fine-Tuning LLMs to Mimic Polarized Social Media Comments . Pazzaglia et.al. 2506.14645 null
2025-06-17 Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot Xiang Cheng et.al. 2506.14641 null
2025-06-17 AIn't Nothing But a Survey? Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation Leah von der Heyde et.al. 2506.14634 null
2025-06-17 Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language Models Chenchen Yuan et.al. 2506.14625 null
2025-06-17 Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees Ahmed Heakl et.al. 2506.14606 null
2025-06-17 Align Your Flow: Scaling Continuous-Time Flow Map Distillation Amirmojtaba Sabour et.al. 2506.14603 null
2025-06-17 NetRoller: Interfacing General and Specialized Models for End-to-End Autonomous Driving Ren Xin et.al. 2506.14589 null
2025-06-17 GenerationPrograms: Fine-grained Attribution with Executable Programs David Wan et.al. 2506.14580 null
2025-06-17 AlphaDecay:Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs Di He et.al. 2506.14562 null
2025-06-17 Doppelgänger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack Daewon Kang et.al. 2506.14539 null
2025-06-17 Automatic Qiskit Code Refactoring Using Large Language Models José Manuel Suárez et.al. 2506.14535 null
2025-06-17 M2BeamLLM: Multimodal Sensing-empowered mmWave Beam Prediction with Large Language Models Can Zheng et.al. 2506.14532 null
2025-06-17 Prefix-Tuning+: Modernizing Prefix-Tuning by Decoupling the Prefix from Attention Haonan Wang et.al. 2506.13674 null

2025-06-16

Publish Date Title Authors PDF Code
2025-06-16 Diagnosing and Improving Diffusion Models by Estimating the Optimal Loss Value Yixian Xu et.al. 2506.13763 null
2025-06-16 Discrete Diffusion in Large Language and Multimodal Models: A Survey Runpeng Yu et.al. 2506.13759 null
2025-06-16 AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning Zewei Zhou et.al. 2506.13757 null
2025-06-16 UltraZoom: Generating Gigapixel Images from Regular Photos Jingwei Ma et.al. 2506.13756 null
2025-06-16 Steering LLM Thinking with Budget Guidance Junyan Li et.al. 2506.13752 null
2025-06-16 Evaluating Large Language Models for Phishing Detection, Self-Consistency, Faithfulness, and Explainability Shova Kuikel et.al. 2506.13746 null
2025-06-16 LTRR: Learning To Rank Retrievers for LLMs To Eun Kim et.al. 2506.13743 null
2025-06-16 Instruction Following by Boosting Attention of Large Language Models Vitoria Guardieiro et.al. 2506.13734 null
2025-06-16 Attribution-guided Pruning for Compression, Circuit Discovery, and Targeted Correction in LLMs Sayed Mohammad Vakilzadeh Hatefi et.al. 2506.13727 null
2025-06-16 TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning Junru Zhang et.al. 2506.13705 null
2025-06-16 Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry Junyoung Seo et.al. 2506.13697 null
2025-06-16 OneRec Technical Report Guorui Zhou et.al. 2506.13695 null
2025-06-16 Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems Shang-Chi Tsai et.al. 2506.13692 null
2025-06-16 UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions Zhucun Xue et.al. 2506.13691 null
2025-06-16 Enforcing tail calibration when training probabilistic forecast models Jakob Benjamin Wessel et.al. 2506.13687 null
2025-06-16 An LLM's Apology: Outsourcing Awkwardness in the Age of AI Twm Stone et.al. 2506.13685 null
2025-06-16 Turning Down the Heat: A Critical Analysis of Min-p Sampling in Language Models Rylan Schaeffer et.al. 2506.13681 null
2025-06-16 MultiViT2: A Data-augmented Multimodal Neuroimaging Prediction Framework via Latent Diffusion Model Bi Yuda et.al. 2506.13667 null
2025-06-16 We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems Junfeng Fang et.al. 2506.13666 null

2025-06-13

Publish Date Title Authors PDF Code
2025-06-13 code_transformed: The Influence of Large Language Models on Code Yuliang Xu et.al. 2506.12014 null
2025-06-13 Tracing LLM Reasoning Processes with Strategic Games: A Framework for Planning, Revision, and Resource-Constrained Decision Making Xiaopeng Yuan et.al. 2506.12012 null
2025-06-13 How Visual Representations Map to Language Feature Space in Multimodal LLMs Constantin Venhoff et.al. 2506.11976 null
2025-06-13 A Robust Local Fréchet Regression Using Unbalanced Neural Optimal Transport with Applications to Dynamic Single-cell Genomics Data Binghao Yan et.al. 2506.11969 null
2025-06-13 Improving Large Language Model Safety with Contrastive Representation Learning Samuel Simko et.al. 2506.11938 null
2025-06-13 Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback Dongwei Jiang et.al. 2506.11930 null
2025-06-13 LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming? Zihan Zheng et.al. 2506.11928 null
2025-06-13 Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation Min-Seop Kwak et.al. 2506.11924 null
2025-06-13 TreeRL: LLM Reinforcement Learning with On-Policy Tree Search Zhenyu Hou et.al. 2506.11902 null
2025-06-13 Towards a Cascaded LLM Framework for Cost-effective Human-AI Decision-Making Claudio Fanconi et.al. 2506.11887 null
2025-06-13 Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache Xiaoran Liu et.al. 2506.11886 null
2025-06-13 Addressing Bias in LLMs: Strategies and Application to Fair AI-based Recruitment Alejandro Peña et.al. 2506.11880 null
2025-06-13 A Short Survey on Formalising Software Requirements using Large Language Models Arshad Beg et.al. 2506.11874 null
2025-06-13 LLM-based Dynamic Differential Testing for Database Connectors with Reinforcement Learning-Guided Prompt Selection Ce Lyu et.al. 2506.11870 null
2025-06-13 Post Persona Alignment for Multi-Session Dialogue Generation Yi-Pei Chen et.al. 2506.11857 null
2025-06-13 TrustGLM: Evaluating the Robustness of GraphLLMs Against Prompt, Text, and Structure Attacks Qihai Zhang et.al. 2506.11844 null
2025-06-13 Your Ride, Your Rules: Psychology and Cognition Enabled Automated Driving Systems Zhipeng Bao et.al. 2506.11842 null
2025-06-13 Revealing Political Bias in LLMs through Structured Multi-Agent Debate Aishwarya Bandaru et.al. 2506.11825 null
2025-06-13 Rethinking Multilingual Vision-Language Translation: Dataset, Evaluation, and Adaptation Xintong Wang et.al. 2506.11820 null
2025-06-13 On the Performance of LLMs for Real Estate Appraisal Margot Geerts et.al. 2506.11812 null
2025-06-13 MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning Yuxuan Luo et.al. 2506.10963 null

2025-06-12

Publish Date Title Authors PDF Code
2025-06-12 SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis Weiliang Chen et.al. 2506.10981 null
2025-06-12 GenWorld: Towards Detecting AI-generated Real-world Simulation Videos Weiliang Chen et.al. 2506.10975 null
2025-06-12 AutoMind: Adaptive Knowledgeable Agent for Automated Data Science Yixin Ou et.al. 2506.10974 null
2025-06-12 Farseer: A Refined Scaling Law in Large Language Models Houyi Li et.al. 2506.10972 null
2025-06-12 GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation Ning Gao et.al. 2506.10966 null
2025-06-12 ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark Kangwei Liu et.al. 2506.10960 null
2025-06-12 SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks Lianghong Guo et.al. 2506.10954 **[link](https://github.com/deepsoftwareanalytics/swe-factory)**
2025-06-12 Build the web for agents, not agents for the web Xing Han Lù et.al. 2506.10953 null
2025-06-12 Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors Chen Yueh-Han et.al. 2506.10949 **[link](https://github.com/yuehhanchen/monitoring-decomposition-attack)**
2025-06-12 Execution Guided Line-by-Line Code Generation Boaz Lavon et.al. 2506.10948 null
2025-06-12 GUARD: Guided Unlearning and Retention via Data Attribution for Large Language Models Evelyn Ma et.al. 2506.10946 null
2025-06-12 Self-Adapting Language Models Adam Zweiger et.al. 2506.10943 null
2025-06-12 Dynamic Epistemic Friction in Dialogue Timothy Obiso et.al. 2506.10934 null
2025-06-12 The Role of Generative AI in Facilitating Social Interactions: A Scoping Review T. T. J. E. Arets et.al. 2506.10927 null
2025-06-12 Robustly Improving LLM Fairness in Realistic Settings via Interpretability Adam Karvonen et.al. 2506.10922 null
2025-06-12 Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization Or Shafran et.al. 2506.10920 null
2025-06-12 Magistral Mistral-AI et.al. 2506.10910 null
2025-06-12 Beyond Gold Standards: Epistemic Ensemble of LLM Judges for Formal Mathematical Reasoning Lan Zhang et.al. 2506.10903 null
2025-06-12 GenPlanX. Generation of Plans and Execution Daniel Borrajo et.al. 2506.10897 null

2025-06-11

Publish Date Title Authors PDF Code
2025-06-11 Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling Tim Z. Xiao et.al. 2506.09998 null
2025-06-11 From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring Yang Li et.al. 2506.09996 null
2025-06-11 Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation Xinyu Yang et.al. 2506.09991 null
2025-06-11 Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs Hiroshi Matsuda et.al. 2506.09983 null
2025-06-11 When Detection Fails: The Power of Fine-Tuned Models to Generate Human-Like Social Media Text Hillary Dawkins et.al. 2506.09975 null
2025-06-11 SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance Wentao Ge et.al. 2506.09968 null
2025-06-11 Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing Junfei Wu et.al. 2506.09965 null
2025-06-11 LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection Challenge Sahar Abdelnabi et.al. 2506.09956 null
2025-06-11 Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking Wuwei Zhang et.al. 2506.09944 null
2025-06-11 VerIF: Verification Engineering for Reinforcement Learning in Instruction Following Hao Peng et.al. 2506.09942 null
2025-06-11 PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants Zheng Zhao et.al. 2506.09902 null
2025-06-11 The Emergence of Abstract Thought in Large Language Models Beyond Any Language Yuxin Chen et.al. 2506.09890 null
2025-06-11 Attention Head Embeddings with Trainable Deep Kernels for Hallucination Detection in LLMs Rodion Oblovatny et.al. 2506.09886 null
2025-06-11 Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning Xiangning Yu et.al. 2506.09853 null
2025-06-11 Dataset of News Articles with Provenance Metadata for Media Relevance Assessment Tomas Peterka et.al. 2506.09847 null
2025-06-11 A Deep Generative Model for the Simulation of Discrete Karst Networks Dany Lauzon et.al. 2506.09832 null
2025-06-11 EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection Christoph Schuhmann et.al. 2506.09827 null
2025-06-11 Metritocracy: Representative Metrics for Lite Benchmarks Ariel Procaccia et.al. 2506.09813 null
2025-06-11 Do LLMs Give Psychometrically Plausible Responses in Educational Assessments? Andreas Säuberli et.al. 2506.09796 null
2025-06-11 Where Journalism Silenced Voices: Exploring Discrimination in the Representation of Indigenous Communities in Bangladesh Abhijit Paul et.al. 2506.09771 null
2025-06-11 Comparing human and LLM proofreading in L2 writing: Impact on lexical and syntactic features Hakyung Sung et.al. 2506.09021 null
2025-06-11 Boosting Rust Unit Test Coverage through Hybrid Program Analysis and Large Language Models Bei Chu et.al. 2506.09002 null
2025-06-11 Towards Better Code Generation: Adaptive Decoding with Uncertainty Guidance Kaifeng He et.al. 2506.08980 null

2025-06-10

Publish Date Title Authors PDF Code
2025-06-10 ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering Yuki Imajuku et.al. 2506.09050 null
2025-06-10 VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning Li Kang et.al. 2506.09049 null
2025-06-10 Understanding Task Vectors in In-Context Learning: Emergence, Functionality, and Limitations Yuxin Dong et.al. 2506.09048 null
2025-06-10 Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation Xiaowen Ma et.al. 2506.09046 null
2025-06-10 Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better Dianyi Wang et.al. 2506.09040 null
2025-06-10 AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions Polina Kirichenko et.al. 2506.09038 null
2025-06-10 FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed Sizhe Dang et.al. 2506.09034 null
2025-06-10 Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning Haozhen Zhang et.al. 2506.09033 null
2025-06-10 Diffuse and Disperse: Image Generation with Representation Regularization Runqian Wang et.al. 2506.09027 null
2025-06-10 e3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs Amrith Setlur et.al. 2506.09026 null
2025-06-10 Edit Flows: Flow Matching with Edit Operations Marton Havasi et.al. 2506.09018 null
2025-06-10 Learning to Reason Across Parallel Samples for LLM Reasoning Jianing Qi et.al. 2506.09014 null
2025-06-10 Branched Schrödinger Bridge Matching Sophia Tang et.al. 2506.09007 null
2025-06-10 Do Concept Replacement Techniques Really Erase Unacceptable Concepts? Anudeep Das et.al. 2506.08991 null
2025-06-10 SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning Xiao Liang et.al. 2506.08989 null
2025-06-10 ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations Amirreza Rouhi et.al. 2506.08968 null
2025-06-10 Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model Ailin Huang et.al. 2506.08967 null

2025-06-09

Publish Date Title Authors PDF Code
2025-06-09 Hidden in plain sight: VLMs overlook their visual representations Stephanie Fu et.al. 2506.08008 null
2025-06-09 Dreamland: Controllable World Creation with Simulator and Generative Models Sicheng Mo et.al. 2506.08006 null
2025-06-09 Aligning Text, Images, and 3D Structure Token-by-Token Aadarsh Sahoo et.al. 2506.08002 null
2025-06-09 Reparameterized LLM Training via Orthogonal Equivalence Transformation Zeju Qiu et.al. 2506.08001 null
2025-06-09 MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation Junhao Chen et.al. 2506.07999 null
2025-06-09 Generative Modeling of Weights: Generalization or Memorization? Boya Zeng et.al. 2506.07998 null
2025-06-09 Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System Fan Yang et.al. 2506.07997 null
2025-06-09 HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization Hongzheng Chen et.al. 2506.07972 null
2025-06-09 SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design Wenxin Tang et.al. 2506.07964 null
2025-06-09 Reinforcing Multimodal Understanding and Generation with Dual Self-rewards Jixiang Hong et.al. 2506.07963 null
2025-06-09 Correlated Errors in Large Language Models Elliot Kim et.al. 2506.07962 null
2025-06-09 TokenBreak: Bypassing Text Classification Models Through Token Manipulation Kasimir Schulz et.al. 2506.07948 null
2025-06-09 Statistical Hypothesis Testing for Auditing Robustness in Language Models Paulius Rauba et.al. 2506.07947 null
2025-06-09 ProtocolLLM: RTL Benchmark for SystemVerilog Generation of Communication Protocols Arnav Sheth et.al. 2506.07945 null
2025-06-09 Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations Yizhen Li et.al. 2506.07943 null
2025-06-09 Adversarial Attack Classification and Robustness Testing for Large Language Models for Code Yang Liu et.al. 2506.07942 null
2025-06-09 Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor Rishit Dagli et.al. 2506.07932 null
2025-06-09 Solving Inequality Proofs with Large Language Models Jiayi Sheng et.al. 2506.07927 null
2025-06-09 LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement Dimitris Panagopoulos et.al. 2506.07915 null
2025-06-09 FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling Sifan Wang et.al. 2506.07902 null

2025-06-06

Publish Date Title Authors PDF Code
2025-06-06 Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias Yuanzhe Hu et.al. 2506.06280 null
2025-06-06 CoMemo: LVLMs Need Image Context with Image Memory Shi Liu et.al. 2506.06279 null
2025-06-06 Distillation Robustifies Unlearning Bruce W. Lee et.al. 2506.06278 null
2025-06-06 STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis Jiatao Gu et.al. 2506.06276 null
2025-06-06 AdvSumm: Adversarial Training for Bias Mitigation in Text Summarization Mukur Gupta et.al. 2506.06273 null
2025-06-06 PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time Weizhi Zhang et.al. 2506.06254 null
2025-06-06 Visual Graph Arena: Evaluating Visual Conceptualization of Vision and Multimodal Large Language Models Zahra Babaiee et.al. 2506.06242 null
2025-06-06 Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge Yi Sui et.al. 2506.06240 null
2025-06-06 CompilerGPT: Leveraging Large Language Models for Analyzing and Acting on Compiler Optimization Reports Peter Pirkelbauer et.al. 2506.06227 null
2025-06-06 PROVSYN: Synthesizing Provenance Graphs for Data Augmentation in Intrusion Detection Systems Yi Huang et.al. 2506.06226 null
2025-06-06 Can Theoretical Physics Research Benefit from Language Agents? Sirui Lu et.al. 2506.06214 null
2025-06-06 Model-Driven Graph Contrastive Learning Ali Azizpour et.al. 2506.06212 null
2025-06-06 Building Models of Neurological Language Henry Watkins et.al. 2506.06208 null
2025-06-06 Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning Sheng Chen et.al. 2506.06205 null
2025-06-06 Does It Run and Is That Enough? Revisiting Text-to-Chart Generation with a Multi-Agent Approach James Ford et.al. 2506.06175 null
2025-06-06 Technical Report for Egocentric Mistake Detection for the HoloAssist Challenge Constantin Patsch et.al. 2506.06174 null
2025-06-06 The Lock-in Hypothesis: Stagnation by Algorithm Tianyi Alex Qiu et.al. 2506.06166 null
2025-06-06 Recommender systems, stigmergy, and the tyranny of popularity Zackary Okun Dunivin et.al. 2506.06162 null
2025-06-06 ENMA: Tokenwise Autoregression for Generative Neural PDE Operators Armand Kassaï Koupaï et.al. 2506.06158 null
2025-06-06 Masked Language Models are Good Heterogeneous Graph Generalizers Jinyu Yang et.al. 2506.06157 null

2025-06-05

Publish Date Title Authors PDF Code
2025-06-05 Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets Lei Hsiung et.al. 2506.05346 null
2025-06-05 Inference-Time Hyper-Scaling with KV Cache Compression Adrian Łańcucki et.al. 2506.05345 null
2025-06-05 SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs Jiahui Wang et.al. 2506.05344 null
2025-06-05 ContentV: Efficient Training of Video Generation Models with Limited Compute Wenfeng Lin et.al. 2506.05343 null
2025-06-05 Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning Xingjian Ran et.al. 2506.05341 null
2025-06-05 VideoMolmo: Spatio-Temporal Grounding Meets Pointing Ghazi Shazan Ahmad et.al. 2506.05336 null
2025-06-05 Search Arena: Analyzing Search-Augmented LLMs Mihran Miroyan et.al. 2506.05334 null
2025-06-05 Unleashing Hour-Scale Video Training for Long Video-Language Understanding Jingyang Lin et.al. 2506.05332 null
2025-06-05 MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning Xinyan Chen et.al. 2506.05331 null
2025-06-05 LSM-2: Learning from Incomplete Wearable Sensor Data Maxwell A. Xu et.al. 2506.05321 null
2025-06-05 Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay Yifan Sun et.al. 2506.05316 null
2025-06-05 Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models Taha Entesari et.al. 2506.05314 null
2025-06-05 Learning normalized image densities via dual score matching Florentin Guth et.al. 2506.05310 null
2025-06-05 Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games Niv Eckhaus et.al. 2506.05309 **[link](https://github.com/niveck/LLMafia)**
2025-06-05 ProRefine: Inference-time Prompt Refinement with Textual Feedback Deepak Pandita et.al. 2506.05305 null
2025-06-05 Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos Weifeng Lin et.al. 2506.05302 null
2025-06-05 Sample Complexity and Representation Ability of Test-time Scaling Paradigms Baihe Huang et.al. 2506.05295 null
2025-06-05 Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning Nan Huo et.al. 2506.05278 null
2025-06-05 How to Unlock Time Series Editing? Diffusion-Driven Approach with Multi-Grained Control Hao Yu et.al. 2506.05276 null
2025-06-05 From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos Animesh Gupta et.al. 2506.05274 null

2025-06-04

Publish Date Title Authors PDF Code
2025-06-04 Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector Boyong He et.al. 2506.04211 **[link](https://github.com/heboyong/Diffusion-Domain-Teacher)**
2025-06-04 Language-Image Alignment with Fixed Text Encoders Jingfeng Yang et.al. 2506.04209 null
2025-06-04 EPiC: Towards Lossless Speedup for Reasoning Training through Edge-Preserving CoT Condensation Jinghan Jia et.al. 2506.04205 null
2025-06-04 Cascadia: A Cascade Serving System for Large Language Models Youhe Jiang et.al. 2506.04203 null
2025-06-04 TracLLM: A Generic Framework for Attributing Long Context LLMs Yanting Wang et.al. 2506.04202 null
2025-06-04 R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning Qingfei Zhao et.al. 2506.04185 null
2025-06-04 SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models Yuhao Wu et.al. 2506.04180 null
2025-06-04 SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling Anhao Zhao et.al. 2506.04179 null
2025-06-04 Does Prompt Design Impact Quality of Data Imputation by LLMs? Shreenidhi Srinivasan et.al. 2506.04172 null
2025-06-04 Physics-Constrained Flow Matching: Sampling Generative Models with Hard Constraints Utkarsh Utkarsh et.al. 2506.04171 null
2025-06-04 Neural and Cognitive Impacts of AI: The Influence of Task Subjectivity on Human-LLM Collaboration Matthew Russell et.al. 2506.04167 null
2025-06-04 N $^2$ : A Unified Python Package and Test Bench for Nearest Neighbor-Based Matrix Completion Caleb Chin et.al. 2506.04166 null
2025-06-04 VISCA: Inferring Component Abstractions for Automated End-to-End Testing Parsa Alian et.al. 2506.04161 null
2025-06-04 A Dataset for Addressing Patient's Information Needs related to Clinical Course of Hospitalization Sarvesh Soni et.al. 2506.04156 null
2025-06-04 Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis Kejian Zhu et.al. 2506.04142 null
2025-06-04 Are Lexicon-Based Tools Still the Gold Standard for Valence Analysis in Low-Resource Flemish? Ratna Kandala et.al. 2506.04139 null
2025-06-04 TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems Shaina Raza et.al. 2506.04133 null
2025-06-04 Guided Speculative Inference for Efficient Test-Time Alignment of LLMs Jonathan Geuter et.al. 2506.04118 null
2025-06-04 AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment Anastasiia Ivanova et.al. 2506.04089 null
2025-06-04 Multimodal Tabular Reasoning with Privileged Structured Information Jun-Peng Jiang et.al. 2506.04088 null
2025-06-04 Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback Xiaoying Zhang et.al. 2506.03106 null

2025-06-03

Publish Date Title Authors PDF Code
2025-06-03 Entity-Augmented Neuroscience Knowledge Retrieval Using Ontology and Semantic Understanding Capability of LLM Pralaypati Ta et.al. 2506.03145 null
2025-06-03 Not All Tokens Are Meant to Be Forgotten Xiangyu Zhou et.al. 2506.03142 null
2025-06-03 SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation Siqi Chen et.al. 2506.03139 null
2025-06-03 Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning Yinjie Wang et.al. 2506.03136 null
2025-06-03 Native-Resolution Image Synthesis Zidong Wang et.al. 2506.03131 null
2025-06-03 AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation Lu Qiu et.al. 2506.03126 null
2025-06-03 AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation Prashanth Vijayaraghavan et.al. 2506.03122 null
2025-06-03 Retrieval-Augmented Generation as Noisy In-Context Learning: A Unified Theory and Risk Bounds Yang Guo et.al. 2506.03100 null
2025-06-03 TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models Chetwin Low et.al. 2506.03099 null
2025-06-03 DPO Learning with LLMs-Judge Signal for Computer Use Agents Man Luo et.al. 2506.03095 null
2025-06-03 Literary Evidence Retrieval via Long-Context Language Models Katherine Thai et.al. 2506.03090 null
2025-06-03 SG2VID: Scene Graphs Enable Fine-Grained Control for Video Synthesis Ssharvien Kumar Sivakumar et.al. 2506.03082 null
2025-06-03 ORV: 4D Occupancy-centric Robot Video Generation Xiuyu Yang et.al. 2506.03079 null
2025-06-03 StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs Qijun Luo et.al. 2506.03077 null
2025-06-03 EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models Mingzhe Li et.al. 2506.03067 null
2025-06-03 MAEBE: Multi-Agent Emergent Behavior Framework Sinem Erisken et.al. 2506.03053 null
2025-06-03 Facts Do Care About Your Language: Assessing Answer Quality of Multilingual LLMs Yuval Kansal et.al. 2506.03051 null
2025-06-03 Sample complexity of Schrödinger potential estimation Nikita Puchkin et.al. 2506.03043 null
2025-06-03 Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective Jintian Shao et.al. 2506.03038 null

2025-06-02

Publish Date Title Authors PDF Code
2025-06-02 Guiding Generative Storytelling with Knowledge Graphs Zhijun Pan et.al. 2505.24803 null

2025-05-30

Publish Date Title Authors PDF Code
2025-05-30 Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents Yaxin Luo et.al. 2505.24878 null
2025-05-30 ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL Yu Zhang et.al. 2505.24875 null
2025-05-30 MiniMax-Remover: Taming Bad Noise Helps Video Object Removal Bojia Zi et.al. 2505.24873 null
2025-05-30 MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning Yiqing Liang et.al. 2505.24871 null
2025-05-30 GenSpace: Benchmarking Spatially-Aware Image Generation Zehan Wang et.al. 2505.24870 null
2025-05-30 SiLVR: A Simple Language-based Video Reasoning Framework Ce Zhang et.al. 2505.24869 null
2025-05-30 TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection Xinqi Xiong et.al. 2505.24866 null
2025-05-30 ViStoryBench: Comprehensive Benchmark Suite for Story Visualization Cailin Zhuang et.al. 2505.24862 null
2025-05-30 MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs Gabrielle Kaili-May Liu et.al. 2505.24858 null
2025-05-30 Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning Shuyao Xu et.al. 2505.24850 null
2025-05-30 MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning Jingyan Shen et.al. 2505.24846 null
2025-05-30 Cascading Adversarial Bias from Injection to Distillation in Language Models Harsh Chaudhari et.al. 2505.24842 null
2025-05-30 Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck Yuwen Tan et.al. 2505.24840 null
2025-05-30 VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software Brandon Man et.al. 2505.24838 null
2025-05-30 Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs Juraj Vladika et.al. 2505.24830 null
2025-05-30 LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text Li yunhan et.al. 2505.24826 null
2025-05-30 PhySense: Principle-Based Physics Reasoning Benchmarking for Large Language Models Yinggan Xu et.al. 2505.24823 null
2025-05-30 Inference Acceleration of Autoregressive Normalizing Flows by Selective Jacobi Decoding Jiaru Zhang et.al. 2505.24791 null
2025-05-30 Drop Dropout on Single-Epoch Language Model Pretraining Houjun Liu et.al. 2505.24788 null
2025-05-30 OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation Size Wu et.al. 2505.23661 **[link](https://github.com/wusize/openuni)**

2025-05-29

Publish Date Title Authors PDF Code
2025-05-29 From Chat Logs to Collective Insights: Aggregative Question Answering Wentao Zhang et.al. 2505.23765 null
2025-05-29 DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning Ziyin Zhang et.al. 2505.23754 **[link](https://github.com/jiahao004/deeptheorem)**
2025-05-29 ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks Akashah Shabbir et.al. 2505.23752 **[link](https://github.com/mbzuai-oryx/thinkgeo)**
2025-05-29 MAGREF: Masked Guidance for Any-Reference Video Generation Yufan Deng et.al. 2505.23742 **[link](https://github.com/magref-video/magref)**
2025-05-29 Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time Mohamad Chehade et.al. 2505.23729 null
2025-05-29 MuLoCo: Muon is a practical inner optimizer for DiLoCo Benjamin Thérien et.al. 2505.23725 null
2025-05-29 SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA Minrui Luo et.al. 2505.23724 null
2025-05-29 ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering Zexi Liu et.al. 2505.23723 **[link](https://github.com/zeroxleo/ml-agent)**
2025-05-29 Label-Guided In-Context Learning for Named Entity Recognition Fan Bai et.al. 2505.23722 **[link](https://github.com/bflashcp3f/deer)**
2025-05-29 Don't Take the Premise for Granted: Evaluating the Premise Critique Ability of Large Language Models Jinzhe Li et.al. 2505.23715 **[link](https://github.com/mlgroupjlu/premise_critique)**
2025-05-29 SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models Zixiang Xu et.al. 2505.23713 **[link](https://github.com/xzx34/socialmaze)**
2025-05-29 Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's Math Capability Ruida Wang et.al. 2505.23703 null
2025-05-29 Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation Ziling Cheng et.al. 2505.23701 null
2025-05-29 Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics Ran Zhang et.al. 2505.23695 **[link](https://github.com/77luvc/d2d_data2dashboard)**
2025-05-29 VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos Tingyu Song et.al. 2505.23693 **[link](https://github.com/sighingsnow/vf-eval)**
2025-05-29 ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions Beong-woo Kwak et.al. 2505.23662 null
2025-05-29 Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation Hongxiang Zhang et.al. 2505.23657 null
2025-05-29 ARC: Argument Representation and Coverage Analysis for Zero-Shot Long Document Summarization with Instruction Following LLMs Mohamed Elaraby et.al. 2505.23654 null
2025-05-29 How does Transformer Learn Implicit Reasoning? Jiaran Ye et.al. 2505.23653 **[link](https://github.com/jiaran-ye/implicitreasoning)**
2025-05-29 Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems Hoang Pham et.al. 2505.22571 null

2025-05-28

Publish Date Title Authors PDF Code
2025-05-28 Zero-Shot Vision Encoder Grafting via LLM Surrogates Kaiyu Yue et.al. 2505.22664 **[link](https://github.com/facebookresearch/zero)**
2025-05-28 AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models Feng Luo et.al. 2505.22662 null
2025-05-28 GuessArena: Guess Who I Am? A Self-Adaptive Framework for Evaluating LLMs in Domain-Specific Knowledge and Reasoning Qingchen Yu et.al. 2505.22661 null
2025-05-28 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model Wenbo Hu et.al. 2505.22657 null
2025-05-28 Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents Michael Kirchhof et.al. 2505.22655 null
2025-05-28 The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason Ang Lv et.al. 2505.22653 null
2025-05-28 On Learning Verifiers for Chain-of-Thought Reasoning Maria-Florina Balcan et.al. 2505.22650 null
2025-05-28 Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese Hanjia Lyu et.al. 2505.22645 null
2025-05-28 Learning Composable Chains-of-Thought Fangcong Yin et.al. 2505.22635 null
2025-05-28 Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs Ziling Cheng et.al. 2505.22630 null
2025-05-28 Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding Chengyue Wu et.al. 2505.22618 null
2025-05-28 The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Ganqu Cui et.al. 2505.22617 null
2025-05-28 Self-Error-Instruct: Generalizing from Errors for LLMs Mathematical Reasoning Erxin Yu et.al. 2505.22591 null
2025-05-28 Precise In-Parameter Concept Erasure in Large Language Models Yoav Gur-Arieh et.al. 2505.22586 null
2025-05-28 DocReRank: Single-Page Hard Negative Query Generation for Training Multi-Modal RAG Rerankers Navve Wasserman et.al. 2505.22584 null
2025-05-28 Less, but Better: Efficient Multilingual Expansion for LLMs via Layer-wise Mixture-of-Experts Xue Zhang et.al. 2505.22582 null
2025-05-28 Fusion Steering: Prompt-Specific Activation Control Waldemar Chang et.al. 2505.22572 null
2025-05-28 Universal Visuo-Tactile Video Understanding for Embodied Interaction Yifan Xie et.al. 2505.22566 null
2025-05-28 Do Large Language Models Think Like the Brain? Sentence-Level Evidence from fMRI and Hierarchical Embeddings Yu Lei et.al. 2505.22563 null
2025-05-28 Diagnosing and Resolving Cloud Platform Instability with Multi-modal RAG LLMs Yifan Wang et.al. 2505.21419 null

2025-05-27

Publish Date Title Authors PDF Code
2025-05-27 How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective Shimao Zhang et.al. 2505.21505 null
2025-05-27 Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making Yihan Wang et.al. 2505.21503 null
2025-05-27 Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers Wei Pang et.al. 2505.21497 null
2025-05-27 Reinforcing General Reasoning without Verifiers Xiangxin Zhou et.al. 2505.21493 null
2025-05-27 Hardware-Efficient Attention for Fast Decoding Ted Zadouri et.al. 2505.21487 null
2025-05-27 Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming Yang Yang et.al. 2505.21486 null
2025-05-27 Are Language Models Consequentialist or Deontological Moral Reasoners? Keenan Samway et.al. 2505.21479 null
2025-05-27 Policy Optimized Text-to-Image Pipeline Design Uri Gadot et.al. 2505.21478 null
2025-05-27 Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration Zijun Liu et.al. 2505.21471 **[link](https://github.com/thunlp-mt/extagents)**
2025-05-27 PropMolFlow: Property-guided Molecule Generation with Geometry-Complete Flow Matching Cheng Zeng et.al. 2505.21469 null
2025-05-27 Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance Shintaro Ozaki et.al. 2505.21458 null
2025-05-27 Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling Xiangxin Zhou et.al. 2505.21452 null
2025-05-27 Words Like Knives: Backstory-Personalized Modeling and Detection of Violent Communication Jocelyn Shen et.al. 2505.21451 null
2025-05-27 OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers Ziqiao Peng et.al. 2505.21448 null
2025-05-27 Can Large Reasoning Models Self-Train? Sheikh Shafayat et.al. 2505.21444 null
2025-05-27 Hume: Introducing System-2 Thinking in Visual-Language-Action Model Haoming Song et.al. 2505.21432 null
2025-05-27 Policy Induction: Predicting Startup Success via Explainable Memory-Augmented In-Context Learning Xianling Mu et.al. 2505.21427 null
2025-05-27 GUARD:Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation Naizhu Jin et.al. 2505.21425 null
2025-05-27 Autonomous Multi-Modal LLM Agents for Treatment Planning in Focused Ultrasound Ablation Surgery Lina Zhao et.al. 2505.21418 null
2025-05-27 TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent Dominik Meier et.al. 2505.20118 null
2025-05-27 Adaptive Deep Reasoning: Triggering Deep Thinking When Needed Yunhao Wang et.al. 2505.20101 null

2025-05-26

Publish Date Title Authors PDF Code
2025-05-26 Pangu Light: Weight Re-Initialization for Pruning and Accelerating LLMs Hanting Chen et.al. 2505.20155 null
2025-05-26 UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models Xueyan Zhang et.al. 2505.20154 null
2025-05-26 FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities Jin Wang et.al. 2505.20147 null
2025-05-26 StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs Jialin Yang et.al. 2505.20139 null
2025-05-26 Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers Zhengliang Shi et.al. 2505.20128 null
2025-05-26 Agentic AI Process Observability: Discovering Behavioral Variability Fabiana Fournier et.al. 2505.20127 null
2025-05-26 Understanding Generalization in Diffusion Models via Probability Flow Distance Huijie Zhang et.al. 2505.20123 null
2025-05-26 Named Entity Recognition in Historical Italian: The Case of Giacomo Leopardi's Zibaldone Cristian Santini et.al. 2505.20113 null
2025-05-26 ResSVD: Residual Compensated SVD for Large Language Model Compression Haolei Bai et.al. 2505.20112 null
2025-05-26 Proxy-Free GFlowNet Ruishuo Chen et.al. 2505.20110 null
2025-05-26 Language-Agnostic Suicidal Risk Detection Using Large Language Models June-Woo Kim et.al. 2505.20109 null
2025-05-26 From Data to Modeling: Fully Open-vocabulary Scene Graph Generation Zuyao Chen et.al. 2505.20106 null
2025-05-26 AdaTP: Attention-Debiased Token Pruning for Video Large Language Models Fengyuan Sun et.al. 2505.20100 null
2025-05-26 Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities Chuangtao Ma et.al. 2505.20099 null
2025-05-26 S2LPP: Small-to-Large Prompt Prediction across LLMs Liang Cheng et.al. 2505.20097 null
2025-05-26 Multi-Domain Explainability of Preferences Nitay Calderon et.al. 2505.20088 null
2025-05-26 Safety Through Reasoning: An Empirical Study of Reasoning Guardrail Models Makesh Narsimhan Sreedhar et.al. 2505.20087 null
2025-05-26 Incentivizing Reasoning from Weak Supervision Yige Yuan et.al. 2505.20072 null

2025-05-23

Publish Date Title Authors PDF Code
2025-05-23 The Staircase of Ethics: Probing LLM Value Priorities through Multi-Step Induction to Complex Moral Dilemmas Ya Wu et.al. 2505.18154 null
2025-05-23 Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs Wafa Alghallabi et.al. 2505.18152 null
2025-05-23 Generative Distribution Embeddings Nic Fishman et.al. 2505.18150 null
2025-05-23 Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find Owen Bianchi et.al. 2505.18148 null
2025-05-23 Gaming Tool Preferences in Agentic LLMs Kazem Faghih et.al. 2505.18135 null
2025-05-23 Frankentext: Stitching random text fragments into long-form narratives Chau Minh Pham et.al. 2505.18128 null
2025-05-23 UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification Poojah Ganesan et.al. 2505.18122 null
2025-05-23 ProgRM: Build Better GUI Agents with Progress Rewards Danyang Zhang et.al. 2505.18121 null
2025-05-23 Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models Jiongran Wu et.al. 2505.18120 null
2025-05-23 Bridging Supervised Learning and Reinforcement Learning in Math Reasoning Huayu Chen et.al. 2505.18116 null
2025-05-23 Instructify: Demystifying Metadata to Visual Instruction Tuning Data Conversion Jacob Hansen et.al. 2505.18115 null
2025-05-23 Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM Zinuo Li et.al. 2505.18110 null
2025-05-23 ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework Lisheng Huang et.al. 2505.18105 null
2025-05-23 How Can I Publish My LLM Benchmark Without Giving the True Answers Away? Takashi Ishida et.al. 2505.18102 null
2025-05-23 Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL Joey Hong et.al. 2505.18098 null
2025-05-23 DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations Ziqiao Peng et.al. 2505.18096 null
2025-05-23 QwenLong-CPRS: Towards $\infty$ -LLMs with Dynamic Context Optimization Weizhou Shen et.al. 2505.18092 null
2025-05-23 Data Mixing Can Induce Phase Transitions in Knowledge Acquisition Xinran Gu et.al. 2505.18091 null
2025-05-23 Stable Reinforcement Learning for Efficient Reasoning Muzhi Dai et.al. 2505.18086 null
2025-05-23 Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding Xiaoyi Zhang et.al. 2505.18079 null
2025-05-23 VeriFastScore: Speeding up long-form factuality evaluation Rishanth Rajendhran et.al. 2505.16973 **[link](https://github.com/rishanthrajendhran/verifastscore)**

2025-05-22

Publish Date Title Authors PDF Code
2025-05-22 GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning Chengqi Duan et.al. 2505.17022 **[link](https://github.com/gogoduan/got-r1)**
2025-05-22 CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms Shilin Yan et.al. 2505.17020 **[link](https://github.com/shilinyan99/crosslmm)**
2025-05-22 Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO Chengzhuo Tong et.al. 2505.17017 **[link](https://github.com/ziyuguo99/image-generation-cot)**
2025-05-22 R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning Huatong Song et.al. 2505.17005 **[link](https://github.com/rucaibox/r1-searcher-plus)**
2025-05-22 Do Large Language Models Excel in Complex Logical Reasoning with Formal Language? Jin Jiang et.al. 2505.16998 **[link](https://github.com/jiangjin1999/formaleval)**
2025-05-22 X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs Rui Ye et.al. 2505.16997 null
2025-05-22 DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization Chao Zhang et.al. 2505.16995 null
2025-05-22 $\text{R}^2\text{ec}$ : Towards Large Recommender Models with Reasoning Runyang You et.al. 2505.16994 **[link](https://github.com/yryangang/rrec)**
2025-05-22 MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems Rui Ye et.al. 2505.16988 **[link](https://github.com/masworks/maslab)**
2025-05-22 T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning Amartya Chakraborty et.al. 2505.16986 null
2025-05-22 UFT: Unifying Supervised and Reinforcement Fine-Tuning Mingyang Liu et.al. 2505.16984 **[link](https://github.com/liumy2010/uft)**
2025-05-22 LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding Junlong Tong et.al. 2505.16983 **[link](https://github.com/eit-nlp/streamingllm)**
2025-05-22 Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine Adib Bazgir et.al. 2505.16982 null
2025-05-22 Know the Ropes: A Heuristic Strategy for LLM-based Multi-Agent System Design Zhenkun Li et.al. 2505.16979 null
2025-05-22 HyGenar: An LLM-Driven Hybrid Genetic Algorithm for Few-Shot Grammar Generation Weizhi Tang et.al. 2505.16978 **[link](https://github.com/rutatang/hygenar)**
2025-05-22 Creatively Upscaling Images with Global-Regional Priors Yurui Qian et.al. 2505.16976 null
2025-05-22 SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development Yaxin Du et.al. 2505.16975 **[link](https://github.com/justlittlewhite/swe-dev)**
2025-05-22 CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark Ahmed Heakl et.al. 2505.16968 **[link](https://github.com/gustavostahl/cass)**
2025-05-22 Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval Nandan Thakur et.al. 2505.16967 null
2025-05-22 HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving Zhiwen Chen et.al. 2505.15793 null
2025-05-22 Multi-modal Integration Analysis of Alzheimer's Disease Using Large Language Models and Knowledge Graphs Kanan Kiguchi et.al. 2505.15747 null

2025-05-21

Publish Date Title Authors PDF Code
2025-05-21 Learning to Reason via Mixture-of-Thought for Logical Reasoning Tong Zheng et.al. 2505.15817 **[link](https://github.com/zhengkid/Truth_Table_Logical_Reasoning)**
2025-05-21 On the creation of narrow AI: hierarchy and nonlocality of neural network skills Eric J. Michaud et.al. 2505.15811 null
2025-05-21 Neural Conditional Transport Maps Carlos Rodriguez-Pardo et.al. 2505.15808 null
2025-05-21 Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering Hwan Chang et.al. 2505.15805 null
2025-05-21 STAR-R1: Spacial TrAnsformation Reasoning by Reinforcing Multimodal LLMs Zongzhao Li et.al. 2505.15804 null
2025-05-21 Interspatial Attention for Efficient 4D Human Video Generation Ruizhi Shao et.al. 2505.15800 null
2025-05-21 Reverse Engineering Human Preferences with Reinforcement Learning Lisa Alazraki et.al. 2505.15795 null
2025-05-21 Long-Form Information Alignment Evaluation Beyond Atomic Facts Danna Zheng et.al. 2505.15792 null
2025-05-21 Large Language Models as Computable Approximations to Solomonoff Induction Jun Wan et.al. 2505.15784 null
2025-05-21 IA-T2I: Internet-Augmented Text-to-Image Generation Chuanhao Li et.al. 2505.15779 null
2025-05-21 Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space Zhen Zhang et.al. 2505.15778 **[link](https://github.com/eric-ai-lab/soft-thinking)**
2025-05-21 Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention Huanxuan Liao et.al. 2505.15774 null
2025-05-21 Constructing a 3D Town from a Single Image Kaizhi Zheng et.al. 2505.15765 null
2025-05-21 Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval Taiye Chen et.al. 2505.15753 null
2025-05-21 Evolutionary Computation and Large Language Models: A Survey of Methods, Synergies, and Applications Dikshit Chauhan et.al. 2505.15741 null
2025-05-21 HybridProver: Augmenting Theorem Proving with LLM-Driven Proof Synthesis and Refinement Jilin Hu et.al. 2505.15740 null
2025-05-21 Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses Xiaoxue Yang et.al. 2505.15738 null
2025-05-21 DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning Gaurav Srivastava et.al. 2505.15734 null

2025-05-20

Publish Date Title Authors PDF Code
2025-05-20 Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning Haolei Xu et.al. 2505.14684 null
2025-05-20 NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search Sunhao Dai et.al. 2505.14680 null
2025-05-20 UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models Xiaojie Gu et.al. 2505.14679 null
2025-05-20 Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning Jiaer Xia et.al. 2505.14677 null
2025-05-20 Training-Free Watermarking for Autoregressive Image Generation Yu Tong et.al. 2505.14673 null
2025-05-20 Quartet: Native FP4 Training Can Be Optimal for Large Language Models Roberto L. Castro et.al. 2505.14669 null
2025-05-20 ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions Bufang Yang et.al. 2505.14668 null
2025-05-20 SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment Wonje Jeung et.al. 2505.14667 null
2025-05-20 Abacus: A Cost-Based Optimizer for Semantic Operator Systems Matthew Russo et.al. 2505.14661 null
2025-05-20 Cost-Augmented Monte Carlo Tree Search for LLM-Assisted Planning Zihao Zhang et.al. 2505.14656 null
2025-05-20 Beyond Words: Multimodal LLM Knows When to Speak Zikai Liao et.al. 2505.14654 null
2025-05-20 General-Reasoner: Advancing LLM Reasoning Across All Domains Xueguang Ma et.al. 2505.14652 null
2025-05-20 Think Only When You Need with Large Hybrid-Reasoning Models Lingjie Jiang et.al. 2505.14631 null
2025-05-20 KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models Fnu Mohbat et.al. 2505.14629 **[link](https://github.com/mohbattharani/kerl)**
2025-05-20 Debating for Better Reasoning: An Unsupervised Multimodal Approach Ashutosh Adhikari et.al. 2505.14627 null
2025-05-20 TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning Zhangchen Xu et.al. 2505.14625 null
2025-05-20 Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs Morgan Lindsay Heisler et.al. 2505.14620 null
2025-05-20 Linear Control of Test Awareness Reveals Differential Compliance in Reasoning Models Sahar Abdelnabi et.al. 2505.14617 null
2025-05-20 SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas Anjiang Wei et.al. 2505.14615 null
2025-05-20 sudoLLM : On Multi-role Alignment of Language Models Soumadeep Saha et.al. 2505.14607 null
2025-05-20 Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Reasoning Adam Štorek et.al. 2505.13353 null

2025-05-19

Publish Date Title Authors PDF Code
2025-05-19 Mean Flows for One-step Generative Modeling Zhengyang Geng et.al. 2505.13447 null
2025-05-19 Unlocking Non-Invasive Brain-to-Text Dulhan Jayalath et.al. 2505.13446 null
2025-05-19 Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards Xiaoyuan Liu et.al. 2505.13445 null
2025-05-19 Optimizing Anytime Reasoning via Budget Relative Policy Optimization Penghui Qi et.al. 2505.13438 **[link](https://github.com/sail-sg/anytimereasoner)**
2025-05-19 Synthetic-Powered Predictive Inference Meshi Bashari et.al. 2505.13432 null
2025-05-19 Fine-tuning Quantized Neural Networks with Zeroth-order Optimization Sifeng Shang et.al. 2505.13430 null
2025-05-19 Learnware of Language Models: Specialized Small Language Models Can Do Big Zhi-Hao Tan et.al. 2505.13425 null
2025-05-19 Make Still Further Progress: Chain of Thoughts for Tabular Data Leaderboard Si-Yang Liu et.al. 2505.13421 null
2025-05-19 Dementia Through Different Eyes: Explainable Modeling of Human and LLM Perceptions for Early Awareness Lotem Peled-Cohen et.al. 2505.13418 null
2025-05-19 Gluon: Making Muon & Scion Great Again! (Bridging Theory and Practice of LMO-based Optimizers for LLMs) Artem Riabinin et.al. 2505.13416 null
2025-05-19 AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database Rong Bian et.al. 2505.13406 null
2025-05-19 MR. Judge: Multimodal Reasoner as a Judge Renjie Pi et.al. 2505.13403 null
2025-05-19 Thinkless: LLM Learns When to Think Gongfan Fang et.al. 2505.13379 **[link](https://github.com/vainf/thinkless)**
2025-05-19 Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation Yasi Zhang et.al. 2505.13377 null
2025-05-19 Seeing, Saying, Solving: An LLM-to-TL Framework for Cooperative Robots Dan BW Choe et.al. 2505.13376 null
2025-05-19 Minimum-Excess-Work Guidance Christopher Kolloff et.al. 2505.13375 null
2025-05-19 What Prompts Don't Say: Understanding and Managing Underspecification in LLM Prompts Chenyang Yang et.al. 2505.13360 null
2025-05-19 One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling Nimrod Berman et.al. 2505.13358 **[link](https://github.com/azencot-group/kdm)**
2025-05-19 Multi-Armed Bandits Meet Large Language Models Djallel Bouneffouf et.al. 2505.13355 null

2025-05-16

Publish Date Title Authors PDF Code
2025-05-16 QVGen: Pushing the Limit of Quantized Video Generative Models Yushi Huang et.al. 2505.11497 null
2025-05-16 SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning Yige Xu et.al. 2505.11484 null
2025-05-16 Improving Assembly Code Performance with Large Language Models via Reinforcement Learning Anjiang Wei et.al. 2505.11480 null
2025-05-16 HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages Zhilin Wang et.al. 2505.11475 null
2025-05-16 Disentangling Reasoning and Knowledge in Medical Large Language Models Rahul Thapa et.al. 2505.11462 null
2025-05-16 ProxyPrompt: Securing System Prompts against Prompt Extraction Attacks Zhixiong Zhuang et.al. 2505.11459 null
2025-05-16 LLMs unlock new paths to monetizing exploits Nicholas Carlini et.al. 2505.11449 null
2025-05-16 Is Compression Really Linear with Code Intelligence? Xianzhen Luo et.al. 2505.11441 null
2025-05-16 MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production Chao Jin et.al. 2505.11432 null
2025-05-16 When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs Xiaomin Li et.al. 2505.11423 null
2025-05-16 EdgeWisePersona: A Dataset for On-Device User Profiling from Natural Language Interactions Patryk Bartkowiak et.al. 2505.11417 null
2025-05-16 MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems Yinsicheng Jiang et.al. 2505.11415 null
2025-05-16 CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs Sijia Chen et.al. 2505.11413 null
2025-05-16 Visual Planning: Let's Think Only with Images Yi Xu et.al. 2505.11409 null
2025-05-16 EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large Language Models Bohao Xing et.al. 2505.11405 null
2025-05-16 Can AI automatically analyze public opinion? A LLM agents-based agentic pipeline for timely public opinion analysis Jing Liu et.al. 2505.11401 null
2025-05-16 GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents Lingxiao Diao et.al. 2505.11368 null
2025-05-16 Phare: A Safety Probe for Large Language Models Pierre Le Jeune et.al. 2505.11365 null
2025-05-16 LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors Rao Ma et.al. 2505.11352 null
2025-05-16 Benchmarking Critical Questions Generation: A Challenging Reasoning Task for Large Language Models Banca Calvo Figueras et.al. 2505.11341 null

2025-05-15

Publish Date Title Authors PDF Code
2025-05-15 T2A-Feedback: Improving Basic Capabilities of Text-to-Audio Generation via Fine-grained AI Feedback Zehan Wang et.al. 2505.10561 null
2025-05-15 Neural Thermodynamic Laws for Large Language Model Training Ziming Liu et.al. 2505.10559 null
2025-05-15 Flowing Through Hilbert Space: Quantum-Enhanced Generative Models for Lattice Field Theory Jehu Martinez et.al. 2505.10553 null
2025-05-15 CheXGenBench: A Unified Benchmark For Fidelity, Privacy and Utility of Synthetic Chest Radiographs Raman Dutt et.al. 2505.10496 **[link](https://github.com/Raman1121/CheXGenBench)**
2025-05-15 RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs Vibha Belavadi et.al. 2505.10495 null
2025-05-15 Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective Yutao Mou et.al. 2505.10494 **[link](https://github.com/murraytom/cov-eval)**
2025-05-15 CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning Shaohan Wang et.al. 2505.10493 null
2025-05-15 Campus AI vs Commercial AI: A Late-Breaking Study on How LLM As-A-Service Customizations Shape Trust and Usage Patterns Leon Hannig et.al. 2505.10490 null
2025-05-15 UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation Yi Li et.al. 2505.10483 null
2025-05-15 Large Language Models for Cancer Communication: Evaluating Linguistic Quality, Safety, and Accessibility in Generative AI Agnik Saha et.al. 2505.10472 null
2025-05-15 AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge Ranjan Sapkota et.al. 2505.10468 null
2025-05-15 Superposition Yields Robust Neural Scaling Yizhou liu et.al. 2505.10465 **[link](https://github.com/liuyz0/superpositionscaling)**
2025-05-15 Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations? Pedro Orvalho et.al. 2505.10443 null
2025-05-15 Learning to Think: Information-Theoretic Reinforcement Fine-Tuning for LLMs Jingyao Wang et.al. 2505.10425 null
2025-05-15 Are LLM-generated plain language summaries truly understandable? A large-scale crowdsourced evaluation Yue Guo et.al. 2505.10409 null
2025-05-15 Two-Stage Generative Model for Intracranial Aneurysm Meshes with Morphological Marker Conditioning Wenhao Ding et.al. 2505.10407 **[link](https://github.com/anonymousaneug/aneug)**
2025-05-15 Visual Fidelity Index for Generative Semantic Communications with Critical Information Embedding Jianhao Huang et.al. 2505.10405 null
2025-05-15 Rethinking Repetition Problems of LLMs in Code Generation Yihong Dong et.al. 2505.10402 **[link](https://github.com/lyc127/rpg)**
2025-05-15 Multi-domain Multilingual Sentiment Analysis in Industry: Predicting Aspect-based Opinion Quadruples Benjamin White et.al. 2505.10389 null
2025-05-15 Are Sparse Autoencoders Useful for Java Function Bug Detection? Rui Melo et.al. 2505.10375 null
2025-05-15 Beyond Likes: How Normative Feedback Complements Engagement Signals on Social Media Yuchen Wu et.al. 2505.09583 null
2025-05-15 SafePath: Conformal Prediction for Safe LLM-Based Autonomous Navigation Achref Doula et.al. 2505.09427 null

2025-05-14

Publish Date Title Authors PDF Code
2025-05-14 Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors Nicolas Dupuis et.al. 2505.09610 null
2025-05-14 Adversarial Suffix Filtering: a Defense Pipeline for LLMs David Khachaturov et.al. 2505.09602 null
2025-05-14 How Hungry is AI? Benchmarking Energy, Water, and Carbon Footprint of LLM Inference Nidhal Jegham et.al. 2505.09598 null
2025-05-14 WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models Abdullah Mushtaq et.al. 2505.09595 null
2025-05-14 Ethics and Persuasion in Reinforcement Learning from Human Feedback: A Procedural Rhetorical Approach Shannon Lodoen et.al. 2505.09576 null
2025-05-14 MIGRATION-BENCH: Repository-Level Code Migration Benchmark from Java 8 Linbo Liu et.al. 2505.09569 null
2025-05-14 Layered Unlearning for Adversarial Relearning Timothy Qian et.al. 2505.09500 **[link](https://github.com/12tqian/layered-unlearning)**
2025-05-14 Card Sorting Simulator: Augmenting Design of Logical Information Architectures with Large Language Models Eduard Kuric et.al. 2505.09478 null
2025-05-14 Deploying Foundation Model-Enabled Air and Ground Robots in the Field: Challenges and Opportunities Zachary Ravichandran et.al. 2505.09477 null
2025-05-14 Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM? Andrew Rouditchenko et.al. 2505.09439 null
2025-05-14 Evaluating GPT- and Reasoning-based Large Language Models on Physics Olympiad Problems: Surpassing Human Performance and Implications for Educational Assessment Paul Tschisgale et.al. 2505.09438 null
2025-05-14 CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios Raghav Garg et.al. 2505.09436 **[link](https://github.com/kapilsprinklr/cxmarena)**
2025-05-14 The Influence of Human-inspired Agentic Sophistication in LLM-driven Strategic Reasoners Vince Trencsenyi et.al. 2505.09396 null
2025-05-14 Quantum-Enhanced Parameter-Efficient Learning for Typhoon Trajectory Forecasting Chen-Yu Liu et.al. 2505.09395 null
2025-05-14 Qwen3 Technical Report An Yang et.al. 2505.09388 null
2025-05-14 Efficient Modelling of Lyman-α opacity fluctuations during late EoR Barun Maity et.al. 2505.09369 null
2025-05-14 Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis Bingxin Ke et.al. 2505.09358 null
2025-05-14 Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Chenggang Zhao et.al. 2505.09343 null
2025-05-14 Granite-speech: open-source speech-aware LLMs with strong English ASR capabilities George Saon et.al. 2505.08699 null

2025-05-13

Publish Date Title Authors PDF Code
2025-05-13 PCS-UQ: Uncertainty Quantification via the Predictability-Computability-Stability Framework Abhineet Agarwal et.al. 2505.08784 null
2025-05-13 CodePDE: An Inference Framework for LLM-driven PDE Solver Generation Shanda Li et.al. 2505.08783 null
2025-05-13 Generative Molecular Design with Steerable and Granular Synthesizability Control Jeff Guo et.al. 2505.08774 null
2025-05-13 SPAT: Sensitivity-based Multihead-attention Pruning on Time Series Forecasting Models Suhan Guo et.al. 2505.08768 null
2025-05-13 AC-Reason: Towards Theory-Guided Actual Causality Reasoning with Large Language Models Yanxi Zhang et.al. 2505.08750 null
2025-05-13 DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models Xiaoyang Chen et.al. 2505.08744 **[link](https://github.com/deepmathllm/deepmath)**
2025-05-13 Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies Xiaoliang Luo et.al. 2505.08739 null
2025-05-13 NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context Ben Yao et.al. 2505.08734 null
2025-05-13 Securing RAG: A Risk Assessment and Mitigation Framework Lukas Ammann et.al. 2505.08728 null
2025-05-13 Memorization-Compression Cycles Improve Generalization Fangyuan Yu et.al. 2505.08727 null
2025-05-13 PWC-MoE: Privacy-Aware Wireless Collaborative Mixture of Experts Yang Su et.al. 2505.08719 null
2025-05-13 LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs K M Sajjadul Islam et.al. 2505.08704 null
2025-05-13 A Survey of Deep Learning for Complex Speech Spectrograms Yuying Xie et.al. 2505.08694 null
2025-05-13 Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation Sheng Liang et.al. 2505.08690 null
2025-05-13 Revealing economic facts: LLMs know more than they say Marcus Buckmann et.al. 2505.08662 null
2025-05-13 Enhancing Software Development with Context-Aware Conversational Agents: A User Study on Developer Interactions with Chatbots Glaucia Melo et.al. 2505.08648 null
2025-05-13 WixQA: A Multi-Dataset Benchmark for Enterprise Retrieval-Augmented Generation Dvir Cohen et.al. 2505.08643 null
2025-05-13 TRAIL: Trace Reasoning and Agentic Issue Localization Darshan Deshpande et.al. 2505.08638 null
2025-05-13 Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models Donghoon Kim et.al. 2505.08622 null
2025-05-13 Codifying Character Logic in Role-Playing Letian Peng et.al. 2505.07705 **[link](https://github.com/KomeijiForce/Codified_Profile_Koishiday_2025)**
2025-05-13 OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit Arun S. Maiya et.al. 2505.07672 **[link](https://github.com/amaiya/onprem)**

2025-05-12

Publish Date Title Authors PDF Code
2025-05-12 H $^{\mathbf{3}}$ DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning Yiyang Lu et.al. 2505.07819 null
2025-05-12 DanceGRPO: Unleashing GRPO on Visual Generation Zeyue Xue et.al. 2505.07818 null
2025-05-12 Continuous Visual Autoregressive Generation via Score Maximization Chenze Shao et.al. 2505.07812 **[link](https://github.com/shaochenze/ear)**
2025-05-12 Improving Trajectory Stitching with Flow Models Reece O'Mahoney et.al. 2505.07802 null
2025-05-12 Overflow Prevention Enhances Long-Context Recurrent LLMs Assaf Ben-Kish et.al. 2505.07793 null
2025-05-12 Domain Regeneration: How well do LLMs match syntactic properties of text domains? Da Ju et.al. 2505.07784 null
2025-05-12 Relative Overfitting and Accept-Reject Framework Yanxin Liu et.al. 2505.07783 null
2025-05-12 MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering Rushi Qiang et.al. 2505.07782 null
2025-05-12 Synthesizing Diverse Network Flow Datasets with Scalable Dynamic Multigraph Generation Arya Grayeli et.al. 2505.07777 null
2025-05-12 Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving Xinji Mai et.al. 2505.07773 **[link](https://github.com/anonymize-author/agentrl)**
2025-05-12 Enhancing Code Generation via Bidirectional Comment-Level Mutual Grounding Yifeng Di et.al. 2505.07768 null
2025-05-12 LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention Jiangling Zhang et.al. 2505.07734 null
2025-05-12 Spoken Language Understanding on Unseen Tasks With In-Context Learning Neeraj Agrawal et.al. 2505.07731 null
2025-05-12 Circuit Partitioning Using Large Language Models for Quantum Compilation and Simulations Pranav Sinha et.al. 2505.07711 null
2025-05-12 PatchTrack: A Comprehensive Analysis of ChatGPT's Influence on Pull Request Outcomes Daniel Ogenrwot et.al. 2505.07700 null
2025-05-12 SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models Hang Wu et.al. 2505.07680 null
2025-05-12 Benchmarking Retrieval-Augmented Generation for Chemistry Xianrui Zhong et.al. 2505.07671 null
2025-05-12 A Case Study Investigating the Role of Generative AI in Quality Evaluations of Epics in Agile Software Development Werner Geyer et.al. 2505.07664 null

2025-05-09

Publish Date Title Authors PDF Code
2025-05-09 From Millions of Tweets to Actionable Insights: Leveraging LLMs for User Profiling Vahid Rahimzadeh et.al. 2505.06184 null
2025-05-09 A Large Language Model-Enhanced Q-learning for Capacitated Vehicle Routing Problem with Time Windows Linjiang Cao et.al. 2505.06178 null
2025-05-09 MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills Niladri Shekhar Dutt et.al. 2505.06176 null
2025-05-09 Turbo-ICL: In-Context Learning-Based Turbo Equalization Zihang Song et.al. 2505.06175 null
2025-05-09 A Scaling Law for Token Efficiency in LLM Fine-Tuning Under Fixed Compute Budgets Ryan Lagasse et.al. 2505.06150 null
2025-05-09 Can Prompting LLMs Unlock Hate Speech Detection across Languages? A Zero-shot and Few-shot Study Faeze Ghorbanpour et.al. 2505.06149 null
2025-05-09 ELA-ZSON: Efficient Layout-Aware Zero-Shot Object Navigation Agent with Hierarchical Planning Jiawei Hou et.al. 2505.06131 null
2025-05-09 Constraints to Lorentz violation and ultrahigh-energy electrons in D-foamy space-times Chengyi Li et.al. 2505.06121 null
2025-05-09 LLMs Get Lost In Multi-Turn Conversation Philippe Laban et.al. 2505.06120 **[link](https://github.com/microsoft/lost_in_conversation)**
2025-05-09 Photovoltaic Defect Image Generator with Boundary Alignment Smoothing Constraint for Domain Shift Mitigation Dongying Li et.al. 2505.06117 null
2025-05-09 LLMs Outperform Experts on Challenging Biology Benchmarks Lennart Justen et.al. 2505.06108 null
2025-05-09 Free and Fair Hardware: A Pathway to Copyright Infringement-Free Verilog Generation using LLMs Sam Bush et.al. 2505.06096 null
2025-05-09 Assessing Tenstorrent's RISC-V MatMul Acceleration Capabilities Hiari Pizzini Cavagna et.al. 2505.06085 null
2025-05-09 Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information Joshua Harris et.al. 2505.06046 null
2025-05-09 Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification Leon Eshuijs et.al. 2505.06032 **[link](https://github.com/watermeleon/shortcut_mechanisms)**
2025-05-09 Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation Stefan Vasilev et.al. 2505.06027 null
2025-05-09 Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models Dawid Wisniewski et.al. 2505.06004 null
2025-05-09 Task-Adapter++: Task-specific Adaptation with Order-aware Alignment for Few-shot Action Recognition Congqi Cao et.al. 2505.06002 null
2025-05-09 Offline Multi-agent Reinforcement Learning via Score Decomposition Dan Qiao et.al. 2505.05968 null
2025-05-09 GEORCE: A Fast New Control Algorithm for Computing Geodesics Frederik Möbius Rygaard et.al. 2505.05961 null
2025-05-09 EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation Biao Yi et.al. 2505.05440 null
2025-05-09 LiTransProQA: an LLM-based Literary Translation evaluation metric with Professional Question Answering Ran Zhang et.al. 2505.05423 null

2025-05-08

Publish Date Title Authors PDF Code
2025-05-08 3D Scene Generation: A Survey Beichen Wen et.al. 2505.05474 **[link](https://github.com/hzxie/awesome-3d-scene-generation)**
2025-05-08 StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant Haibo Wang et.al. 2505.05467 null
2025-05-08 ComPO: Preference Alignment via Comparison Oracles Peter Chen et.al. 2505.05465 null
2025-05-08 Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging Shiqi Chen et.al. 2505.05464 **[link](https://github.com/shiqichen17/vlm_merging)**
2025-05-08 Conversational Process Model Redesign Nataliia Klievtsova et.al. 2505.05453 null
2025-05-08 clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations Chalamalasetti Kranti et.al. 2505.05445 null
2025-05-08 GesPrompt: Leveraging Co-Speech Gestures to Augment LLM-Based Interaction in Virtual Reality Xiyun Hu et.al. 2505.05441 null
2025-05-08 Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data Yudong Wang et.al. 2505.05427 null
2025-05-08 Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans? Valeria Pastorino et.al. 2505.05406 null
2025-05-08 Modelling and Verifying Neuronal Archetypes in Coq Abdorrahim Bahrami et.al. 2505.05362 null
2025-05-08 DSDrive: Distilling Large Language Model for Lightweight End-to-End Autonomous Driving with Unified Reasoning and Planning Wenru Liu et.al. 2505.05360 null
2025-05-08 Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization Sooyoung Park et.al. 2505.05343 **[link](https://github.com/swimmiing/ACL-SSL)**
2025-05-08 FLAM: Frame-Wise Language-Audio Modeling Yusong Wu et.al. 2505.05335 null
2025-05-08 ICon: In-Context Contribution for Automatic Data Selection Yixin Yang et.al. 2505.05327 null
2025-05-08 Toward Reasonable Parrots: Why Large Language Models Should Argue with Us by Design Elena Musi et.al. 2505.05298 null
2025-05-08 PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes Ahmed Abdelreheem et.al. 2505.05288 null
2025-05-08 HEXGEN-TEXT2SQL: Optimizing LLM Inference Request Scheduling for Agentic Text-to-SQL Workflow You Peng et.al. 2505.05286 null
2025-05-08 Latte: Transfering LLMs` Latent-level Knowledge for Few-shot Tabular Learning Ruxue Shi et.al. 2505.05237 null
2025-05-08 MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection Zhihao Zhang et.al. 2505.04594 null
2025-05-08 Defining and Quantifying Creative Behavior in Popular Image Generators Aditi Ramaswamy et.al. 2505.04497 null

2025-05-07

Publish Date Title Authors PDF Code
2025-05-07 EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning Zhenghao Xing et.al. 2505.04623 null
2025-05-07 On Path to Multimodal Generalist: General-Level and General-Bench Hao Fei et.al. 2505.04620 null
2025-05-07 OmniGIRL: A Multilingual and Multimodal Benchmark for GitHub Issue Resolution Lianghong Guo et.al. 2505.04606 null
2025-05-07 ZeroSearch: Incentivize the Search Capability of LLMs without Searching Hao Sun et.al. 2505.04588 null
2025-05-07 SlideItRight: Using AI to Find Relevant Slides and Provide Feedback for Open-Ended Questions Chloe Qianhui Zhao et.al. 2505.04584 null
2025-05-07 Comparative Analysis of Carbon Footprint in Manual vs. LLM-Assisted Code Development Kuen Sum Cheung et.al. 2505.04521 null
2025-05-07 Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs Yehui Tang et.al. 2505.04519 null
2025-05-07 Detecting Spelling and Grammatical Anomalies in Russian Poetry Texts Ilya Koziev et.al. 2505.04507 null
2025-05-07 A Design Space for the Critical Validation of LLM-Generated Tabular Data Madhav Sachdeva et.al. 2505.04487 null
2025-05-07 Efficient Flow Matching using Latent Variables Anirban Samaddar et.al. 2505.04486 null
2025-05-07 CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation Jiahao Li et.al. 2505.04481 null
2025-05-07 TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution Zhikai Zhao et.al. 2505.04480 null
2025-05-07 Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration Shigeki Karita et.al. 2505.04457 null
2025-05-07 M2Rec: Multi-scale Mamba for Efficient Sequential Recommendation Qianru Zhang et.al. 2505.04445 null
2025-05-07 Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs Mirazul Haque et.al. 2505.04441 null
2025-05-07 OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models Xiaoyu Xu et.al. 2505.04416 null
2025-05-07 YABLoCo: Yet Another Benchmark for Long Context Code Generation Aidar Valeev et.al. 2505.04406 null
2025-05-07 Large Means Left: Political Bias in Large Language Models Increases with Their Number of Parameters David Exler et.al. 2505.04393 null

2025-05-06

Publish Date Title Authors PDF Code
2025-05-06 WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch Zimu Lu et.al. 2505.03733 null
2025-05-06 Graph Drawing for LLMs: An Empirical Evaluation Walter Didimo et.al. 2505.03678 null
2025-05-06 PhysLLM: Harnessing Large Language Models for Cross-Modal Remote Physiological Sensing Yiping Xie et.al. 2505.03621 null
2025-05-06 From Pixels to Polygons: A Survey of Deep Learning Approaches for Medical Image-to-Mesh Reconstruction Fengming Lin et.al. 2505.03599 null
2025-05-06 LlamaFirewall: An open source guardrail system for building secure AI agents Sahana Chennabasappa et.al. 2505.03574 null
2025-05-06 Say It Another Way: A Framework for User-Grounded Paraphrasing Cléa Chataigner et.al. 2505.03563 null
2025-05-06 Real-Time Person Image Synthesis Using a Flow Matching Model Jiwoo Jeong et.al. 2505.03562 null
2025-05-06 A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges Feibo Jiang et.al. 2505.03556 null
2025-05-06 A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning Kolawole E. Ogunsina et.al. 2505.03553 null
2025-05-06 STORY2GAME: Generating (Almost) Everything in an Interactive Fiction Game Eric Zhou et.al. 2505.03547 null
2025-05-06 Faster MoE LLM Inference for Extremely Large Models Haoqi Yang et.al. 2505.03531 null
2025-05-06 Causal Intervention Framework for Variational Auto Encoder Mechanistic Interpretability Dip Roy et.al. 2505.03530 null
2025-05-06 Ruled by the Representation Space: On the University's Embrace of Large Language Models Katia Schwerzmann et.al. 2505.03513 null
2025-05-06 Modality-Guided Dynamic Graph Fusion and Temporal Diffusion for Self-Supervised RGB-T Tracking Shenglan Li et.al. 2505.03507 **[link](https://github.com/lishenglana/gdstrack)**
2025-05-06 BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models Zihan Wang et.al. 2505.03501 null
2025-05-06 Augmenting Human Cognition through Everyday AR Xiaoan Liu et.al. 2505.03492 null
2025-05-06 A new membership inference attack that spots memorization in generative and predictive models: Loss-Based with Reference Model algorithm (LBRM) Faiz Taleb et.al. 2505.03490 null
2025-05-06 am-ELO: A Stable Framework for Arena-based LLM Evaluation Zirui Liu et.al. 2505.03475 null
2025-05-06 Evaluation of LLMs on Long-tail Entity Linking in Historical Documents Marta Boscariol et.al. 2505.03473 null
2025-05-06 Uncertainty-Aware Large Language Models for Explainable Disease Diagnosis Shuang Zhou et.al. 2505.03467 null
2025-05-06 Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation Gerard Pons et.al. 2505.02737 null

2025-05-05

Publish Date Title Authors PDF Code
2025-05-05 Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation Lu Ling et.al. 2505.02836 null
2025-05-05 AutoLibra: Agent Metric Induction from Open-Ended Feedback Hao Zhu et.al. 2505.02820 null
2025-05-05 ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations Dmitriy Shopkhoev et.al. 2505.02819 null
2025-05-05 Knowing You Don't Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing Diji Yang et.al. 2505.02811 null
2025-05-05 Towards Quantifying the Hessian Structure of Neural Networks Zhaorui Dong et.al. 2505.02809 null
2025-05-05 Generating HomeAssistant Automations Using an LLM-based Chatbot Mathyas Giudici et.al. 2505.02802 null
2025-05-05 HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models Zheng Lin et.al. 2505.02795 null
2025-05-05 Giving Simulated Cells a Voice: Evolving Prompt-to-Intervention Models for Cellular Control Nam H. Le et.al. 2505.02766 null
2025-05-05 Bye-bye, Bluebook? Automating Legal Procedure with Large Language Models Matthew Dahl et.al. 2505.02763 null
2025-05-05 FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models Zhouliang Yu et.al. 2505.02735 null
2025-05-05 Enhancing LLMs' Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry Junu Kim et.al. 2505.02722 null
2025-05-05 Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play Yemin Shi et.al. 2505.02707 null
2025-05-05 Structure Causal Models and LLMs Integration in Medical Visual Question Answering Zibo Xu et.al. 2505.02703 null
2025-05-05 Exploring LLM-Powered Role and Action-Switching Pedagogical Agents for History Education in Virtual Reality Zihao Zhu et.al. 2505.02699 null
2025-05-05 AI Standardized Patient Improves Human Conversations in Advanced Cancer Care Kurtis Haut et.al. 2505.02694 null
2025-05-05 Predicting Movie Hits Before They Happen with LLMs Shaghayegh Agah et.al. 2505.02693 null
2025-05-05 Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models Xiaobao Wu et.al. 2505.02686 null
2025-05-05 A Survey on Progress in LLM Alignment from the Perspective of Reward Design Miaomiao Ji et.al. 2505.02666 null
2025-05-05 A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law Qianjun Pan et.al. 2505.02665 null

2025-05-02

Publish Date Title Authors PDF Code
2025-05-02 GENMO: A GENeralist Model for Human MOtion Jiefeng Li et.al. 2505.01425 null
2025-05-02 Provable Efficiency of Guidance in Diffusion Models for General Data Distribution Gen Li et.al. 2505.01382 null
2025-05-02 TRAVELER: A Benchmark for Evaluating Temporal Reasoning across Vague, Implicit and Explicit References Svenja Kenneweg et.al. 2505.01325 null
2025-05-02 Helping Big Language Models Protect Themselves: An Enhanced Filtering and Summarization System Sheikh Samit Muhaimin et.al. 2505.01315 null
2025-05-02 Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments Regan Bolton et.al. 2505.01307 null
2025-05-02 ViSA-Flow: Accelerating Robot Skill Learning via Large-Scale Video Semantic Action Flow Changhe Chen et.al. 2505.01288 null
2025-05-02 Scoring-Assisted Generative Exploration for Proteins (SAGE-Prot): A Framework for Multi-Objective Protein Optimization via Iterative Sequence Generation and Evaluation Hocheol Lim et.al. 2505.01277 null
2025-05-02 FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing Gaoxiang Cong et.al. 2505.01263 null
2025-05-02 Enhancing Obsolescence Forecasting with Deep Generative Data Augmentation: A Semi-Supervised Framework for Low-Data Industrial Applications Elie Saad et.al. 2505.01261 null
2025-05-02 Digital Pathway Curation (DPC): a comparative pipeline to assess the reproducibility, consensus and accuracy across Gemini, PubMed, and scientific reviewers in biomedical research Flavio Lichtenstein et.al. 2505.01259 null
2025-05-02 EvalxNLP: A Framework for Benchmarking Post-Hoc Explainability Methods on NLP Models Mahdi Dhaini et.al. 2505.01238 null
2025-05-02 A Combinatorial Proof of Universal Optimality for Computing a Planar Convex Hull Ivor van der Hoog et.al. 2505.01194 null
2025-05-02 LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures Francisco Aguilera-Martínez et.al. 2505.01177 null
2025-05-02 On the Limitations of Steering in Language Model Alignment Chebrolu Niranjan et.al. 2505.01162 null
2025-05-02 Methodological Foundations for AI-Driven Survey Question Generation Ted K. Mburu et.al. 2505.01150 null
2025-05-02 Retrieval-Augmented Generation in Biomedicine: A Survey of Technologies, Datasets, and Clinical Applications Jiawei He et.al. 2505.01146 null
2025-05-02 Evaluating the Impact of Data Cleaning on the Quality of Generated Pull Request Descriptions Kutay Tire et.al. 2505.01120 null
2025-05-02 Incorporating Inductive Biases to Energy-based Generative Models Yukun Li et.al. 2505.01111 null
2025-05-02 MateICL: Mitigating Attention Dispersion in Large-Scale In-Context Learning Murtadha Ahmed et.al. 2505.01110 null
2025-05-02 Any-to-Any Vision-Language Model for Multimodal X-ray Imaging and Radiological Report Generation Daniele Molino et.al. 2505.01091 null

2025-05-01

Publish Date Title Authors PDF Code
2025-05-01 T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Dongzhi Jiang et.al. 2505.00703 null
2025-05-01 GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution Aditya Arora et.al. 2505.00687 null
2025-05-01 MINERVA: Evaluating Complex Video Reasoning Arsha Nagrani et.al. 2505.00681 null
2025-05-01 Steering Large Language Models with Register Analysis for Arbitrary Style Transfer Xinchen Yang et.al. 2505.00679 null
2025-05-01 Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions Yiming Du et.al. 2505.00675 null
2025-05-01 DeepCritic: Deliberate Critique with Large Language Models Wenkai Yang et.al. 2505.00662 null
2025-05-01 Large Language Models Understanding: an Inherent Ambiguity Barrier Daniel N. Nissani et.al. 2505.00654 null
2025-05-01 Open-Source LLM-Driven Federated Transformer for Predictive IoV Management Yazan Otoum et.al. 2505.00651 null
2025-05-01 Investigating Task Arithmetic for Zero-Shot Information Retrieval Marco Braga et.al. 2505.00649 null
2025-05-01 The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them) Zihao Wang et.al. 2505.00626 null
2025-05-01 FineScope : Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation Chaitali Bhattacharyya et.al. 2505.00624 null
2025-05-01 Combining LLMs with Logic-Based Framework to Explain MCTS Ziyan An et.al. 2505.00610 null
2025-05-01 Can LLMs Help Improve Analogical Reasoning For Strategic Decisions? Experimental Evidence from Humans and GPT-4 Phanish Puranam et.al. 2505.00603 null
2025-05-01 Block Circulant Adapter for Large Language Models Xinyu Ding et.al. 2505.00582 null
2025-05-01 FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension Jushi Kai et.al. 2505.00570 null
2025-05-01 Triggering Hallucinations in LLMs: A Quantitative Study of Prompt-Induced Hallucination in Large Language Models Makoto Sato et.al. 2505.00557 null
2025-05-01 Leveraging Partial SMILES Validation Scheme for Enhanced Drug Design in Reinforcement Learning Frameworks Xinyu Wang et.al. 2505.00530 null
2025-05-01 HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection Deanna Emery et.al. 2505.00506 null
2025-05-01 UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces Alaa Saleh et.al. 2505.00472 null
2025-05-01 A General Model for Linearly Polarized Optical Vector Beams Jonathan Nichols et.al. 2505.00471 null

2025-04-30

Publish Date Title Authors PDF Code
2025-04-30 ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction Qihao Liu et.al. 2504.21855 null
2025-04-30 TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments Sichang Tu et.al. 2504.21851 null
2025-04-30 3D Stylization via Large Reconstruction Model Ipek Oztas et.al. 2504.21836 null
2025-04-30 From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems Huan Zhang et.al. 2504.21815 null
2025-04-30 Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields Yixin Gao et.al. 2504.21814 null
2025-04-30 An Empirical Study on the Effectiveness of Large Language Models for Binary Code Understanding Xiuwei Shang et.al. 2504.21803 null
2025-04-30 MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness Junsheng Huang et.al. 2504.21773 null
2025-04-30 Anatomical Similarity as a New Metric to Evaluate Brain Generative Models Bahram Jafrasteh et.al. 2504.21771 null
2025-04-30 LASHED: LLMs And Static Hardware Analysis for Early Detection of RTL Bugs Baleegh Ahmad et.al. 2504.21770 null
2025-04-30 LLM-based Interactive Imitation Learning for Robotic Manipulation Jonas Werner et.al. 2504.21769 null
2025-04-30 CodeFlowBench: A Multi-turn, Iterative Benchmark for Complex Code Generation Sizhe Wang et.al. 2504.21751 null
2025-04-30 TheraQuest: A Gamified, LLM-Powered Simulation for Massage Therapy Training Shengqian Wang et.al. 2504.21735 null
2025-04-30 LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics Marc Glocker et.al. 2504.21716 null
2025-04-30 XBreaking: Explainable Artificial Intelligence for Jailbreaking LLMs Marco Arazzi et.al. 2504.21700 null
2025-04-30 Hoist with His Own Petard: Inducing Guardrails to Facilitate Denial-of-Service Attacks on Retrieval-Augmented Generation of LLMs Pan Suo et.al. 2504.21680 null
2025-04-30 Traceback of Poisoning Attacks to Retrieval-Augmented Generation Baolei Zhang et.al. 2504.21668 null
2025-04-30 Meeseeks: An Iterative Benchmark Evaluating LLMs Multi-Turn Instruction-Following Ability Jiaming Wang et.al. 2504.21625 null
2025-04-30 RDF-Based Structured Quality Assessment Representation of Multilingual LLM Evaluations Jonas Gwozdz et.al. 2504.21605 null
2025-04-30 Leveraging Pre-trained Large Language Models with Refined Prompting for Online Task and Motion Planning Huihui Guo et.al. 2504.21596 null
2025-04-30 DNB-AI-Project at SemEval-2025 Task 5: An LLM-Ensemble Approach for Automated Subject Indexing Lisa Kluge et.al. 2504.21589 null
2025-04-30 End-to-end Audio Deepfake Detection from RAW Waveforms: a RawNet-Based Approach with Cross-Dataset Evaluation Andrea Di Pierno et.al. 2504.20923 **[link](https://github.com/adipiz99/RawNetLite)**

2025-04-29

Publish Date Title Authors PDF Code
2025-04-29 YoChameleon: Personalized Vision and Language Generation Thao Nguyen et.al. 2504.20998 null
2025-04-29 Toward Efficient Exploration by Large Language Model Agents Dilip Arumugam et.al. 2504.20997 null
2025-04-29 X-Fusion: Introducing New Modality to Frozen Large Language Models Sicheng Mo et.al. 2504.20996 null
2025-04-29 TesserAct: Learning 4D Embodied World Models Haoyu Zhen et.al. 2504.20995 null
2025-04-29 ACE: A Security Architecture for LLM-Integrated App Systems Evan Li et.al. 2504.20984 null
2025-04-29 Jekyll-and-Hyde Tipping Point in an AI's Behavior Neil F. Johnson et.al. 2504.20980 null
2025-04-29 Real-Time Wayfinding Assistant for Blind and Low-Vision Users Dabbrata Das et.al. 2504.20976 null
2025-04-29 SetKE: Knowledge Editing for Knowledge Elements Overlap Yifan Wei et.al. 2504.20972 null
2025-04-29 AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security Zikui Cai et.al. 2504.20965 null
2025-04-29 OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification Shangyu Li et.al. 2504.20964 null
2025-04-29 Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models Maryna Vyshnyvetska et.al. 2504.20951 null
2025-04-29 Trace-of-Thought: Enhanced Arithmetic Problem Solving via Reasoning Distillation From Large to Small Language Models Tyler McDonald et.al. 2504.20946 null
2025-04-29 ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification Ziqing Fan et.al. 2504.20930 null
2025-04-29 DYNAMAX: Dynamic computing for Transformers and Mamba based architectures Miguel Nogales et.al. 2504.20922 null
2025-04-29 An Empirical Study on the Capability of LLMs in Decomposing Bug Reports Zhiyuan Chen et.al. 2504.20911 null
2025-04-29 Evaluating Generative Models for Tabular Data: Novel Metrics and Benchmarking Dayananda Herurkar et.al. 2504.20900 null
2025-04-29 LELANTE: LEveraging LLM for Automated ANdroid TEsting Shamit Fatin et.al. 2504.20896 null
2025-04-29 The Leaderboard Illusion Shivalika Singh et.al. 2504.20879 null
2025-04-29 AI-GenBench: A New Ongoing Benchmark for AI-Generated Image Detection Lorenzo Pellegrini et.al. 2504.20865 null
2025-04-29 LLM-Generated Fake News Induces Truth Decay in News Ecosystem: A Case Study on Neural News Recommendation Beizhe Hu et.al. 2504.20013 null
2025-04-29 From Concept to Practice: an Automated LLM-aided UVM Machine for RTL Verification Junhao Ye et.al. 2504.19959 null
2025-04-29 The Automation Advantage in AI Red Teaming Rob Mulla et.al. 2504.19855 null

2025-04-28

Publish Date Title Authors PDF Code
2025-04-28 AutoJudge: Judge Decoding Without Manual Annotation Roman Garipov et.al. 2504.20039 null
2025-04-28 Better To Ask in English? Evaluating Factual Accuracy of Multilingual LLMs in English and Low-Resource Languages Pritika Rohera et.al. 2504.20022 null
2025-04-28 Modular Machine Learning: An Indispensable Path towards New-Generation Large Language Models Xin Wang et.al. 2504.20020 null
2025-04-28 Applying LLM-Powered Virtual Humans to Child Interviews in Child-Centered Design Linshi Li et.al. 2504.20016 null
2025-04-28 Towards Automated Scoping of AI for Social Good Projects Jacob Emmerson et.al. 2504.20010 null
2025-04-28 Chatbot Arena Meets Nuggets: Towards Explanations and Diagnostics in the Evaluation of LLM Responses Sahel Sharifymoghaddam et.al. 2504.20006 null
2025-04-28 Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom Rishika Sen et.al. 2504.20000 null
2025-04-28 TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons Emre Can Acikgoz et.al. 2504.19982 null
2025-04-28 Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets Adam Younsi et.al. 2504.19981 null
2025-04-28 Securing Agentic AI: A Comprehensive Threat Model and Mitigation Framework for Generative AI Agents Vineeth Sai Narajala et.al. 2504.19956 null
2025-04-28 Enhancing Surgical Documentation through Multimodal Visual-Temporal Transformers and Generative AI Hugo Georgenthum et.al. 2504.19918 null
2025-04-28 Can AI Agents Design and Implement Drug Discovery Pipelines? Khachik Smbatyan et.al. 2504.19912 null
2025-04-28 GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets Mingqian He et.al. 2504.19898 null
2025-04-28 CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition Quynh Phung et.al. 2504.19894 null
2025-04-28 DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images Mamadou Keita et.al. 2504.19876 null
2025-04-28 semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage Ke Hong et.al. 2504.19867 null
2025-04-28 CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback Chenhan Jiang et.al. 2504.19860 null

2025-04-25

Publish Date Title Authors PDF Code
2025-04-25 TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation Gwen Yidou Weng et.al. 2504.18535 null
2025-04-25 Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation Shivam Duggal et.al. 2504.18509 null
2025-04-25 Action-Minimization Meets Generative Modeling: Efficient Transition Path Sampling with the Onsager-Machlup Functional Sanjeev Raja et.al. 2504.18506 null
2025-04-25 Facets, Taxonomies, and Syntheses: Navigating Structured Representations in LLM-Assisted Literature Review Raymond Fok et.al. 2504.18496 null
2025-04-25 Investigating Co-Constructive Behavior of Large Language Models in Explanation Dialogues Leandra Fichtel et.al. 2504.18483 null
2025-04-25 Generative Induction of Dialogue Task Schemas with Streaming Refinement and Simulated Interactions James D. Finch et.al. 2504.18474 null
2025-04-25 PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts Yiming Wang et.al. 2504.18428 null
2025-04-25 Kimi-Audio Technical Report KimiTeam et.al. 2504.18425 null
2025-04-25 LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning Rui Li et.al. 2504.18424 null
2025-04-25 LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection Rajesh Yarra et.al. 2504.18423 null
2025-04-25 BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Hongyu Wang et.al. 2504.18415 null
2025-04-25 Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers Jared Moore et.al. 2504.18412 **[link](https://github.com/jlcmoore/llms-as-therapists)**
2025-04-25 Can Code Outlove Blood? A LLM-based VR Experience to Prompt Reflection on Parental Verbal Abuse Jiaying Fu et.al. 2504.18410 null
2025-04-25 HepatoGEN: Generating Hepatobiliary Phase MRI with Perceptual and Adversarial Models Jens Hooge et.al. 2504.18405 null
2025-04-25 Bridge the Domains: Large Language Models Enhanced Cross-domain Sequential Recommendation Qidong Liu et.al. 2504.18383 null
2025-04-25 Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant Lei Shen et.al. 2504.18373 null
2025-04-25 ThreMoLIA: Threat Modeling of Large Language Model-Integrated Applications Felix Viktor Jedrzejewski et.al. 2504.18369 null
2025-04-25 Enhanced Sampling, Public Dataset and Generative Model for Drug-Protein Dissociation Dynamics Maodong Li et.al. 2504.18367 null
2025-04-25 Revisiting Data Auditing in Large Vision-Language Models Hongyu Zhu et.al. 2504.18349 null
2025-04-25 Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review Toghrul Abbasli et.al. 2504.18346 null
2025-04-25 DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training Xiaoyu Tian et.al. 2504.17565 null

2025-04-24

Publish Date Title Authors PDF Code
2025-04-24 Replay to Remember: Retaining Domain Knowledge in Streaming Language Models Sneh Pillai et.al. 2504.17780 null
2025-04-24 The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs Piotr Nawrot et.al. 2504.17768 null
2025-04-24 Step1X-Edit: A Practical Framework for General Image Editing Shiyu Liu et.al. 2504.17761 null
2025-04-24 Towards Robust LLMs: an Adversarial Robustness Measurement Framework Natan Levy et.al. 2504.17723 null
2025-04-24 Multilingual Performance Biases of Large Language Models in Education Vansh Gupta et.al. 2504.17720 null
2025-04-24 Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks Haru-Tada Sato et.al. 2504.17685 null
2025-04-24 INSIGHT: Bridging the Student-Teacher Gap in Times of Large Language Models Jarne Thys et.al. 2504.17677 null
2025-04-24 Energy Considerations of Large Language Model Inference and Efficiency Optimizations Jared Fernandez et.al. 2504.17674 null
2025-04-24 Cross-region Model Training with Communication-Computation Overlapping and Delay Compensation Ying Zhu et.al. 2504.17672 null
2025-04-24 DiMeR: Disentangled Mesh Reconstruction Model Lutao Jiang et.al. 2504.17670 null
2025-04-24 Towards a HIPAA Compliant Agentic AI System in Healthcare Subash Neupane et.al. 2504.17669 null
2025-04-24 Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics Zena Al-Khalili et.al. 2504.17665 null
2025-04-24 Likelihood-Free Variational Autoencoders Chen Xu et.al. 2504.17622 null
2025-04-24 L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference Qingyuan Liu et.al. 2504.17584 null
2025-04-24 HalluLens: LLM Hallucination Benchmark Yejin Bang et.al. 2504.17550 null
2025-04-24 A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task Jiaqi Deng et.al. 2504.17547 null
2025-04-24 Auditing the Ethical Logic of Generative AI Models W. Russell Neuman et.al. 2504.17544 null
2025-04-24 Large Language Model-Driven Concolic Execution for Highly Structured Test Input Generation Haoxin Tu et.al. 2504.17542 null
2025-04-24 Towards Machine-Generated Code for the Resolution of User Intentions Justus Flerlage et.al. 2504.17531 null

2025-04-23

Publish Date Title Authors PDF Code
2025-04-23 Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light Ali Hassani et.al. 2504.16922 null
2025-04-23 IberBench: LLM Evaluation on Iberian Languages José Ángel González et.al. 2504.16921 null
2025-04-23 OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents Raghav Thind et.al. 2504.16918 null
2025-04-23 DreamO: A Unified Framework for Image Customization Chong Mou et.al. 2504.16915 null
2025-04-23 Tracing Thought: Using Chain-of-Thought Reasoning to Identify the LLM Behind AI-Generated Text Shifali Agrahari et.al. 2504.16913 null
2025-04-23 BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation Ruotong Wang et.al. 2504.16907 null
2025-04-23 Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials Peichen Zhong et.al. 2504.16893 null
2025-04-23 Do Large Language Models know who did what to whom? Joseph M. Denning et.al. 2504.16884 null
2025-04-23 Enhancing Critical Thinking with AI: A Tailored Warning System for RAG Models Xuyang Zhu et.al. 2504.16883 null
2025-04-23 Context-Enhanced Vulnerability Detection Based on Large Language Model Yixin Yang et.al. 2504.16877 null
2025-04-23 Exploring How LLMs Capture and Represent Domain-Specific Knowledge Mirian Hipolito Garcia et.al. 2504.16871 null
2025-04-23 Planning with Diffusion Models for Target-Oriented Dialogue Systems Hanwen Du et.al. 2504.16858 null
2025-04-23 Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification Alexander Shvets et.al. 2504.16856 null
2025-04-23 Monte Carlo Planning with Large Language Model for Text-Based Game Agents Zijing Shi et.al. 2504.16855 null
2025-04-23 Improving Significant Wave Height Prediction Using Chronos Models Yilin Zhai et.al. 2504.16834 null
2025-04-23 LRASGen: LLM-based RESTful API Specification Generation Sida Deng et.al. 2504.16833 null
2025-04-23 GreenMind: A Next-Generation Vietnamese Large Language Model for Structured and Logical Reasoning Luu Quy Tung et.al. 2504.16832 null
2025-04-23 Process Reward Models That Think Muhammad Khalifa et.al. 2504.16828 null
2025-04-23 LLM-assisted Graph-RAG Information Extraction from IFC Data Sima Iranmanesh et.al. 2504.16813 null
2025-04-23 Decoupled Global-Local Alignment for Improving Compositional Understanding Xiaoxing Hu et.al. 2504.16801 null
2025-04-23 CAPO: Cost-Aware Prompt Optimization Tom Zehle et.al. 2504.16005 **[link](https://github.com/finitearth/capo)**
2025-04-23 From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs Yaxiong Wu et.al. 2504.15965 null
2025-04-23 Language Models to Support Multi-Label Classification of Industrial Data Waleed Abdeen et.al. 2504.15922 null

2025-04-22

Publish Date Title Authors PDF Code
2025-04-22 TTRL: Test-Time Reinforcement Learning Yuxin Zuo et.al. 2504.16084 null
2025-04-22 MR. Video: "MapReduce" is the Principle for Long Video Understanding Ziqi Pang et.al. 2504.16082 null
2025-04-22 LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities Thomas Schmied et.al. 2504.16078 null
2025-04-22 PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models Shi Qiu et.al. 2504.16074 null
2025-04-22 Boosting Generative Image Modeling via Joint Image-Feature Synthesis Theodoros Kouzelis et.al. 2504.16064 null
2025-04-22 Automated Static Vulnerability Detection via a Holistic Neuro-symbolic Approach Penghui Li et.al. 2504.16057 null
2025-04-22 Honey, I Shrunk the Language Model: Impact of Knowledge Distillation Methods on Performance and Explainability Daniel Hendriks et.al. 2504.16056 null
2025-04-22 Certified Mitigation of Worst-Case LLM Copyright Infringement Jingyu Zhang et.al. 2504.16046 null
2025-04-22 LLMs meet Federated Learning for Scalable and Secure IoT Management Yazan Otoum et.al. 2504.16032 null
2025-04-22 LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale Joya Chen et.al. 2504.16030 null
2025-04-22 Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3 Ahmed R. Sadik et.al. 2504.16027 null
2025-04-22 Token-Aware Coding Flow: A Study with Nano Surge in Reasoning Model Junwei Hu et.al. 2504.15989 null
2025-04-22 Deep learning of point processes for modeling high-frequency data Yoshihiro Gyotoku et.al. 2504.15944 null
2025-04-22 FairTranslate: An English-French Dataset for Gender Bias Evaluation in Machine Translation by Overcoming Gender Binarity Fanny Jourdan et.al. 2504.15941 **[link](https://github.com/fanny-jourdan/FairTranslate)**
2025-04-22 Low-Rank Adaptation of Neural Fields Anh Truong et.al. 2504.15933 null
2025-04-22 StreamRL: Scalable, Heterogeneous, and Elastic RL for LLMs with Disaggregated Stream Generation Yinmin Zhong et.al. 2504.15930 null
2025-04-22 ViSMaP: Unsupervised Hour-long Video Summarisation by Meta-Prompting Jian Hu et.al. 2504.15921 null
2025-04-22 Synergistic Weak-Strong Collaboration by Aligning Preferences Yizhu Jiao et.al. 2504.15188 null

2025-04-21

Publish Date Title Authors PDF Code
2025-04-21 Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning Jie Cheng et.al. 2504.15275 **[link](https://github.com/cjreinforce/pure)**
2025-04-21 Interpretable Locomotion Prediction in Construction Using a Memory-Driven LLM Agent With Chain-of-Thought Reasoning Ehsan Ahmadi et.al. 2504.15263 null
2025-04-21 CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation Anirudh Khatry et.al. 2504.15254 **[link](https://github.com/anirudhkhatry/crust-bench)**
2025-04-21 Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators Yilun Zhou et.al. 2504.15253 **[link](https://github.com/salesforceairesearch/jetts-benchmark)**
2025-04-21 MR. Guard: Multilingual Reasoning Guardrail using Curriculum Learning Yahan Yang et.al. 2504.15241 null
2025-04-21 A Self-Improving Coding Agent Maxime Robeyns et.al. 2504.15228 null
2025-04-21 EvalAgent: Discovering Implicit Evaluation Criteria from the Web Manya Wadhwa et.al. 2504.15219 null
2025-04-21 DRAGON: Distributional Rewards Optimize Diffusion Generative Models Yatong Bai et.al. 2504.15217 null
2025-04-21 Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs Marina Sakharova et.al. 2504.15210 null
2025-04-21 Compute-Optimal LLMs Provably Generalize Better With Scale Marc Finzi et.al. 2504.15208 null
2025-04-21 Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges Nandan Thakur et.al. 2504.15205 null
2025-04-21 Zero-Shot, But at What Cost? Unveiling the Hidden Overhead of MILS's LLM-CLIP Framework for Image Captioning Yassir Benhammou et.al. 2504.15199 null
2025-04-21 Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform Xianpan Zhou et.al. 2504.15182 null
2025-04-21 The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks Joan C. Timoneda et.al. 2504.15160 null
2025-04-21 EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models Ziwen Xu et.al. 2504.15133 **[link](https://github.com/zjunlp/easyedit)**
2025-04-21 Kuwain 1.5B: An Arabic SLM via Language Injection Khalil Hennara et.al. 2504.15120 null
2025-04-21 Rethinking the Potential of Multimodality in Collaborative Problem Solving Diagnosis with Large Language Models K. Wong et.al. 2504.15093 null
2025-04-21 Empowering AI to Generate Better AI Code: Guided Generation of Deep Learning Projects with LLMs Chen Xie et.al. 2504.15080 null
2025-04-21 Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL Simone Papicchio et.al. 2504.15077 null

2025-04-18

Publish Date Title Authors PDF Code
2025-04-18 Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Yang Yue et.al. 2504.13837 null
2025-04-18 Science Hierarchography: Hierarchical Organization of Science Literature Muhan Gao et.al. 2504.13834 null
2025-04-18 Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning Yixuan Even Xu et.al. 2504.13818 null
2025-04-18 Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations Chenghao Xiao et.al. 2504.13816 null
2025-04-18 BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models Zhengxian Wu et.al. 2504.13775 null
2025-04-18 DP2Unlearning: An Efficient and Guaranteed Unlearning Framework for LLMs Tamim Al Mahmud et.al. 2504.13774 null
2025-04-18 Detecting Malicious Source Code in PyPI Packages with LLMs: Does RAG Come in Handy? Motunrayo Ibiyo et.al. 2504.13769 null
2025-04-18 ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis Andrea Rigo et.al. 2504.13745 null
2025-04-18 Controlled Territory and Conflict Tracking (CONTACT): (Geo-)Mapping Occupied Territory from Open Source Intelligence Paul K. Mandal et.al. 2504.13730 null
2025-04-18 MLEP: Multi-granularity Local Entropy Patterns for Universal AI-generated Image Detection Lin Yuan et.al. 2504.13726 null
2025-04-18 OpenDeception: Benchmarking and Investigating AI Deceptive Behaviors via Open-ended Interaction Simulation Yichen Wu et.al. 2504.13707 null
2025-04-18 Exploring Multimodal Prompt for Visualization Authoring with Large Language Models Zhen Wen et.al. 2504.13700 null
2025-04-18 Intelligent Interaction Strategies for Context-Aware Cognitive Augmentation Xiangrong et.al. 2504.13684 null
2025-04-18 Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results Andrea Santilli et.al. 2504.13677 null
2025-04-18 Large Language Models Will Change The Way Children Think About Technology And Impact Every Interaction Paradigm Russell Beale et.al. 2504.13667 null
2025-04-18 Do Prompt Patterns Affect Code Quality? A First Empirical Assessment of ChatGPT-Generated Code Antonio Della Porta et.al. 2504.13656 null
2025-04-18 Exploring the Potential for Large Language Models to Demonstrate Rational Probabilistic Beliefs Gabriel Freedman et.al. 2504.13644 null
2025-04-18 Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling Shaomu Tan et.al. 2504.13630 null
2025-04-18 Divergent LLM Adoption and Heterogeneous Convergence Paths in Research Writing Cong William Lin et.al. 2504.13629 null
2025-04-18 Visual Intention Grounding for Egocentric Assistants Pengzhan Sun et.al. 2504.13621 null
2025-04-18 SkyReels-V2: Infinite-length Film Generative Model Guibin Chen et.al. 2504.13074 null

2025-04-17

Publish Date Title Authors PDF Code
2025-04-17 Aligning Constraint Generation with Design Intent in Parametric CAD Evan Casey et.al. 2504.13178 null
2025-04-17 SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs Haoxuan Li et.al. 2504.13172 null
2025-04-17 Sleep-time Compute: Beyond Inference Scaling at Test-time Kevin Lin et.al. 2504.13171 **[link](https://github.com/letta-ai/sleep-time-compute)**
2025-04-17 Exploring Expert Failures Improves LLM Agent Tuning Li-Cheng Lan et.al. 2504.13145 null
2025-04-17 Energy-Based Reward Models for Robust Language Model Alignment Anamika Lochab et.al. 2504.13134 null
2025-04-17 Science-T2I: Addressing Scientific Illusions in Image Synthesis Jialuo Li et.al. 2504.13129 null
2025-04-17 LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard Varun Rao et.al. 2504.13125 null
2025-04-17 VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models Haojian Huang et.al. 2504.13122 **[link](https://github.com/haroldchen19/vistadpo)**
2025-04-17 UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models Guanlong Jiao et.al. 2504.13109 null
2025-04-17 EventVAD: Training-Free Event-Aware Video Anomaly Detection Yihua Shao et.al. 2504.13092 null
2025-04-17 Retrieval-Augmented Generation with Conflicting Evidence Han Wang et.al. 2504.13079 **[link](https://github.com/hannight/ramdocs)**
2025-04-17 An All-Atom Generative Model for Designing Protein Complexes Ruizhe Chen et.al. 2504.13075 null
2025-04-17 Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models Sudesh Ramesh Bhagat et.al. 2504.13068 null
2025-04-17 ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation Models Linkang Du et.al. 2504.13061 **[link](https://github.com/jozenn/artistauditor)**
2025-04-17 GraphAttack: Exploiting Representational Blindspots in LLM Safety Mechanisms Sinan He et.al. 2504.13052 null
2025-04-17 Design Topological Materials by Reinforcement Fine-Tuned Generative Model Haosheng Xu et.al. 2504.13048 null
2025-04-17 How Large Language Models Are Changing MOOC Essay Answers: A Comparison of Pre- and Post-LLM Responses Leo Leppänen et.al. 2504.13038 null
2025-04-17 InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning Zheng Wang et.al. 2504.13032 null
2025-04-17 ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images Sangwook Kim et.al. 2504.13023 null

2025-04-16

Publish Date Title Authors PDF Code
2025-04-16 BitNet b1.58 2B4T Technical Report Shuming Ma et.al. 2504.12285 null
2025-04-16 HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks Stefan Abi-Karam et.al. 2504.12268 null
2025-04-16 VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate Zhihang Yuan et.al. 2504.12259 null
2025-04-16 FLIP Reasoning Challenge Andreas Plesner et.al. 2504.12256 null
2025-04-16 AnomalyGen: An Automated Semantic Log Sequence Generation Framework with LLM for Anomaly Detection Xinyu Li et.al. 2504.12250 null
2025-04-16 MOS: Towards Effective Smart Contract Vulnerability Detection through Mixture-of-Experts Tuning of Large Language Models Hang Yuan et.al. 2504.12234 null
2025-04-16 Watermarking Needs Input Repetition Masking David Khachaturov et.al. 2504.12229 null
2025-04-16 Coding-Prior Guided Diffusion Network for Video Deblurring Yike Liu et.al. 2504.12222 null
2025-04-16 d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning Siyan Zhao et.al. 2504.12216 null
2025-04-16 From Requirements to Architecture: Semi-Automatically Generating Software Architectures Tobias Eisenreich et.al. 2504.12192 null
2025-04-16 What Do Large Language Models Know? Tacit Knowledge as a Potential Causal-Explanatory Structure Céline Budding et.al. 2504.12187 null
2025-04-16 SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data Suyoung Bae et.al. 2504.12185 null
2025-04-16 Deep Generative Models for Bayesian Inference on High-Rate Sensor Data: Applications in Automotive Radar and Medical Imaging Tristan S. W. Stevens et.al. 2504.12154 null
2025-04-16 ARCeR: an Agentic RAG for the Automated Definition of Cyber Ranges Matteo Lupinacci et.al. 2504.12143 null
2025-04-16 Multilingual Contextualization of Large Language Models for Document-Level Machine Translation Miguel Moura Ramos et.al. 2504.12140 null
2025-04-16 Clarifying Ambiguities: on the Role of Ambiguity Types in Prompting Methods for Clarification Generation Anfu Tang et.al. 2504.12113 null
2025-04-16 Towards LLM Agents for Earth Observation Chia Hsiang Kao et.al. 2504.12110 null
2025-04-16 Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation Shizhan Cai et.al. 2504.12108 null
2025-04-16 Gauging Overprecision in LLMs: An Empirical Study Adil Bahaj et.al. 2504.12098 null
2025-04-16 Reasoning-Based AI for Startup Evaluation (R.A.I.S.E.): A Memory-Augmented, Multi-Step Decision Framework Jack Preuveneers et.al. 2504.12090 null
2025-04-16 Elucidating the Design Space of Multimodal Protein Language Models Cheng-Yen Hsieh et.al. 2504.11454 null

2025-04-15

Publish Date Title Authors PDF Code
2025-04-15 Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception Ziqi Pang et.al. 2504.11457 null
2025-04-15 DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning Zhiwei He et.al. 2504.11456 null
2025-04-15 TextArena Leon Guertler et.al. 2504.11442 null
2025-04-15 Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models Maria Teleki et.al. 2504.11431 null
2025-04-15 A Dual-Space Framework for General Knowledge Distillation of Large Language Models Xue Zhang et.al. 2504.11426 null
2025-04-15 Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts Quanyu Long et.al. 2504.11420 null
2025-04-15 Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning Ali Taghibakhshi et.al. 2504.11409 null
2025-04-15 RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models Juan Diego Rodriguez et.al. 2504.11381 null
2025-04-15 Ring Artifacts Correction Based on Global-Local Features Interaction Guidance in the Projection Domain Yunze Liu et.al. 2504.11375 null
2025-04-15 Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions Wang Bill Zhu et.al. 2504.11373 null
2025-04-15 DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks Yupei Liu et.al. 2504.11358 null
2025-04-15 A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce Wei Xiong et.al. 2504.11343 null
2025-04-15 Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints Ruicheng Ao et.al. 2504.11320 null
2025-04-15 Learning to Be A Doctor: Searching for Effective Medical Agent Architectures Yangyang Zhuang et.al. 2504.11301 null
2025-04-15 The Obvious Invisible Threat: LLM-Powered GUI Agents' Vulnerability to Fine-Print Injections Chaoran Chen et.al. 2504.11281 null
2025-04-15 From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs Guocong Li et.al. 2504.11277 null
2025-04-15 Towards Automated Safety Requirements Derivation Using Agent-based RAG Balahari Vignesh Balu et.al. 2504.11243 null
2025-04-15 Nondeterministic Polynomial-time Problem Challenge: An Ever-Scaling Reasoning Benchmark for LLMs Chang Yang et.al. 2504.11239 null
2025-04-15 AutoRAN: Automated and Zero-Touch Open RAN Systems Stefano Maxenti et.al. 2504.11233 null

2025-04-14

Publish Date Title Authors PDF Code
2025-04-14 xVerify: Efficient Answer Verifier for Reasoning Model Evaluations Ding Chen et.al. 2504.10481 null
2025-04-14 InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Jinguo Zhu et.al. 2504.10479 null
2025-04-14 Art3D: Training-Free 3D Generation from Flat-Colored Illustration Xiaoyan Cong et.al. 2504.10466 null
2025-04-14 M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models Junxiong Wang et.al. 2504.10449 null
2025-04-14 Multimodal Long Video Modeling Based on Temporal Dynamic Context Haoran Hao et.al. 2504.10443 null
2025-04-14 Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing Taihang Hu et.al. 2504.10434 null
2025-04-14 LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models Minqian Liu et.al. 2504.10430 null
2025-04-14 Can We Edit LLMs for Long-Tail Biomedical Knowledge? Xinhao Yi et.al. 2504.10421 null
2025-04-14 CliniChat: A Multi-Source Knowledge-Driven Framework for Clinical Interview Dialogue Reconstruction and Evaluation Jing Chen et.al. 2504.10418 null
2025-04-14 LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models Parshin Shojaee et.al. 2504.10415 **[link](https://github.com/deep-symbolic-mathematics/llm-srbench)**
2025-04-14 HUMOTO: A 4D Dataset of Mocap Human Object Interactions Jiaxin Lu et.al. 2504.10414 null
2025-04-14 Performance of Large Language Models in Supporting Medical Diagnosis and Treatment Diogo Sousa et.al. 2504.10405 null
2025-04-14 Can LLMs Assist Expert Elicitation for Probabilistic Causal Modeling? Olha Shaposhnyk et.al. 2504.10397 null
2025-04-14 LLM-driven Constrained Copy Generation through Iterative Refinement Varun Vasudevan et.al. 2504.10391 null
2025-04-14 SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning Yiting Wang et.al. 2504.10369 null
2025-04-14 S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models Wenyuan Zhang et.al. 2504.10368 null
2025-04-14 FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos Rui Chen et.al. 2504.10358 null
2025-04-14 MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages Dieuwke Hupkes et.al. 2504.10356 null
2025-04-14 Forecasting from Clinical Textual Time Series: Adaptations of the Encoder and Decoder Language Model Families Shahriar Noroozizadeh et.al. 2504.10340 null
2025-04-14 Heimdall: test-time scaling on the generative verification Wenlei Shi et.al. 2504.10337 null

2025-04-11

Publish Date Title Authors PDF Code
2025-04-11 Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images Boyang Deng et.al. 2504.08727 null
2025-04-11 DocAgent: A Multi-Agent System for Automated Code Documentation Generation Dayu Yang et.al. 2504.08725 null
2025-04-11 Generating Fine Details of Entity Interactions Xinyi Gu et.al. 2504.08714 null
2025-04-11 Large Language Models as Span Annotators Zdeněk Kasner et.al. 2504.08697 null
2025-04-11 SeaView: Software Engineering Agent Visual Interface for Enhanced Workflow Timothy Bula et.al. 2504.08696 null
2025-04-11 TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning Hang Ni et.al. 2504.08694 null
2025-04-11 Fast-Slow-Thinking: Complex Task Solving with Large Language Models Yiliu Sun et.al. 2504.08690 null
2025-04-11 Voice Interaction With Conversational AI Could Facilitate Thoughtful Reflection and Substantive Revision in Writing Jiho Kim et.al. 2504.08687 null
2025-04-11 Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Team Seawead et.al. 2504.08685 null
2025-04-11 Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning Fangzhi Xu et.al. 2504.08672 null
2025-04-11 Variability-Driven User-Story Generation using LLM and Triadic Concept Analysis Alexandre Bazin et.al. 2504.08666 null
2025-04-11 Safe Flow Matching: Robot Motion Planning with Control Barrier Functions Xiaobing Dai et.al. 2504.08661 null
2025-04-11 Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents Alessio Buscemi et.al. 2504.08640 null
2025-04-11 MooseAgent: A LLM Based Multi-agent Framework for Automating Moose Simulation Tao Zhang et.al. 2504.08621 null
2025-04-11 Analyzing 16,193 LLM Papers for Fun and Profits Zhiqiu Xia et.al. 2504.08619 null
2025-04-11 ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration Yongsheng Yu et.al. 2504.08591 null
2025-04-11 COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails Miguel Espinosa et.al. 2504.08548 null
2025-04-11 Slicing the Gaussian Mixture Wasserstein Distance Moritz Piening et.al. 2504.08544 null
2025-04-11 Task Memory Engine (TME): Enhancing State Awareness for Multi-Step LLM Agent Tasks Ye Ye et.al. 2504.08525 null
2025-04-11 Adopting Large Language Models to Automated System Integration Robin D. Pesl et.al. 2504.08490 null
2025-04-11 Scaling Laws for Native Multimodal Models Mustafa Shukor et.al. 2504.07951 null
2025-04-11 An LLM-Driven Multi-Agent Debate System for Mendelian Diseases Xinyang Zhou et.al. 2504.07881 null
2025-04-11 Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Yichun Yin et.al. 2504.07866 null

2025-04-10

Publish Date Title Authors PDF Code
2025-04-10 C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing Zhongyang Li et.al. 2504.07964 **[link](https://github.com/tianyi-lab/c3po)**
2025-04-10 PixelFlow: Pixel-Space Generative Models with Flow Shoufa Chen et.al. 2504.07963 **[link](https://github.com/shoufachen/pixelflow)**
2025-04-10 VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning Yukun Qi et.al. 2504.07956 null
2025-04-10 Porting an LLM based Application from ChatGPT to an On-Premise Environment Teemu Paloniemi et.al. 2504.07907 null
2025-04-10 Redefining Machine Translation on Social Network Services with Large Language Models Hongcheng Guo et.al. 2504.07901 null
2025-04-10 How do Large Language Models Understand Relevance? A Mechanistic Interpretability Perspective Qi Liu et.al. 2504.07898 null
2025-04-10 DiverseFlow: Sample-Efficient Diverse Mode Coverage in Flows Mashrur M. Morshed et.al. 2504.07894 null
2025-04-10 Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge Riccardo Cantini et.al. 2504.07887 **[link](https://github.com/SCAlabUnical/CLEAR-Bias_LLM_benchmark)**
2025-04-10 Token Level Routing Inference System for Edge Devices Jianshu She et.al. 2504.07878 null
2025-04-10 Robust Hallucination Detection in LLMs via Adaptive Token Selection Mengjia Niu et.al. 2504.07863 null
2025-04-10 Understanding Learner-LLM Chatbot Interactions and the Impact of Prompting Guidelines Cansu Koyuturk et.al. 2504.07840 null
2025-04-10 Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems Simon Lermen et.al. 2504.07831 null
2025-04-10 MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations Genglin Liu et.al. 2504.07830 null
2025-04-10 Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models Hongcheng Guo et.al. 2504.07807 **[link](https://github.com/fighoture/moe_unsupervised_pruning)**
2025-04-10 A System for Comprehensive Assessment of RAG Frameworks Mattia Rengo et.al. 2504.07803 **[link](https://github.com/eustema-s-p-a/scarf)**
2025-04-10 FairEval: Evaluating Fairness in LLM-Based Recommendations with Personality Awareness Chandan Kumar Sah et.al. 2504.07801 null
2025-04-10 Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation Alireza Salemi et.al. 2504.07794 **[link](https://github.com/alirezasalemi7/pr-rag)**

2025-04-09

Publish Date Title Authors PDF Code
2025-04-09 Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning Nikhil Shivakumar Nayak et.al. 2504.07097 null
2025-04-09 OmniCaptioner: One Captioner to Rule Them All Yiting Lu et.al. 2504.07089 null
2025-04-09 KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs Elan Markowitz et.al. 2504.07087 null
2025-04-09 Identifying Unknown Stochastic Dynamics via Finite expression methods Senwei Liang et.al. 2504.07085 null
2025-04-09 DeduCE: Deductive Consistency as a Framework to Evaluate LLM Reasoning Atharva Pandey et.al. 2504.07080 null
2025-04-09 A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models Zhouhang Xie et.al. 2504.07070 null
2025-04-09 HalluciNot: Hallucination Detection Through Context and Common Knowledge Verification Bibek Paudel et.al. 2504.07069 null
2025-04-09 TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling Liang-Hsuan Tseng et.al. 2504.07053 null
2025-04-09 To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning Tian Qin et.al. 2504.07052 null
2025-04-09 Distilling Textual Priors from LLM to Efficient Image Fusion Ran Zhang et.al. 2504.07029 null
2025-04-09 Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety Chad Melton et.al. 2504.07022 null
2025-04-09 LLM-IFT: LLM-Powered Information Flow Tracking for Secure Hardware Nowfel Mashnoor et.al. 2504.07015 null
2025-04-09 Latent Diffusion U-Net Representations Contain Positional Embeddings and Anomalies Jonas Loos et.al. 2504.07008 **[link](https://github.com/JonasLoos/sd-representation-anomalies)**
2025-04-09 Towards LLMs Robustness to Changes in Prompt Format Styles Lilian Ngweta et.al. 2504.06969 null
2025-04-09 Review of Case-Based Reasoning for LLM Agents: Theoretical Foundations, Architectural Components, and Cognitive Integration Kostas Hatalis et.al. 2504.06943 null
2025-04-09 FeedbackEval: A Benchmark for Evaluating Large Language Models in Feedback-Driven Code Repair Tasks Dekun Dai et.al. 2504.06939 null
2025-04-09 The Importance of Being Discrete: Measuring the Impact of Discretization in End-to-End Differentially Private Synthetic Data Georgi Ganev et.al. 2504.06923 null
2025-04-09 Identifying Aspects in Peer Reviews Sheng Lu et.al. 2504.06910 null
2025-04-09 EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation Diljeet Jagpal et.al. 2504.06861 null
2025-04-09 LVC: A Lightweight Compression Framework for Enhancing VLMs in Long Video Understanding Ziyi Wang et.al. 2504.06835 null
2025-04-09 Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups Rijul Magu et.al. 2504.06160 null
2025-04-09 Leanabell-Prover: Posttraining Scaling in Formal Reasoning Jingyuan Zhang et.al. 2504.06122 null
2025-04-09 CAI: An Open, Bug Bounty-Ready Cybersecurity AI Víctor Mayoral-Vilches et.al. 2504.06017 null

2025-04-08

Publish Date Title Authors PDF Code
2025-04-08 GOLLuM: Gaussian Process Optimized LLMs -- Reframing LLM Finetuning through Bayesian Optimization Bojana Ranković et.al. 2504.06265 null
2025-04-08 OmniSVG: A Unified Scalable Vector Graphics Generation Model Yiying Yang et.al. 2504.06263 null
2025-04-08 Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Gleb Rodionov et.al. 2504.06261 null
2025-04-08 FEABench: Evaluating Language Models on Multiphysics Reasoning Ability Nayantara Mudur et.al. 2504.06260 null
2025-04-08 Transfer between Modalities with MetaQueries Xichen Pan et.al. 2504.06256 null
2025-04-08 Electronic Structure Guided Inverse Design Using Generative Models Shuyi Jia et.al. 2504.06249 null
2025-04-08 LExT: Towards Evaluating Trustworthiness of Natural Language Explanations Krithi Shailya et.al. 2504.06227 null
2025-04-08 Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation Biao Zhang et.al. 2504.06225 null
2025-04-08 Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs Dongyang Fan et.al. 2504.06219 null
2025-04-08 From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models Chejian Xu et.al. 2504.06214 null
2025-04-08 TxGemma: Efficient and Agentic LLMs for Therapeutics Eric Wang et.al. 2504.06196 null
2025-04-08 ARLO: A Tailorable Approach for Transforming Natural Language Software Requirements into Architecture using LLMs Tooraj Helmi et.al. 2504.06143 null
2025-04-08 QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform Movina Moses et.al. 2504.06136 null
2025-04-08 FaceCloak: Learning to Protect Face Templates Sudipta Banerjee et.al. 2504.06131 null
2025-04-08 Nonuniform-Tensor-Parallelism: Mitigating GPU failure impact for Scaled-up LLM Training Daiyaan Arfeen et.al. 2504.06095 null
2025-04-08 Multi-Sense Embeddings for Language Models and Knowledge Distillation Qitong Wang et.al. 2504.06036 null
2025-04-08 Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi Monojit Choudhury et.al. 2504.06011 null
2025-04-08 Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG Hengran Zhang et.al. 2504.05220 null

2025-04-07

Publish Date Title Authors PDF Code
2025-04-07 Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations Pedro Ferreira et.al. 2504.05294 null
2025-04-07 The challenge of uncertainty quantification of large language models in medicine Zahra Atf et.al. 2504.05278 null
2025-04-07 Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation Yucheng Chu et.al. 2504.05276 null
2025-04-07 Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models Yang Yan et.al. 2504.05262 null
2025-04-07 How to evaluate control measures for LLM agents? A trajectory from today to superintelligence Tomek Korbak et.al. 2504.05259 null
2025-04-07 Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models Adrián Bazaga et.al. 2504.05258 null
2025-04-07 LLM-based Automated Grading with Human-in-the-Loop Hang Li et.al. 2504.05239 null
2025-04-07 Mapping biodiversity at very-high resolution in Europe César Leblanc et.al. 2504.05231 null
2025-04-07 LLM-Alignment Live-Streaming Recommendation Yueyang Liu et.al. 2504.05217 null
2025-04-07 Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling Hengran Zhang et.al. 2504.05216 null
2025-04-07 Post-Training Language Models for Continual Relation Extraction Sefika Efeoglu et.al. 2504.05214 null
2025-04-07 Quantum Program Linting with LLMs: Emerging Results from a Comparative Study Seung Yeob Shin et.al. 2504.05204 null
2025-04-07 P2Mark: Plug-and-play Parameter-intrinsic Watermarking for Neural Speech Generation Yong Ren et.al. 2504.05197 null
2025-04-07 Concise Reasoning via Reinforcement Learning Mehdi Fatemi et.al. 2504.05185 null
2025-04-07 BRIDGES: Bridging Graph Modality and Large Language Models within EDA Tasks Wei Li et.al. 2504.05180 null
2025-04-07 Learning symmetries in datasets Veronica Sanz et.al. 2504.05174 null
2025-04-07 Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness Dongzhuoran Zhou et.al. 2504.05163 null
2025-04-07 DDPM Score Matching and Distribution Learning Sinho Chewi et.al. 2504.05161 null
2025-04-07 Pr $εε$ mpt: Sanitizing Sensitive Prompts for LLMs Amrita Roy Chowdhury et.al. 2504.05147 null

2025-04-04

Publish Date Title Authors PDF Code
2025-04-04 MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models Wulin Xie et.al. 2504.03641 null
2025-04-04 Do Larger Language Models Imply Better Reasoning? A Pretraining Scaling Law for Reasoning Xinyi Wang et.al. 2504.03635 null
2025-04-04 Enhancing Causal Effect Estimation with Diffusion-Generated Data Li Chen et.al. 2504.03630 null
2025-04-04 Align to Structure: Aligning Large Language Models with Structural Information Zae Myung Kim et.al. 2504.03622 null
2025-04-04 VISTA-OCR: Towards generative and interactive end to end OCR models Laziz Hamdi et.al. 2504.03621 null
2025-04-04 Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task Leonardo Ranaldi et.al. 2504.03616 null
2025-04-04 Autonomous and Self-Adapting System for Synthetic Media Detection and Attribution Aref Azizpour et.al. 2504.03615 null
2025-04-04 AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset Bingxiang He et.al. 2504.03612 null
2025-04-04 APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay Akshara Prabhakar et.al. 2504.03601 null
2025-04-04 EnrichIndex: Using LLMs to Enrich Retrieval Indices Offline Peter Baile Chen et.al. 2504.03598 null
2025-04-04 Hallucination Detection on a Budget: Efficient Bayesian Estimation of Semantic Entropy Kamil Ciosek et.al. 2504.03579 null
2025-04-04 SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement Runnan Fang et.al. 2504.03561 null
2025-04-04 Agentic Knowledgeable Self-awareness Shuofei Qiao et.al. 2504.03553 null
2025-04-04 Diverse In-Context Example Selection After Decomposing Programs and Aligned Utterances Improves Semantic Parsing Mayank Kothyari et.al. 2504.03541 null
2025-04-04 HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration Boyuan Wang et.al. 2504.03536 null
2025-04-04 Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles Chen Wei Kuo et.al. 2504.03520 null
2025-04-04 Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej Shubham Kumar Nigam et.al. 2504.03486 null
2025-04-04 D-Garment: Physics-Conditioned Latent Diffusion for Dynamic Garment Deformations Antoine Dumoulin et.al. 2504.03468 null
2025-04-04 Generating ensembles of spatially-coherent in-situ forecasts using flow matching David Landry et.al. 2504.03463 null
2025-04-04 Conditioning Diffusions Using Malliavin Calculus Jakiw Pidstrigach et.al. 2504.03461 null
2025-04-04 A Survey of Large Language Models in Mental Health Disorder Detection on Social Media Zhuohan Ge et.al. 2504.02800 null
2025-04-04 RBT4DNN: Requirements-based Testing of Neural Networks Nusrat Jahan Mozumder et.al. 2504.02737 null
2025-04-04 Why do LLMs attend to the first token? Federico Barbero et.al. 2504.02732 null

2025-04-03

Publish Date Title Authors PDF Code
2025-04-03 Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models Mateusz Pach et.al. 2504.02821 null
2025-04-03 Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization Kangle Deng et.al. 2504.02817 null
2025-04-03 Generative Evaluation of Complex Reasoning in Large Language Models Haowei Lin et.al. 2504.02810 null
2025-04-03 MegaMath: Pushing the Limits of Open Math Corpora Fan Zhou et.al. 2504.02807 null
2025-04-03 A Framework for Robust Cognitive Evaluation of LLMs Karin de Langis et.al. 2504.02789 null
2025-04-03 From Consumption to Collaboration: Measuring Interaction Patterns to Augment Human Cognition in Open-Ended Tasks Joshua Holstein et.al. 2504.02780 null
2025-04-03 BT-ACTION: A Test-Driven Approach for Modular Understanding of User Instruction Leveraging Behaviour Trees and LLMs Alexander Leszczynski et.al. 2504.02779 null
2025-04-03 MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs Jaap Jumelet et.al. 2504.02768 null
2025-04-03 How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices? Andres Algaba et.al. 2504.02767 null
2025-04-03 Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model Shengjun Zhang et.al. 2504.02764 null
2025-04-03 Echoes of the hidden: Uncovering coordination beyond network structure Shahar Somin et.al. 2504.02757 null
2025-04-03 Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study Aryan Agrawal et.al. 2504.02733 null
2025-04-03 ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization Kehua Feng et.al. 2504.02725 null
2025-04-03 TeleMoM: Consensus-Driven Telecom Intelligence via Mixture of Models Xinquan Wang et.al. 2504.02712 null
2025-04-03 The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context Nikhil Verma et.al. 2504.02708 null
2025-04-03 LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems Zishuo Liu et.al. 2504.02671 null
2025-04-03 Affordable AI Assistants with Knowledge Graph of Thoughts Maciej Besta et.al. 2504.02670 null
2025-04-03 VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step Hanyang Wang et.al. 2504.01956 null
2025-04-03 Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation Baban Gain et.al. 2504.01919 null

2025-04-02

Publish Date Title Authors PDF Code
2025-04-02 The LLM Wears Prada: Analysing Gender Bias and Stereotypes through Online Shopping Data Massimiliano Luca et.al. 2504.01951 null
2025-04-02 OpenCodeReasoning: Advancing Data Distillation for Competitive Coding Wasi Uddin Ahmad et.al. 2504.01943 null
2025-04-02 A Unified Approach to Analysis and Design of Denoising Markov Models Yinuo Ren et.al. 2504.01938 null
2025-04-02 Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length? Celine Lee et.al. 2504.01935 null
2025-04-02 Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection Souradip Chakraborty et.al. 2504.01931 null
2025-04-02 A thorough benchmark of automatic text classification: From traditional approaches to large language models Washington Cunha et.al. 2504.01930 null
2025-04-02 Is the Reversal Curse a Binding Problem? Uncovering Limitations of Transformers from a Basic Generalization Failure Boshi Wang et.al. 2504.01928 null
2025-04-02 Gen-C: Populating Virtual Worlds with Generative Crowds Andreas Panayiotou et.al. 2504.01924 null
2025-04-02 Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning Yinggan Xu et.al. 2504.01911 null
2025-04-02 Build Code Needs Maintenance Too: A Study on Refactoring and Technical Debt in Build Systems Anwar Ghammam et.al. 2504.01907 null
2025-04-02 STAR-1: Safer Alignment of Reasoning LLMs with 1K Data Zijun Wang et.al. 2504.01903 null
2025-04-02 Multi-fidelity Parameter Estimation Using Conditional Diffusion Models Caroline Tatsuoka et.al. 2504.01894 null
2025-04-02 TransientTables: Evaluating LLMs' Reasoning on Temporally Evolving Semi-structured Tables Abhilash Shankarampeta et.al. 2504.01879 null
2025-04-02 Interpreting Emergent Planning in Model-Free Reinforcement Learning Thomas Bush et.al. 2504.01871 null
2025-04-02 From Code Generation to Software Testing: AI Copilot with Context-Based RAG Yuchen Wang et.al. 2504.01866 null
2025-04-02 Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models Zhiwei Yu et.al. 2504.01857 null
2025-04-02 Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks Ali Al-Kaswan et.al. 2504.01850 null
2025-04-02 PaperBench: Evaluating AI's Ability to Replicate AI Research Giulio Starace et.al. 2504.01848 null

2025-03-31

Publish Date Title Authors PDF Code
2025-03-31 Consistent Subject Generation via Contrastive Instantiated Concepts Lee Hsin-Ying et.al. 2503.24387 null
2025-03-31 Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation Shengqiong Wu et.al. 2503.24379 null
2025-03-31 Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models Rui Wang et.al. 2503.24377 **[link](https://github.com/devoallen/awesome-reasoning-economy-papers)**
2025-03-31 Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 Yi Chen et.al. 2503.24376 **[link](https://github.com/tencentarc/seed-bench-r1)**
2025-03-31 Effectively Controlling Reasoning Models through Thinking Intervention Tong Wu et.al. 2503.24370 null
2025-03-31 SQuat: Subspace-orthogonal KV Cache Quantization Hao Wang et.al. 2503.24358 null
2025-03-31 ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion Rana Muhammad Shahroz Khan et.al. 2503.24354 null
2025-03-31 BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models Alok Abhishek et.al. 2503.24310 null
2025-03-31 A Systematic Evaluation of LLM Strategies for Mental Health Text Analysis: Fine-tuning vs. Prompt Engineering vs. RAG Arshia Kermani et.al. 2503.24307 null
2025-03-31 Is analogy enough to draw novel adjective-noun inferences? Hayley Ross et.al. 2503.24293 null
2025-03-31 Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning Jiacheng Lin et.al. 2503.24289 **[link](https://github.com/linjc16/Rec-R1)**
2025-03-31 Evaluating and Designing Sparse Autoencoders by Approximating Quasi-Orthogonality Sewoong Lee et.al. 2503.24277 **[link](https://github.com/sewoonglee/top-afa-sae)**
2025-03-31 Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation Dun Yuan et.al. 2503.24245 null
2025-03-31 What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models Qiyuan Zhang et.al. 2503.24235 null
2025-03-31 Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes Daichi Otsuka et.al. 2503.24229 null
2025-03-31 PAARS: Persona Aligned Agentic Retail Shoppers Saab Mansour et.al. 2503.24228 null
2025-03-31 Synthetic News Generation for Fake News Classification Abdul Sittar et.al. 2503.24206 null
2025-03-31 TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers' Guidance Jingxian Xu et.al. 2503.24198 null
2025-03-31 Text2Tracks: Prompt-based Music Recommendation via Generative Retrieval Enrico Palumbo et.al. 2503.24193 null
2025-03-31 Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms Shuoming Zhang et.al. 2503.24191 null

2025-03-28

Publish Date Title Authors PDF Code
2025-03-28 Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions Mohammad Almansoori et.al. 2503.22678 null
2025-03-28 DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness Ruining Li et.al. 2503.22677 null
2025-03-28 QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks? Belinda Z. Li et.al. 2503.22674 null
2025-03-28 Unicorn: Text-Only Data Synthesis for Vision Language Model Training Xiaomin Yu et.al. 2503.22655 **[link](https://github.com/yu-xm/unicorn)**
2025-03-28 Using AI to Summarize US Presidential Campaign TV Advertisement Videos, 1952-2012 Adam Breuer et.al. 2503.22589 **[link](https://github.com/adambreuer/ai-summarizevid)**
2025-03-28 LLM-enabled Instance Model Generation Fengjunjie Pan et.al. 2503.22587 null
2025-03-28 Historical Ink: Exploring Large Language Models for Irony Detection in 19th-Century Spanish Kevin Cohen et.al. 2503.22585 **[link](https://github.com/historicalink/ironydetection)**
2025-03-28 Drop the Golden Apples: Identifying Third-Party Reuse by DB-Less Software Composition Analysis Lyuye Zhang et.al. 2503.22576 null
2025-03-28 RELD: Regularization by Latent Diffusion Models for Image Restoration Pasquale Cascarano et.al. 2503.22563 null
2025-03-28 Niyama : Breaking the Silos of LLM Inference Serving Kanishk Goel et.al. 2503.22562 null
2025-03-28 Bridging the Dimensional Chasm: Uncover Layer-wise Dimensional Reduction in Transformers through Token Correlation Zhuo-Yang Song et.al. 2503.22547 null
2025-03-28 Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities Raman Dutt et.al. 2503.22517 null
2025-03-28 Unlocking LLM Repair Capabilities in Low-Resource Programming Languages Through Cross-Language Translation and Multi-Agent Refinement Wenqiang Luo et.al. 2503.22512 null
2025-03-28 WorkTeam: Constructing Workflows from Natural Language with Multi-Agents Hanchao Liu et.al. 2503.22473 null
2025-03-28 Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey Shengyue Guan et.al. 2503.22458 null
2025-03-28 Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning Abdullah Vanlioglu et.al. 2503.22456 null
2025-03-28 STADE: Standard Deviation as a Pruning Metric Diego Coello de Portugal Mecke et.al. 2503.22451 **[link](https://github.com/coello-dev/stade)**
2025-03-28 NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving Fuhao Li et.al. 2503.22436 null
2025-03-28 CoSIL: Software Issue Localization via LLM-Driven Code Repository Graph Searching Zhonghao Jiang et.al. 2503.22424 **[link](https://github.com/zhonghaojiang/cosil)**
2025-03-28 Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis Jiangyong Huang et.al. 2503.22420 null

2025-03-27

Publish Date Title Authors PDF Code
2025-03-27 A Unified Image-Dense Annotation Generation Model for Underwater Scenes Hongkai Lin et.al. 2503.21771 **[link](https://github.com/hongklin/tide)**
2025-03-27 Exploring the Evolution of Physics Cognition in Video Generation: A Survey Minghui Lin et.al. 2503.21765 **[link](https://github.com/minnie-lin/awesome-physics-cognition-based-video-generation)**
2025-03-27 MemInsight: Autonomous Memory Augmentation for LLM Agents Rana Salama et.al. 2503.21760 null
2025-03-27 Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck Adrian Bulat et.al. 2503.21757 null
2025-03-27 A Unified Framework for Diffusion Bridge Problems: Flow Matching and Schrödinger Matching into One Minyoung Kim et.al. 2503.21756 null
2025-03-27 VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness Dian Zheng et.al. 2503.21755 **[link](https://github.com/vchitect/vbench)**
2025-03-27 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models Yuhan Zhang et.al. 2503.21745 null
2025-03-27 GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics Arsham Gholamzadeh Khoee et.al. 2503.21735 null
2025-03-27 Effective Skill Unlearning through Intervention and Abstention Yongce Li et.al. 2503.21730 **[link](https://github.com/trustworthy-ml-lab/effective_skill_unlearning)**
2025-03-27 Collab: Controlled Decoding using Mixture of Agents for LLM Alignment Souradip Chakraborty et.al. 2503.21720 null
2025-03-27 CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers? Jiefu Ou et.al. 2503.21717 null
2025-03-27 Enhancing Repository-Level Software Repair via Repository-Aware Knowledge Graphs Boyang Yang et.al. 2503.21710 null
2025-03-27 Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data Zhiyuan Ma et.al. 2503.21694 **[link](https://github.com/theericma/triplaneturbo)**
2025-03-27 LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning Hui Wang et.al. 2503.21683 null
2025-03-27 A friendly introduction to triangular transport Maximilian Ramgraber et.al. 2503.21673 null
2025-03-27 COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing Rajvee Sheth et.al. 2503.21670 null
2025-03-27 UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning Zhengxi Lu et.al. 2503.21620 null
2025-03-27 A Measure Based Generalizable Approach to Understandability Vikas Kushwaha et.al. 2503.21615 null
2025-03-27 Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach Javier Coronado-Blázquez et.al. 2503.21613 null
2025-03-27 Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing Johan Wahréus et.al. 2503.21598 null
2025-03-27 Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs Yuxuan Lu et.al. 2503.20749 null

2025-03-26

Publish Date Title Authors PDF Code
2025-03-26 Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark Sondos Mahmoud Bsharat et.al. 2503.20786 null
2025-03-26 Understanding R1-Zero-Like Training: A Critical Perspective Zichen Liu et.al. 2503.20783 null
2025-03-26 Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields Shijie Zhou et.al. 2503.20776 null
2025-03-26 Reliable algorithm selection for machine learning-guided design Clara Fannjiang et.al. 2503.20767 null
2025-03-26 MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search Yunhai Hu et.al. 2503.20757 null
2025-03-26 Continual learning via probabilistic exchangeable sequence modelling Hanwen Xing et.al. 2503.20725 null
2025-03-26 From Annotation to Adaptation: Metrics, Synthetic Data, and Aspect Extraction for Aspect-Based Sentiment Analysis with Large Language Models Nikita Neveditsin et.al. 2503.20715 null
2025-03-26 GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection Xingyu Peng et.al. 2503.20682 null
2025-03-26 Vision as LoRA Han Wang et.al. 2503.20680 null
2025-03-26 BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation Yuyang Peng et.al. 2503.20672 null
2025-03-26 TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews Huimin Xu et.al. 2503.20666 null
2025-03-26 ARMO: Autoregressive Rigging for Multi-Category Objects Mingze Sun et.al. 2503.20663 null
2025-03-26 TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes Raj Sanjay Shah et.al. 2503.20648 null
2025-03-26 Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging Han Wu et.al. 2503.20641 null
2025-03-26 Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions Alessandro Maisto et.al. 2503.20623 null
2025-03-26 Diffusion Counterfactuals for Image Regressors Trung Duc Ha et.al. 2503.20595 null
2025-03-26 Supply chain network rewiring dynamics at the firm-level Tobias Reisch et.al. 2503.20594 null
2025-03-26 What to Retrieve for Effective Retrieval-Augmented Code Generation? An Empirical Study and Beyond Wenchao Gu et.al. 2503.20589 null
2025-03-26 Synthetic Data Augmentation for Cross-domain Implicit Discourse Relation Recognition Frances Yung et.al. 2503.20588 null

2025-03-25

Publish Date Title Authors PDF Code
2025-03-25 CoLLM: A Large Language Model for Composed Image Retrieval Chuong Huynh et.al. 2503.19910 **[link](https://github.com/hmchuong/CoLLM)**
2025-03-25 Scaling Vision Pre-Training to 4K Resolution Baifeng Shi et.al. 2503.19903 null
2025-03-25 ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models Fernando Julio Cendra et.al. 2503.19902 null
2025-03-25 CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation Nengbo Wang et.al. 2503.19878 null
2025-03-25 Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking Xiaoyu Tian et.al. 2503.19855 null
2025-03-25 FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs Carlos Plou et.al. 2503.19850 null
2025-03-25 A Comparative Analysis of Word Segmentation, Part-of-Speech Tagging, and Named Entity Recognition for Historical Chinese Sources, 1900-1950 Zhao Fang et.al. 2503.19844 null
2025-03-25 FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model Jun Zhou et.al. 2503.19839 null
2025-03-25 TopoGEN: topology-driven microstructure generation for in silico modeling of fiber network mechanics Sara Cardona et.al. 2503.19832 null
2025-03-25 IgCraft: A versatile sequence generation framework for antibody discovery and engineering Matthew Greenig et.al. 2503.19821 null
2025-03-25 PAVE: Patching and Adapting Video Large Language Models Zhuoming Liu et.al. 2503.19794 null
2025-03-25 Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models Kartik Thakral et.al. 2503.19783 null
2025-03-25 ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation Haoyu Fu et.al. 2503.19755 null
2025-03-25 Inducing Personality in LLM-Based Honeypot Agents: Measuring the Effect on Human-Like Agenda Generation Lewis Newsham et.al. 2503.19752 null
2025-03-25 Optimizing Photonic Structures with Large Language Model Driven Algorithm Discovery Haoran Yin et.al. 2503.19742 null
2025-03-25 Writing as a testbed for open ended agents Sian Gooding et.al. 2503.19711 null
2025-03-25 AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Itay Nakash et.al. 2503.19693 null
2025-03-25 AIGC-assisted Federated Learning for Vehicular Edge Intelligence: Vehicle Selection, Resource Allocation and Model Augmentation Xianke Qiang et.al. 2503.19676 null
2025-03-25 CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation Rupak Bose et.al. 2503.19661 null
2025-03-25 HausaNLP at SemEval-2025 Task 3: Towards a Fine-Grained Model-Aware Hallucination Detection Maryam Bala et.al. 2503.19650 null

2025-03-24

Publish Date Title Authors PDF Code
2025-03-24 Equivariant Image Modeling Ruixiao Dong et.al. 2503.18948 **[link](https://github.com/drx-code/EquivariantModeling)**
2025-03-24 Aether: Geometric-Aware Unified World Modeling Aether Team et.al. 2503.18945 null
2025-03-24 SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding Mingze Xu et.al. 2503.18943 null
2025-03-24 Video-T1: Test-Time Scaling for Video Generation Fangfu Liu et.al. 2503.18942 null
2025-03-24 Exploring Training and Inference Scaling Laws in Generative Retrieval Hongru Cai et.al. 2503.18941 null
2025-03-24 CoMP: Continual Multimodal Pre-training for Vision Foundation Models Yitong Chen et.al. 2503.18931 **[link](https://github.com/SliMM-X/CoMP-MM)**
2025-03-24 Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training Brian R. Bartoldson et.al. 2503.18929 null
2025-03-24 Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models Meng Cao et.al. 2503.18923 null
2025-03-24 xKV: Cross-Layer SVD for KV-Cache Compression Chi-Chih Chang et.al. 2503.18893 null
2025-03-24 AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration Zhexuan Wang et.al. 2503.18891 null
2025-03-24 I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders Andrey Galichin et.al. 2503.18878 null
2025-03-24 A semantic communication-based workload-adjustable transceiver for wireless AI-generated content (AIGC) delivery Runze Cheng et.al. 2503.18874 null
2025-03-24 Reimagining Memory Access for LLM Inference: Compression-Aware Memory Controller Design Rui Xie et.al. 2503.18869 null
2025-03-24 Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations Junlan Chen et.al. 2503.18865 null
2025-03-24 3DSwapping: Texture Swapping For 3D Object From Single Reference Image Xiao Cao et.al. 2503.18853 null
2025-03-24 EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments Sara Fish et.al. 2503.18825 null
2025-03-24 Defeating Prompt Injections by Design Edoardo Debenedetti et.al. 2503.18813 null
2025-03-24 Classical Planning with LLM-Generated Heuristics: Challenging the State of the Art with Python Code Augusto B. Corrêa et.al. 2503.18809 null
2025-03-24 REALM: A Dataset of Real-World LLM Use Cases Jingwen Cheng et.al. 2503.18792 null
2025-03-24 BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache Dayou Du et.al. 2503.18773 null

2025-03-21

Publish Date Title Authors PDF Code
2025-03-21 Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique Yansi Li et.al. 2503.17363 null
2025-03-21 Position: Interactive Generative Video as Next-Generation Game Engine Jiwen Yu et.al. 2503.17359 null
2025-03-21 OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement Yihe Deng et.al. 2503.17352 null
2025-03-21 Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs Reem Gody et.al. 2503.17336 null
2025-03-21 CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities Yuxuan Zhu et.al. 2503.17332 **[link](https://github.com/uiuc-kang-lab/cve-bench)**
2025-03-21 LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language Kun Chu et.al. 2503.17309 null
2025-03-21 Bugdar: AI-Augmented Secure Code Review for GitHub Pull Requests John Naulty et.al. 2503.17302 null
2025-03-21 Offline Model-Based Optimization: Comprehensive Review Minsu Kim et.al. 2503.17286 null
2025-03-21 CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement Gaifan Zhang et.al. 2503.17279 null
2025-03-21 Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras Shuang Guo et.al. 2503.17262 null
2025-03-21 SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging Aladin Djuhera et.al. 2503.17239 null
2025-03-21 FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs Albert Sawczyn et.al. 2503.17229 null
2025-03-21 Neuro-Symbolic Scene Graph Conditioning for Synthetic Image Dataset Generation Giacomo Savazzi et.al. 2503.17224 null
2025-03-21 Automating Adjudication of Cardiovascular Events Using Large Language Models Sonish Sivarajkumar et.al. 2503.17222 null
2025-03-21 TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning Sheng Wang et.al. 2503.17195 null
2025-03-21 LLMs Love Python: A Study of LLMs' Bias for Programming Languages and Libraries Lukas Twist et.al. 2503.17181 null
2025-03-21 D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens Panpan Wang et.al. 2503.17155 null
2025-03-21 Modifying Large Language Model Post-Training for Diverse Creative Writing John Joon Young Chung et.al. 2503.17126 null
2025-03-21 Large Language Model Compression via the Nested Activation-Aware Decomposition Jun Lu et.al. 2503.17101 null
2025-03-21 A Study into Investigating Temporal Robustness of LLMs Jonas Wallat et.al. 2503.17073 null