Updated on 2025.06.28
Usage instructions: here
Contents
- Adversarial attacks
- Poisoning attacks
- Generative models safety
- Data privacy
- Model Privacy
- Forensics
- AIGC
Adversarial attacks
2025-06-26
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-26 | Generative Adversarial Evasion and Out-of-Distribution Detection for UAV Cyber-Attacks | Deepak Kumar Panda et.al. | 2506.21142 | null |
2025-06-26 | Curriculum-Guided Antifragile Reinforcement Learning for Secure UAV Deconfliction under Observation-Space Attacks | Deepak Kumar Panda et.al. | 2506.21129 | null |
2025-06-26 | Robust Policy Switching for Antifragile Reinforcement Learning for UAV Deconfliction in Adversarial Environments | Deepak Kumar Panda et.al. | 2506.21127 | null |
2025-06-26 | Boosting Generative Adversarial Transferability with Self-supervised Vision Transformer Features | Shangbo Wu et.al. | 2506.21046 | null |
2025-06-26 | E-FreeM2: Efficient Training-Free Multi-Scale and Cross-Modal News Verification via MLLMs | Van-Hoang Phan et.al. | 2506.20944 | null |
2025-06-25
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-25 | Empowering Digital Agriculture: A Privacy-Preserving Framework for Data Sharing and Collaborative Research | Osama Zafar et.al. | 2506.20872 | null |
2025-06-25 | Universal and Efficient Detection of Adversarial Data through Nonuniform Impact on Network Layers | Furkan Mumcu et.al. | 2506.20816 | null |
2025-06-25 | Poster: Enhancing GNN Robustness for Network Intrusion Detection via Agent-based Analysis | Zhonghao Zhan et.al. | 2506.20806 | null |
2025-06-25 | On Convolutions, Intrinsic Dimension, and Diffusion Models | Kin Kwan Leung et.al. | 2506.20705 | null |
2025-06-25 | Vulnerability Disclosure through Adaptive Black-Box Adversarial Attacks on NIDS | Sabrine Ennaji et.al. | 2506.20576 | null |
2025-06-25 | Multimodal Representation Learning and Fusion | Qihang Jin et.al. | 2506.20494 | null |
2025-06-24
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-24 | Adversarial Attacks on Deep Learning-Based False Data Injection Detection in Differential Relays | Ahmad Mohammad Saber et.al. | 2506.19302 | null |
2025-06-24 | DRO-Augment Framework: Robustness by Synergizing Wasserstein Distributionally Robust Optimization and Data Augmentation | Jiaming Hu et.al. | 2506.17874 | null |
2025-06-23
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-23 | Amplifying Machine Learning Attacks Through Strategic Compositions | Yugeng Liu et.al. | 2506.18870 | null |
2025-06-23 | SpaNN: Detecting Multiple Adversarial Patches on CNNs by Spanning Saliency Thresholds | Mauricio Byrd Victorica et.al. | 2506.18591 | null |
2025-06-23 | DUMB and DUMBer: Is Adversarial Training Worth It in the Real World? | Francesco Marchiori et.al. | 2506.18516 | null |
2025-06-23 | Sharpening the Spear: Adaptive Expert-Guided Adversarial Attack Against DRL-based Autonomous Driving Policies | Junchao Fan et.al. | 2506.18304 | null |
2025-06-23 | Semantic Structure-Aware Generative Attacks for Enhanced Adversarial Transferability | Jongoh Jeong et.al. | 2506.18248 | null |
2025-06-22
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-22 | An Attack Method for Medical Insurance Claim Fraud Detection based on Generative Adversarial Network | Yining Pang et.al. | 2506.19871 | null |
2025-06-21
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-21 | Optimization-Free Patch Attack on Stereo Depth Estimation | Hangcheng Liu et.al. | 2506.17632 | null |
2025-06-20
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133 | null |
2025-06-20 | Large Language Model Unlearning for Source Code | Xue Jiang et.al. | 2506.17125 | null |
2025-06-20 | DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches | Yun Xing et.al. | 2506.16690 | null |
2025-06-19
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-19 | Robustness Evaluation of OCR-based Visual Document Understanding under Multi-Modal Adversarial Attacks | Dong Nguyen Tien et.al. | 2506.16407 | null |
2025-06-19 | MBA: Multimodal Bidirectional Attack for Referring Expression Segmentation Models | Xingbai Chen et.al. | 2506.16157 | null |
2025-06-19 | Adversarial Attacks and Detection in Visual Place Recognition for Safer Robot Navigation | Connor Malone et.al. | 2506.15988 | **[link](https://github.com/QVPR/aarapsiproject)** |
2025-06-18
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-18 | PolyGuard: Massive Multi-Domain Safety Policy-Grounded Guardrail Dataset | Mintong Kang et.al. | 2506.19054 | null |
2025-06-18 | VLMInferSlow: Evaluating the Efficiency Robustness of Large Vision-Language Models as a Service | Xiasi Wang et.al. | 2506.15755 | null |
2025-06-18 | Insights on Adversarial Attacks for Tabular Machine Learning via a Systematic Literature Review | Salijona Dyrmishi et.al. | 2506.15506 | null |
2025-06-17
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-17 | Busting the Paper Ballot: Voting Meets Adversarial Machine Learning | Kaleel Mahmood et.al. | 2506.14582 | **[link](https://github.com/votercenter/busting-the-ballot)** |
2025-06-17 | Doppelgänger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack | Daewon Kang et.al. | 2506.14539 | null |
2025-06-17 | A Comparative Study on Proactive and Passive Detection of Deepfake Speech | Chia-Hua Wu et.al. | 2506.14398 | **[link](https://github.com/nii-yamagishilab/antispoofing-watermark)** |
2025-06-16
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-16 | Navigating the Black Box: Leveraging LLMs for Effective Text-Level Graph Injection Attacks | Yuefei Lyu et.al. | 2506.13276 | null |
2025-06-16 | Position: Certified Robustness Does Not (Yet) Imply Model Security | Andrew C. Cullen et.al. | 2506.13024 | null |
2025-06-15
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-15 | The Safety Reminder: A Soft Prompt to Reactivate Delayed Safety Awareness in Vision-Language Models | Peiyuan Tang et.al. | 2506.15734 | null |
2025-06-15 | Constraint-Guided Prediction Refinement via Deterministic Diffusion Trajectories | Pantelis Dogoulis et.al. | 2506.12911 | null |
2025-06-15 | Intriguing Frequency Interpretation of Adversarial Robustness for CNNs and ViTs | Lu Chen et.al. | 2506.12875 | null |
2025-06-15 | Active Adversarial Noise Suppression for Image Forgery Localization | Rongxuan Peng et.al. | 2506.12871 | null |
2025-06-15 | NAP-Tuning: Neural Augmented Prompt Tuning for Adversarially Robust Vision-Language Models | Jiaming Zhang et.al. | 2506.12706 | null |
2025-06-15 | Alphabet Index Mapping: Jailbreaking LLMs through Semantic Dissimilarity | Bilal Saleh Husain et.al. | 2506.12685 | null |
2025-06-14
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-14 | Existence of Adversarial Examples for Random Convolutional Networks via Isoperimetric Inequalities on $\mathbb{so}(d)$ | Amit Daniely et.al. | 2506.12613 | null |
2025-06-14 | On the existence of consistent adversarial attacks in high-dimensional linear classification | Matteo Vilucchio et.al. | 2506.12454 | null |
2025-06-14 | Exploring the Secondary Risks of Large Language Models | Jiawei Chen et.al. | 2506.12382 | null |
2025-06-13
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-13 | Learning Causality for Modern Machine Learning | Yongqiang Chen et.al. | 2506.12226 | null |
2025-06-13 | Improving Large Language Model Safety with Contrastive Representation Learning | Samuel Simko et.al. | 2506.11938 | **[link](https://github.com/samuelsimko/crl-llm-defense)** |
2025-06-13 | A Neural Rejection System Against Universal Adversarial Perturbations in Radio Signal Classification | Lu Zhang et.al. | 2506.11901 | null |
2025-06-13 | Attention-based Adversarial Robust Distillation in Radio Signal Classifications for Low-Power IoT Devices | Lu Zhang et.al. | 2506.11892 | null |
2025-06-13 | TrustGLM: Evaluating the Robustness of GraphLLMs Against Prompt, Text, and Structure Attacks | Qihai Zhang et.al. | 2506.11844 | null |
2025-06-13 | Differential Privacy in Machine Learning: From Symbolic AI to LLMs | Francisco Aguilera-Martínez et.al. | 2506.11687 | null |
2025-06-13 | KCES: Training-Free Defense for Robust Graph Neural Networks via Kernel Complexity | Yaning Jia et.al. | 2506.11611 | null |
2025-06-13 | Investigating Vulnerabilities and Defenses Against Audio-Visual Attacks: A Comprehensive Survey Emphasizing Multimodal Models | Jinming Wen et.al. | 2506.11521 | null |
2025-06-12
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-12 | Lattice Climber Attack: Adversarial attacks for randomized mixtures of classifiers | Lucas Gnecco-Heredia et.al. | 2506.10888 | **[link](https://github.com/lucasgneccoh/lattice_climber_attack)** |
2025-06-12 | Efficiency Robustness of Dynamic Deep Learning Systems | Ravishka Rathnasuriya et.al. | 2506.10831 | **[link](https://github.com/anonymous-sok/sok_submission)** |
2025-06-12 | Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework | Xia Du et.al. | 2506.10685 | null |
2025-06-12 | Assessing the Resilience of Automotive Intrusion Detection Systems to Adversarial Manipulation | Stefano Longari et.al. | 2506.10620 | null |
2025-06-12 | Boosting Adversarial Transferability for Hyperspectral Image Classification Using 3D Structure-invariant Transformation and Intermediate Feature Distance | Chun Liu et.al. | 2506.10459 | null |
2025-06-11
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-11 | GenBreak: Red Teaming Text-to-Image Generators Using Large Language Models | Zilong Wang et.al. | 2506.10047 | null |
2025-06-11 | A look at adversarial attacks on radio waveforms from discrete latent space | Attanasia Garuso et.al. | 2506.09896 | null |
2025-06-11 | Evasion Attacks Against Bayesian Predictive Models | Pablo G. Arce et.al. | 2506.09640 | **[link](https://github.com/pablogarciarce/advreg)** |
2025-06-11 | Effective Red-Teaming of Policy-Adherent Agents | Itay Nakash et.al. | 2506.09600 | null |
2025-06-11 | LLMs Cannot Reliably Judge (Yet?): A Comprehensive Assessment on the Robustness of LLM-as-a-Judge | Songze Li et.al. | 2506.09443 | **[link](https://github.com/s3ic-lab/robustjudge)** |
2025-06-11 | Adversarial Surrogate Risk Bounds for Binary Classification | Natalie S. Frank et.al. | 2506.09348 | null |
2025-06-11 | AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI) | Danush Khanna et.al. | 2506.08885 | null |
2025-06-10
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-10 | PatchGuard: Adversarially Robust Anomaly Detection and Localization through Vision Transformers and Pseudo Anomalies | Mojtaba Nafez et.al. | 2506.09237 | **[link](https://github.com/rohban-lab/patchgaurd)** |
2025-06-10 | Adversarial Text Generation with Dynamic Contextual Perturbation | Hetvi Waghela et.al. | 2506.09148 | null |
2025-06-10 | Towards Robust Deep Reinforcement Learning against Environmental State Perturbation | Chenxu Wang et.al. | 2506.08961 | null |
2025-06-10 | Efficient Robust Conformal Prediction via Lipschitz-Bounded Networks | Thomas Massena et.al. | 2506.05434 | null |
2025-06-09
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-09 | SHIELD: Secure Hypernetworks for Incremental Expansion Learning Defense | Patryk Krukowski et.al. | 2506.08255 | null |
2025-06-09 | Adversarial Attack Classification and Robustness Testing for Large Language Models for Code | Yang Liu et.al. | 2506.07942 | null |
2025-06-09 | Enhancing Adversarial Robustness with Conformal Prediction: A Framework for Guaranteed Model Reliability | Jie Bao et.al. | 2506.07804 | **[link](https://github.com/bjbbbb/enhancing-adversarial-robustness-with-conformal-prediction)** |
2025-06-09 | ProARD: progressive adversarial robustness distillation: provide wide range of robust students | Seyedhamidreza Mousavi et.al. | 2506.07666 | null |
2025-06-09 | Explore the vulnerability of black-box models via diffusion models | Jiacheng Shi et.al. | 2506.07590 | null |
2025-06-08
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-08 | D2R: dual regularization loss with collaborative adversarial generation for model robustness | Zhenyu Liu et.al. | 2506.07056 | null |
2025-06-08 | Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text | Yize Cheng et.al. | 2506.07001 | null |
2025-06-08 | Boosting Adversarial Transferability via Commonality-Oriented Gradient Optimization | Yanting Gao et.al. | 2506.06992 | null |
2025-06-07
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-07 | Rewriting the Budget: A General Framework for Black-Box Attacks Under Cost Asymmetry | Mahdi Salmani et.al. | 2506.06933 | null |
2025-06-07 | KNN-Defense: Defense against 3D Adversarial Point Clouds using Nearest-Neighbor Search | Nima Jamali et.al. | 2506.06906 | null |
2025-06-06
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-06 | SDN-Based False Data Detection With Its Mitigation and Machine Learning Robustness for In-Vehicle Networks | Long Dang et.al. | 2506.06556 | null |
2025-06-06 | SATversary: Adversarial Attacks on Satellite Fingerprinting | Joshua Smailes et.al. | 2506.06119 | null |
2025-06-05
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-05 | Exploring Adversarial Watermarking in Transformer-Based Models: Transferability and Robustness Against Defense Mechanism for Medical Images | Rifat Sadik et.al. | 2506.06389 | null |
2025-06-05 | Explainer-guided Targeted Adversarial Attacks against Binary Code Similarity Detection Models | Mingjie Chen et.al. | 2506.05430 | null |
2025-06-05 | Identifying and Understanding Cross-Class Features in Adversarial Training | Zeming Wei et.al. | 2506.05032 | null |
2025-06-05 | Robustness as Architecture: Designing IQA Models to Withstand Adversarial Perturbations | Igor Meleshin et.al. | 2506.04951 | null |
2025-06-05 | Fool the Stoplight: Realistic Adversarial Patch Attacks on Traffic Light Detectors | Svetlana Pavlitska et.al. | 2506.04823 | null |
2025-06-05 | Influence Functions for Edge Edits in Non-Convex Graph Neural Networks | Jaeseung Heo et.al. | 2506.04694 | null |
2025-06-05 | Towards Better Generalization via Distributional Input Projection Network | Yifan Hao et.al. | 2506.04690 | null |
2025-06-05 | Normative Conflicts and Shallow AI Alignment | Raphaël Millière et.al. | 2506.04679 | null |
2025-06-05 | RAID: A Dataset for Testing the Adversarial Robustness of AI-Generated Image Detectors | Hicham Eddoubi et.al. | 2506.03988 | **[link](https://github.com/pralab/raid)** |
2025-06-04
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-04 | Poisoning Behavioral-based Worker Selection in Mobile Crowdsensing using Generative Adversarial Networks | Ruba Nasser et.al. | 2506.05403 | null |
2025-06-04 | Sylva: Tailoring Personalized Adversarial Defense in Pre-trained Models via Collaborative Fine-tuning | Tianyu Qi et.al. | 2506.05402 | null |
2025-06-04 | Prediction Inconsistency Helps Achieve Generalizable Detection of Adversarial Examples | Sicong Han et.al. | 2506.03765 | null |
2025-06-04 | Robustness of Prompting: Enhancing Robustness of Large Language Models Against Prompting Attacks | Lin Mu et.al. | 2506.03627 | null |
2025-06-04 | Across Programming Language Silos: A Study on Cross-Lingual Retrieval-augmented Code Generation | Qiming Zhu et.al. | 2506.03535 | null |
2025-06-03
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-03 | Attacking Attention of Foundation Models Disrupts Downstream Tasks | Hondamunige Prasanna Silva et.al. | 2506.05394 | null |
2025-06-03 | Dynamic Epsilon Scheduling: A Multi-Factor Adaptive Perturbation Budget for Adversarial Training | Alan Mitkiy et.al. | 2506.04263 | null |
2025-06-03 | Adversarial Attacks on Robotic Vision Language Action Models | Eliot Krzysztof Jones et.al. | 2506.03350 | **[link](https://github.com/eliotjones1/robogcg)** |
2025-06-03 | Privacy Leaks by Adversaries: Adversarial Iterations for Membership Inference Attack | Jing Xue et.al. | 2506.02711 | null |
2025-06-03 | Poster: FedBlockParadox -- A Framework for Simulating and Securing Decentralized Federated Learning | Gabriele Digregorio et.al. | 2506.02679 | null |
2025-06-03 | Tarallo: Evading Behavioral Malware Detectors in the Problem Space | Gabriele Digregorio et.al. | 2506.02660 | **[link](https://github.com/necst/Tarallo)** |
2025-06-03 | BitBypass: A New Direction in Jailbreaking Aligned Large Language Models with Bitstream Camouflage | Kalyan Nakka et.al. | 2506.02479 | null |
2025-06-03 | Adversarial control of synchronization in complex oscillator networks | Yasutoshi Nagahama et.al. | 2506.02403 | **[link](https://github.com/kztakemoto/advSync)** |
2025-06-03 | On the Stability of Graph Convolutional Neural Networks: A Probabilistic Perspective | Ning Zhang et.al. | 2506.01213 | null |
2025-06-02
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-02 | Robust Satisficing Gaussian Process Bandits Under Adversarial Attacks | Artun Saday et.al. | 2506.01625 | null |
2025-06-02 | Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation | Yuan Gan et.al. | 2506.01591 | null |
2025-06-02 | Enhancing Diffusion-based Unrestricted Adversarial Attacks via Adversary Preferences Alignment | Kaixun Jiang et.al. | 2506.01511 | null |
2025-06-02 | Adversarial learning for nonparametric regression: Minimax rate and adaptive estimation | Jingfu Peng et.al. | 2506.01267 | null |
2025-06-01
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-01 | Fighting Fire with Fire (F3): A Training-free and Efficient Visual Adversarial Example Purification Method in LVLMs | Yudong Zhang et.al. | 2506.01064 | null |
2025-06-01 | CAPAA: Classifier-Agnostic Projector-Based Adversarial Attack | Zhan Li et.al. | 2506.00978 | null |
2025-06-01 | Breaking Latent Prior Bias in Detectors for Generalizable AIGC Image Detection | Yue Zhou et.al. | 2506.00874 | null |
2025-06-01 | SafeGenes: Evaluating the Adversarial Robustness of Genomic Foundation Models | Huixin Zhan et.al. | 2506.00821 | null |
2025-05-31
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-31 | Poster: Adapting Pretrained Vision Transformers with LoRA Against Attack Vectors | Richard E. Neddo et.al. | 2506.00661 | null |
2025-05-31 | Con Instruction: Universal Jailbreaking of Multimodal Large Language Models via Non-Textual Modalities | Jiahui Geng et.al. | 2506.00548 | null |
2025-05-31 | Adversarial Machine Learning for Robust Password Strength Estimation | Pappu Jha et.al. | 2506.00373 | null |
2025-05-31 | Towards Effective and Efficient Adversarial Defense with Diffusion Models for Robust Visual Tracking | Long Xu et.al. | 2506.00325 | null |
2025-05-31 | Practical Adversarial Attacks on Stochastic Bandits via Fake Data Injection | Qirun Zeng et.al. | 2505.21938 | null |
2025-05-30
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-30 | Adversarial Threat Vectors and Risk Mitigation for Retrieval-Augmented Generation Systems | Chris M. Ward et.al. | 2506.00281 | null |
2025-05-30 | 3D Gaussian Splat Vulnerabilities | Matthew Hull et.al. | 2506.00280 | null |
2025-05-30 | Black-box Adversarial Attacks on CNN-based SLAM Algorithms | Maria Rafaela Gkeka et.al. | 2505.24654 | null |
2025-05-30 | A Flat Minima Perspective on Understanding Augmentations and Model Robustness | Weebum Yoo et.al. | 2505.24592 | null |
2025-05-30 | Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors | Andrea Pedrotti et.al. | 2505.24523 | null |
2025-05-30 | Learning Safety Constraints for Large Language Models | Xin Chen et.al. | 2505.24445 | null |
2025-05-30 | Adversarial Preference Learning for Robust LLM Alignment | Yuanfu Wang et.al. | 2505.24369 | null |
2025-05-30 | Light as Deception: GPT-driven Natural Relighting Against Vision-Language Pre-training Models | Ying Yang et.al. | 2505.24227 | null |
2025-05-30 | The Butterfly Effect in Pathology: Exploring Security in Pathology Foundation Models | Jiashuai Liu et.al. | 2505.24141 | null |
2025-05-30 | Adversarially Robust AI-Generated Image Detection for Free: An Information Theoretic Perspective | Ruixuan Zhang et.al. | 2505.22604 | null |
2025-05-29
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-29 | SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents | Kunlun Zhu et.al. | 2505.23559 | **[link](https://github.com/ulab-uiuc/safescientist)** |
2025-05-29 | TRAP: Targeted Redirecting of Agentic Preferences | Hangoo Kang et.al. | 2505.23518 | null |
2025-05-29 | Adaptive Jailbreaking Strategies Based on the Semantic Understanding Capabilities of Large Language Models | Mingyu Yu et.al. | 2505.23404 | null |
2025-05-29 | Adversarial Semantic and Label Perturbation Attack for Pedestrian Attribute Recognition | Weizhe Kong et.al. | 2505.23313 | **[link](https://github.com/event-ahu/openpar)** |
2025-05-29 | Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion | Chunlong Xie et.al. | 2505.23266 | null |
2025-05-29 | The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector | Aixuan Li et.al. | 2505.22499 | null |
2025-05-28
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-28 | Understanding Adversarial Training with Energy-based Models | Mujtaba Hussain Mirza et.al. | 2505.22486 | null |
2025-05-28 | Seeing the Threat: Vulnerabilities in Vision-Language Models to Adversarial Attack | Juan Ren et.al. | 2505.21967 | null |
2025-05-28 | Rethinking Gradient-based Adversarial Attacks on Point Cloud Classification | Jun Chen et.al. | 2505.21854 | null |
2025-05-28 | The Silent Saboteur: Imperceptible Adversarial Attacks against Black-Box Retrieval-Augmented Generation Systems | Hongru Song et.al. | 2505.18583 | null |
2025-05-27
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-27 | What is Adversarial Training for Diffusion Models? | Briglia Maria Rosaria et.al. | 2505.21742 | null |
2025-05-27 | Preventing Adversarial AI Attacks Against Autonomous Situational Awareness: A Maritime Case Study | Mathew J. Walter et.al. | 2505.21609 | null |
2025-05-27 | Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment | Xiaojun Jia et.al. | 2505.21494 | null |
2025-05-27 | Attribute-Efficient PAC Learning of Sparse Halfspaces with Constant Malicious Noise Rate | Shiwei Zeng et.al. | 2505.21430 | null |
2025-05-27 | A Framework for Adversarial Analysis of Decision Support Systems Prior to Deployment | Brett Bissey et.al. | 2505.21414 | null |
2025-05-27 | Boosting Adversarial Transferability via High-Frequency Augmentation and Hierarchical-Gradient Fusion | Yayin Zheng et.al. | 2505.21181 | null |
2025-05-27 | TabAttackBench: A Benchmark for Adversarial Attacks on Tabular Data | Zhipeng He et.al. | 2505.21027 | null |
2025-05-27 | Breaking Dataset Boundaries: Class-Agnostic Targeted Adversarial Attacks | Taïga Gonçalves et.al. | 2505.20782 | null |
2025-05-26
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-26 | Comparing Neural Network Encodings for Logic-based Explainability | Levi Cordeiro Carvalho et.al. | 2505.20269 | null |
2025-05-26 | Attention! You Vision Language Model Could Be Maliciously Manipulated | Xiaosen Wang et.al. | 2505.19911 | null |
2025-05-26 | One Surrogate to Fool Them All: Universal, Transferable, and Targeted Adversarial Attacks with CLIP | Binyan Xu et.al. | 2505.19840 | **[link](https://github.com/binyxu/univintruder)** |
2025-05-26 | TESSER: Transfer-Enhancing Adversarial Attacks from Vision Transformers via Spectral and Semantic Regularization | Amira Guesmi et.al. | 2505.19613 | null |
2025-05-25
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-25 | Curvature Dynamic Black-box Attack: revisiting adversarial robustness via dynamic curvature estimation | Peiran Sun et.al. | 2505.19194 | null |
2025-05-24
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-24 | Security Concerns for Large Language Models: A Survey | Miles Q. Li et.al. | 2505.18889 | null |
2025-05-24 | Audio Jailbreak Attacks: Exposing Vulnerabilities in SpeechGPT in a White-Box Framework | Binhao Ma et.al. | 2505.18864 | null |
2025-05-24 | Mal-D2GAN: Double-Detector based GAN for Malware Generation | Nam Hoang Thanh et.al. | 2505.18806 | null |
2025-05-23
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-23 | Towards more transferable adversarial attack in black-box manner | Chun Tong Lei et.al. | 2505.18097 | null |
2025-05-23 | CAMME: Adaptive Deepfake Image Detection with Multi-Modal Cross-Attention | Naseem Khan et.al. | 2505.18035 | **[link](https://github.com/magnet300/camme)** |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015 | **[link](https://github.com/shashankskagnihotri/benchmarking_reliability_generalization)** |
2025-05-23 | Superplatforms Have to Attack AI Agents | Jianghao Lin et.al. | 2505.17861 | null |
2025-05-23 | Temporal Consistency Constrained Transferable Adversarial Attacks with Background Mixup for Action Recognition | Ping Li et.al. | 2505.17807 | **[link](https://github.com/mlvccn/bmtc_transferattackvid)** |
2025-05-23 | EVADE: Multimodal Benchmark for Evasive Content Detection in E-Commerce Applications | Ancheng Xu et.al. | 2505.17654 | null |
2025-05-23 | Ownership Verification of DNN Models Using White-Box Adversarial Attacks with Specified Probability Manipulation | Teruki Sano et.al. | 2505.17579 | null |
2025-05-23 | What You Read Isn't What You Hear: Linguistic Sensitivity in Deepfake Speech Detection | Binh Nguyen et.al. | 2505.17513 | null |
2025-05-23 | Enhancing Adversarial Robustness of Vision Language Models via Adversarial Mixture Prompt Tuning | Shiji Zhao et.al. | 2505.17509 | null |
2025-05-23 | VEAttack: Downstream-agnostic Vision Encoder Attack against Large Vision Language Models | Hefei Mei et.al. | 2505.17440 | null |
2025-05-23 | When Are Concepts Erased From Diffusion Models? | Kevin Lu et.al. | 2505.17013 | null |
2025-05-22
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-22 | Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms | Baran Hashemi et.al. | 2505.17190 | null |
2025-05-22 | MixAT: Combining Continuous and Discrete Adversarial Training for LLMs | Csaba Dékány et.al. | 2505.16947 | null |
2025-05-22 | CAIN: Hijacking LLM-Humans Conversations via a Two-Stage Malicious System Prompt Generation and Refining Framework | Viet Pham et.al. | 2505.16888 | null |
2025-05-22 | Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected Vulnerability | Punya Syon Pandey et.al. | 2505.16789 | **[link](https://github.com/psyonp/accidental_misalignment)** |
2025-05-22 | Experimental robustness benchmark of quantum neural network on a superconducting quantum processor | Hai-Feng Zhang et.al. | 2505.16714 | null |
2025-05-22 | AdvReal: Adversarial Patch Generation Framework with Application to Adversarial Safety Evaluation of Object Detection Systems | Yuanhao Huang et.al. | 2505.16402 | **[link](https://github.com/huangyh98/advreal)** |
2025-05-22 | Chain-of-Thought Poisoning Attacks against R1-based Retrieval-Augmented Generation Systems | Hongru Song et.al. | 2505.16367 | null |
2025-05-22 | Accelerating Targeted Hard-Label Adversarial Attacks in Low-Query Black-Box Settings | Arjhun Swaminathan et.al. | 2505.16313 | **[link](https://github.com/mdppml/tea)** |
2025-05-22 | SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning | Kaiwen Zhou et.al. | 2505.16186 | null |
2025-05-22 | TRAIL: Transferable Robust Adversarial Images via Latent diffusion | Yuhao Xue et.al. | 2505.16166 | null |
2025-05-22 | A Few Large Shifts: Layer-Inconsistency Based Minimal Overhead Adversarial Example Detection | Sanggeon Yun et.al. | 2505.12586 | null |
2025-05-21
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-21 | Reverse Engineering Human Preferences with Reinforcement Learning | Lisa Alazraki et.al. | 2505.15795 | null |
2025-05-21 | Beyond Classification: Evaluating Diffusion Denoised Smoothing for Security-Utility Trade off | Yury Belousov et.al. | 2505.15594 | null |
2025-05-21 | My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping | Hon Ming Yam et.al. | 2505.15336 | null |
2025-05-21 | Blind Spot Navigation: Evolutionary Discovery of Sensitive Semantic Concepts for LVLMs | Zihao Pan et.al. | 2505.15265 | null |
2025-05-21 | Few-Shot Adversarial Low-Rank Fine-Tuning of Vision-Language Models | Sajjad Ghiasvand et.al. | 2505.15130 | null |
2025-05-21 | A Survey On Secure Machine Learning | Taobo Liao et.al. | 2505.15124 | null |
2025-05-21 | Robustness Evaluation of Graph-based News Detection Using Network Structural Information | Xianghua Zeng et.al. | 2505.14453 | null |
2025-05-20
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-20 | GraphemeAug: A Systematic Approach to Synthesized Hard Negative Keyword Spotting Examples | Harry Zhang et.al. | 2505.14814 | null |
2025-05-20 | Adverseness vs. Equilibrium: Exploring Graph Adversarial Resilience through Dynamic Equilibrium | Xinxin Fan et.al. | 2505.14463 | null |
2025-05-20 | Universal Acoustic Adversarial Attacks for Flexible Control of Speech-LLMs | Rao Ma et.al. | 2505.14286 | null |
2025-05-20 | Adversarial Training from Mean Field Perspective | Soichiro Kumano et.al. | 2505.14021 | null |
2025-05-20 | Safety2Drive: Safety-Critical Scenario Benchmark for the Evaluation of Autonomous Driving | Jingzheng Li et.al. | 2505.13872 | null |
2025-05-19
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-19 | FlowPure: Continuous Normalizing Flows for Adversarial Purification | Elias Collaert et.al. | 2505.13280 | **[link](https://github.com/distrinet/flowpure)** |
2025-05-19 | Quantum Algorithms for Causal Estimands | Rishi Goel et.al. | 2505.12873 | null |
2025-05-19 | Language Models That Walk the Talk: A Framework for Formal Fairness Certificates | Danqing Chen et.al. | 2505.12767 | null |
2025-05-19 | On the Mechanisms of Adversarial Data Augmentation for Robust and Adaptive Transfer Learning | Hana Satou et.al. | 2505.12681 | null |
2025-05-19 | Spiking Neural Network: a low power solution for physical layer authentication | Jung Hoon Lee et.al. | 2505.12647 | null |
2025-05-18
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-18 | SPIRIT: Patching Speech Language Models against Jailbreak Attacks | Amirbek Djanibekov et.al. | 2505.13541 | null |
2025-05-18 | A Survey of Attacks on Large Language Models | Wenrui Xu et.al. | 2505.12567 | null |
2025-05-18 | EVALOOP: Assessing LLM Robustness in Programming from a Self-consistency Perspective | Sen Fang et.al. | 2505.12185 | null |
2025-05-17
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-17 | FABLE: A Localized, Targeted Adversarial Attack on Weather Forecasting Models | Yue Deng et.al. | 2505.12167 | null |
2025-05-17 | Black-box Adversaries from Latent Space: Unnoticeable Attacks on Human Pose and Shape Estimation | Zhiying Li et.al. | 2505.12009 | null |
2025-05-17 | Adversarial Robustness for Unified Multi-Modal Encoders via Efficient Calibration | Chih-Ting Liao et.al. | 2505.11895 | null |
2025-05-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-16 | On the Sharp Input-Output Analysis of Nonlinear Systems under Adversarial Attacks | Jihun Kim et.al. | 2505.11688 | null |
2025-05-16 | GenoArmory: A Unified Evaluation Framework for Adversarial Attacks on Genomic Foundation Models | Haozheng Luo et.al. | 2505.10983 | **[link](https://github.com/MAGICS-LAB/GenoArmory)** |
2025-05-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-14 | Revisiting Adversarial Perception Attacks and Defense Methods on Autonomous Driving Systems | Cheng Chen et.al. | 2505.11532 | null |
2025-05-14 | Adversarial Attack on Large Language Models using Exponentiated Gradient Descent | Sajib Biswas et.al. | 2505.09820 | **[link](https://github.com/sbamit/exponentiated-gradient-descent-llm-attack)** |
2025-05-14 | Evaluating the Robustness of Adversarial Defenses in Malware Detection Systems | Mostafa Jafari et.al. | 2505.09342 | **[link](https://github.com/mostafa-ja/sigma-binary)** |
2025-05-14 | Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction | Xiaowei Zhu et.al. | 2505.05084 | null |
2025-05-13
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-13 | Towards Adaptive Meta-Gradient Adversarial Examples for Visual Tracking | Wei-Long Tian et.al. | 2505.08999 | **[link](https://github.com/pgao-lab/amga)** |
2025-05-13 | SHAP-based Explanations are Sensitive to Feature Representation | Hyunseung Hwang et.al. | 2505.08345 | **[link](https://github.com/aguno/shap-attack)** |
2025-05-13 | AI and Generative AI Transforming Disaster Management: A Survey of Damage Assessment and Response Techniques | Aman Raj et.al. | 2505.08202 | null |
2025-05-13 | Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs | Chetan Pathade et.al. | 2505.04806 | null |
2025-05-12
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-12 | Sharp Gaussian approximations for Decentralized Federated Learning | Soham Bonnerjee et.al. | 2505.08125 | null |
2025-05-12 | Dynamical Low-Rank Compression of Neural Networks with Robustness under Adversarial Attacks | Steffen Schotthöfer et.al. | 2505.08022 | null |
2025-05-12 | Must Read: A Systematic Survey of Computational Persuasion | Nimet Beyza Bozdag et.al. | 2505.07775 | **[link](https://github.com/beyzabozdag/PersuasionSurvey)** |
2025-05-12 | GRADA: Graph-based Reranker against Adversarial Documents Attack | Jingjie Zheng et.al. | 2505.07546 | **[link](https://github.com/agrzheng/GRADA)** |
2025-05-12 | No Query, No Access | Wenqiang Wang et.al. | 2505.07258 | null |
2025-05-11
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-11 | IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method | Mihyeon Kim et.al. | 2505.06889 | null |
2025-05-11 | DP-TRAE: A Dual-Phase Merging Transferable Reversible Adversarial Example for Image Privacy Protection | Xia Du et.al. | 2505.06860 | null |
2025-05-10
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-10 | Boundary-Guided Trajectory Prediction for Road Aware and Physically Feasible Autonomous Driving | Ahmed Abouelazm et.al. | 2505.06740 | null |
2025-05-10 | TAROT: Towards Essentially Domain-Invariant Robustness with Theoretical Justification | Dongyoon Yang et.al. | 2505.06580 | null |
2025-05-09
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-09 | Remote Rowhammer Attack using Adversarial Observations on Federated Learning Clients | Jinsheng Yuan et.al. | 2505.06335 | null |
2025-05-09 | Realistic Adversarial Attacks for Robustness Evaluation of Trajectory Prediction Models via Future State Perturbation | Julian F. Schumann et.al. | 2505.06134 | **[link](https://github.com/jhagenus/general-framework-update-adversarial-jeroen)** |
2025-05-09 | A Taxonomy of Attacks and Defenses in Split Learning | Aqsa Shabbir et.al. | 2505.05872 | null |
2025-05-08
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-08 | Unpacking Robustness in Inflectional Languages: Adversarial Evaluation and Mechanistic Insights | Paweł Walkowiak et.al. | 2505.07856 | null |
2025-05-08 | X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP | Hanxun Huang et.al. | 2505.05528 | **[link](https://github.com/HanxunH/XTransferBench)** |
2025-05-08 | DispBench: Benchmarking Disparity Estimation to Synthetic Corruptions | Shashank Agnihotri et.al. | 2505.05091 | **[link](https://github.com/shashankskagnihotri/benchmarking_robustness)** |
2025-05-08 | Uncovering the Limitations of Model Inversion Evaluation -- Benchmarks and Connection to Type-I Adversarial Attacks | Sy-Tuyen Ho et.al. | 2505.03519 | null |
2025-05-07
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-07 | Input-Specific and Universal Adversarial Attack Generation for Spiking Neural Networks in the Spiking Domain | Spyridon Raptis et.al. | 2505.06299 | null |
2025-05-07 | Crafting Physical Adversarial Examples by Combining Differentiable and Physically Based Renders | Yuqiu Liu et.al. | 2505.04662 | null |
2025-05-07 | Reliable Disentanglement Multi-view Learning Against View Adversarial Attacks | Xuyang Wang et.al. | 2505.04046 | **[link](https://github.com/Willy1005/2025-IJCAI-RDML)** |
2025-05-06
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-06 | ALMA: Aggregated Lipschitz Maximization Attack on Auto-encoders | Chethan Krishnamurthy Ramanaik et.al. | 2505.03646 | null |
2025-05-06 | Robustness in AI-Generated Detection: Enhancing Resistance to Adversarial Attacks | Sun Haoxuan et.al. | 2505.03435 | null |
2025-05-06 | Attention-aggregated Attack for Boosting the Transferability of Facial Adversarial Examples | Jian-Wei Li et.al. | 2505.03383 | null |
2025-05-06 | A Chaos Driven Metric for Backdoor Attack Detection | Hema Karnam Surendrababu et.al. | 2505.03208 | null |
2025-05-06 | Adversarial Sample Generation for Anomaly Detection in Industrial Control Systems | Abdul Mustafa et.al. | 2505.03120 | null |
2025-05-06 | Adversarial Attacks in Multimodal Systems: A Practitioner's Survey | Shashank Kapoor et.al. | 2505.03084 | null |
2025-05-06 | Adversarial Robustness of Deep Learning Models for Inland Water Body Segmentation from SAR Images | Siddharth Kothari et.al. | 2505.01884 | **[link](https://github.com/gvcl/iwseg-sar-poison)** |
2025-05-05
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-05 | Adversarial Robustness Analysis of Vision-Language Models in Medical Image Segmentation | Anjila Budathoki et.al. | 2505.02971 | **[link](https://github.com/anjilab/secure-private-ai)** |
2025-05-05 | Bayesian Robust Aggregation for Federated Learning | Aleksandr Karakulev et.al. | 2505.02490 | **[link](https://github.com/sciml-fl/bra-fl)** |
2025-05-04
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-04 | A Comprehensive Analysis of Adversarial Attacks against Spam Filters | Esra Hotoğlu et.al. | 2505.03831 | null |
2025-05-04 | Lightweight Defense Against Adversarial Attacks in Time Series Classification | Yi Han et.al. | 2505.02073 | **[link](https://github.com/yi126/lightweight-defence)** |
2025-05-03
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-03 | CAMOUFLAGE: Exploiting Misinformation Detection Systems Through LLM-driven Adversarial Claim Transformation | Mazal Bethany et.al. | 2505.01900 | null |
2025-05-03 | Rogue Cell: Adversarial Attack and Defense in Untrusted O-RAN Setup Exploiting the Traffic Steering xApp | Eran Aizikovich et.al. | 2505.01816 | null |
2025-05-02
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-02 | Modeling Behavioral Preferences of Cyber Adversaries Using Inverse Reinforcement Learning | Aditya Shinde et.al. | 2505.03817 | null |
2025-05-02 | Machine Learning for Cyber-Attack Identification from Traffic Flows | Yujing Zhou et.al. | 2505.01489 | null |
2025-05-02 | Constrained Network Adversarial Attacks: Validity, Robustness, and Transferability | Anass Grini et.al. | 2505.01328 | null |
2025-05-02 | Harmonizing Intra-coherence and Inter-divergence in Ensemble Attacks for Adversarial Transferability | Zhaoyang Ma et.al. | 2505.01168 | null |
2025-05-02 | Transferable Adversarial Attacks on Black-Box Vision-Language Models | Kai Hu et.al. | 2505.01050 | null |
2025-05-02 | Quantum Support Vector Regression for Robust Anomaly Detection | Kilian Tscharke et.al. | 2505.01012 | null |
2025-05-02 | MARS: Defending Unmanned Aerial Vehicles From Attacks on Inertial Sensors with Model-based Anomaly Detection and Recovery | Haocheng Meng et.al. | 2505.00924 | null |
2025-05-02 | Fast and Low-Cost Genomic Foundation Models via Outlier Removal | Haozheng Luo et.al. | 2505.00598 | **[link](https://github.com/MAGICS-LAB/GERM)** |
2025-05-01
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-01 | OET: Optimization-based prompt injection Evaluation Toolkit | Jinsheng Pan et.al. | 2505.00843 | null |
2025-05-01 | Fully passive quantum random number generation with untrusted light | KaiWei Qiu et.al. | 2505.00636 | null |
2025-05-01 | Analysis of the vulnerability of machine learning regression models to adversarial attacks using data from 5G wireless networks | Leonid Legashev et.al. | 2505.00487 | null |
2025-05-01 | GAN-based Generator of Adversarial Attack on Intelligent End-to-End Autoencoder-based Communication System | Jianyuan Chen et.al. | 2505.00395 | null |
2025-04-30
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-30 | Stochastic Subspace Descent Accelerated via Bi-fidelity Line Search | Nuojin Cheng et.al. | 2505.00162 | null |
2025-04-30 | Active Light Modulation to Counter Manipulation of Speech Visual Content | Hadleigh Schwartz et.al. | 2504.21846 | null |
2025-04-30 | Generative AI in Financial Institution: A Global Survey of Opportunities, Threats, and Regulation | Bikash Saha et.al. | 2504.21574 | null |
2025-04-30 | How to Backdoor the Knowledge Distillation | Chen Wu et.al. | 2504.21323 | null |
2025-04-30 | Quantifying the Noise of Structural Perturbations on Graph Adversarial Attacks | Junyuan Fang et.al. | 2504.20869 | null |
2025-04-29
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-29 | AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security | Zikui Cai et.al. | 2504.20965 | null |
2025-04-29 | Mitigating the Structural Bias in Graph Adversarial Defenses | Junyuan Fang et.al. | 2504.20848 | null |
2025-04-29 | Learning and Generalization with Mixture Data | Harsh Vardhan et.al. | 2504.20651 | null |
2025-04-29 | WILD: a new in-the-Wild Image Linkage Dataset for synthetic image attribution | Pietro Bongini et.al. | 2504.19595 | null |
2025-04-28
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-28 | AGATE: Stealthy Black-box Watermarking for Multimodal Model Copyright Protection | Jianbo Gao et.al. | 2504.21044 | null |
2025-04-28 | Prefill-Based Jailbreak: A Novel Approach of Bypassing LLM Safety Boundary | Yakai Li et.al. | 2504.21038 | null |
2025-04-28 | The Dark Side of Digital Twins: Adversarial Attacks on AI-Driven Water Forecasting | Mohammadhossein Homaei et.al. | 2504.20295 | null |
2025-04-28 | A Case Study on the Use of Representativeness Bias as a Defense Against Adversarial Cyber Threats | Briland Hitaj et.al. | 2504.20245 | null |
2025-04-28 | Evaluate-and-Purify: Fortifying Code Language Models Against Adversarial Attacks Using LLM-as-a-Judge | Wenhan Mu et.al. | 2504.19730 | null |
2025-04-28 | Fooling the Decoder: An Adversarial Attack on Quantum Error Correction | Jerome Lenssen et.al. | 2504.19651 | null |
2025-04-28 | FCGHunter: Towards Evaluating Robustness of Graph-Based Android Malware Detection | Shiwen Song et.al. | 2504.19456 | null |
2025-04-27
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-27 | Forging and Removing Latent-Noise Diffusion Watermarks Using a Single Image | Anubhav Jain et.al. | 2504.20111 | null |
2025-04-27 | CapsFake: A Multimodal Capsule Network for Detecting Instruction-Guided Deepfakes | Tuan Nguyen et.al. | 2504.19212 | null |
2025-04-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-26 | Unveiling and Mitigating Adversarial Vulnerabilities in Iterative Optimizers | Elad Sofer et.al. | 2504.19000 | null |
2025-04-26 | Test It Before You Trust It: Applying Software Testing for Trustworthy In-context Learning | Teeradaj Racharak et.al. | 2504.18827 | null |
2025-04-25
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-25 | Edge-Based Learning for Improved Classification Under Adversarial Noise | Manish Kansana et.al. | 2504.20077 | null |
2025-04-25 | Adversarial Attacks on LLM-as-a-Judge Systems: Insights from Prompt Injections | Narek Maloyan et.al. | 2504.18333 | null |
2025-04-25 | Generative AI for Physical-Layer Authentication | Rui Meng et.al. | 2504.18175 | null |
2025-04-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-24 | A Simple DropConnect Approach to Transfer-based Targeted Attack | Tongrui Su et.al. | 2504.18594 | null |
2025-04-24 | Unsupervised Corpus Poisoning Attacks in Continuous Space for Dense Retrieval | Yongkang Li et.al. | 2504.17884 | null |
2025-04-24 | On the Generalization of Adversarially Trained Quantum Classifiers | Petros Georgiou et.al. | 2504.17690 | null |
2025-04-24 | Evaluating the Vulnerability of ML-Based Ethereum Phishing Detectors to Single-Feature Adversarial Perturbations | Ahod Alghuried et.al. | 2504.17684 | null |
2025-04-24 | Evaluating Time Series Models for Urban Wastewater Management: Predictive Performance, Model Complexity and Resilience | Vipin Singh et.al. | 2504.17461 | null |
2025-04-24 | Unveiling Hidden Vulnerabilities in Digital Human Generation via Adversarial Attacks | Zhiying Li et.al. | 2504.17457 | null |
2025-04-23
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-23 | Seeking Flat Minima over Diverse Surrogates for Improved Adversarial Transferability: A Theoretical Framework and Algorithmic Instantiation | Meixi Zheng et.al. | 2504.16474 | null |
2025-04-23 | Property-Preserving Hashing for $\ell_1$-Distance Predicates: Applications to Countering Adversarial Input Attacks | Hassan Asghar et.al. | 2504.16355 | null |
2025-04-23 | aiXamine: Simplified LLM Safety and Security | Fatih Deniz et.al. | 2504.14985 | null |
2025-04-22
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-22 | Human-Imperceptible Physical Adversarial Attack for NIR Face Recognition Models | Songyan Xie et.al. | 2504.15823 | null |
2025-04-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-21 | Unifying Image Counterfactuals and Feature Attributions with Latent-Space Adversarial Attacks | Jeremy Goldwasser et.al. | 2504.15479 | null |
2025-04-21 | MR. Guard: Multilingual Reasoning Guardrail using Curriculum Learning | Yahan Yang et.al. | 2504.15241 | null |
2025-04-20
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-20 | Towards Model Resistant to Transferable Adversarial Examples via Trigger Activation | Yi Yu et.al. | 2504.14541 | null |
2025-04-19
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-19 | Adversarial Attack for RGB-Event based Visual Object Tracking | Qiang Chen et.al. | 2504.14423 | null |
2025-04-19 | Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models | Chung-En et.al. | 2504.14395 | null |
2025-04-19 | Rethinking Target Label Conditioning in Adversarial Attacks: A 2D Tensor-Guided Generative Approach | Hangyu Liu et.al. | 2504.14137 | null |
2025-04-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-18 | Context-Awareness and Interpretability of Rare Occurrences for Discovery and Formalization of Critical Failure Modes | Sridevi Polavaram et.al. | 2504.16117 | null |
2025-04-18 | VideoPASTA: 7K Preference Pairs That Matter for Video-LLM Alignment | Yogesh Kulkarni et.al. | 2504.14096 | null |
2025-04-18 | Fairness and Robustness in Machine Unlearning | Khoa Tran et.al. | 2504.13610 | null |
2025-04-18 | Q-FAKER: Query-free Hard Black-box Attack via Controlled Generation | CheolWon Na et.al. | 2504.13551 | null |
2025-04-17
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-17 | DYNAMITE: Dynamic Defense Selection for Enhancing Machine Learning-based Intrusion Detection Against Adversarial Attacks | Jing Chen et.al. | 2504.13301 | null |
2025-04-17 | Quantum Computing Supported Adversarial Attack-Resilient Autonomous Vehicle Perception Module for Traffic Sign Classification | Reek Majumder et.al. | 2504.12644 | **[link](https://github.com/reek129/QuantumAI_MultiClass_Sign_Recognition)** |
2025-04-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-16 | Human Aligned Compression for Robust Models | Samuel Räber et.al. | 2504.12255 | **[link](https://github.com/aplesner/Human-aligned-compression-for-robust-models)** |
2025-04-16 | SemDiff: Generating Natural Unrestricted Adversarial Examples via Semantic Attributes Optimization in Diffusion Models | Zeyu Dai et.al. | 2504.11923 | null |
2025-04-16 | Support is All You Need for Certified VAE Training | Changming Xu et.al. | 2504.11831 | null |
2025-04-16 | Towards Safe Synthetic Image Generation On the Web: A Multimodal Robust NSFW Defense and Million Scale Dataset | Muhammad Shahid Muneer et.al. | 2504.11707 | **[link](https://github.com/shahidmuneer/multimodal-nsfw-defense)** |
2025-04-15
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-15 | R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning | Lijun Sheng et.al. | 2504.11195 | **[link](https://github.com/tomsheng21/r-tpt)** |
2025-04-15 | Exploring Backdoor Attack and Defense for LLM-empowered Recommendations | Liangbo Ning et.al. | 2504.11182 | null |
2025-04-15 | Token-Level Constraint Boundary Search for Jailbreaking Text-to-Image Models | Jiangtao Liu et.al. | 2504.11106 | null |
2025-04-15 | QAVA: Query-Agnostic Visual Attack to Large Vision-Language Models | Yudong Zhang et.al. | 2504.11038 | **[link](https://github.com/btzyd/qava)** |
2025-04-15 | Defending Against Frequency-Based Attacks with Diffusion Models | Fatemeh Amerehi et.al. | 2504.11034 | null |
2025-04-15 | Towards Spatially-Aware and Optimally Faithful Concept-Based Explanations | Shubham Kumar et.al. | 2504.10833 | null |
2025-04-15 | The Sword of Damocles in ViTs: Computational Redundancy Amplifies Adversarial Transferability | Jiani Liu et.al. | 2504.10804 | null |
2025-04-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-14 | Investigating cybersecurity incidents using large language models in latest-generation wireless networks | Leonid Legashev et.al. | 2504.13196 | null |
2025-04-14 | Demo: ViolentUTF as An Accessible Platform for Generative AI Red Teaming | Tam N. Nguyen et.al. | 2504.10603 | null |
2025-04-14 | Quantifying Privacy Leakage in Split Inference via Fisher-Approximated Shannon Information Analysis | Ruijun Deng et.al. | 2504.10016 | null |
2025-04-14 | An Investigation of Large Language Models and Their Vulnerabilities in Spam Detection | Qiyao Tang et.al. | 2504.09776 | null |
2025-04-13
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-13 | Bregman Linearized Augmented Lagrangian Method for Nonconvex Constrained Stochastic Zeroth-order Optimization | Qiankun Shi et.al. | 2504.09409 | null |
2025-04-12
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-12 | PapMOT: Exploring Adversarial Patch Attack against Multiple Object Tracking | Jiahuan Long et.al. | 2504.09361 | null |
2025-04-12 | Classical Autoencoder Distillation of Quantum Adversarial Manipulations | Amena Khatun et.al. | 2504.09216 | null |
2025-04-12 | From Visual Explanations to Counterfactual Explanations with Latent Diffusion | Tung Luu et.al. | 2504.09202 | null |
2025-04-11
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-11 | Robust SAM: On the Adversarial Robustness of Vision Foundation Models | Jiahuan Long et.al. | 2504.08906 | null |
2025-04-11 | Toward Spiking Neural Network Local Learning Modules Resistant to Adversarial Attacks | Jiaqi Lin et.al. | 2504.08897 | null |
2025-04-11 | On Transfer-based Universal Attacks in Pure Black-box Setting | Mohammad A. A. K. Jalwana et.al. | 2504.08866 | null |
2025-04-11 | X-Guard: Multilingual Guard Agent for Content Moderation | Bibek Upadhayay et.al. | 2504.08848 | null |
2025-04-11 | Enabling Safety for Aerial Robots: Planning and Control Architectures | Kaleb Ben Naveed et.al. | 2504.08601 | null |
2025-04-11 | Toward Realistic Adversarial Attacks in IDS: A Novel Feasibility Metric for Transferability | Sabrine Ennaji et.al. | 2504.08480 | null |
2025-04-11 | Adversarial Examples in Environment Perception for Automated Driving (Review) | Jun Yan et.al. | 2504.08414 | null |
2025-04-11 | Adversarial Training of Reward Models | Alexander Bukharin et.al. | 2504.06141 | null |
2025-04-10
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-10 | Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge | Riccardo Cantini et.al. | 2504.07887 | **[link](https://github.com/SCAlabUnical/CLEAR-Bias_LLM_benchmark)** |
2025-04-09
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-09 | SafeMLRM: Demystifying Safety in Multi-modal Large Reasoning Models | Junfeng Fang et.al. | 2504.08813 | null |
2025-04-09 | Code Generation with Small Language Models: A Deep Evaluation on Codeforces | Débora Souza et.al. | 2504.07343 | null |
2025-04-08
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-08 | Exploring Gradient-Guided Masked Language Model to Detect Textual Adversarial Attacks | Xiaomei Zhang et.al. | 2504.08798 | null |
2025-04-08 | Exploiting Meta-Learning-based Poisoning Attacks for Graph Link Prediction | Mingchen Li et.al. | 2504.06492 | null |
2025-04-08 | Towards Calibration Enhanced Network by Inverse Adversarial Attack | Yupeng Cheng et.al. | 2504.06358 | null |
2025-04-08 | Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking | Junxi Chen et.al. | 2504.05838 | **[link](https://github.com/fhdnskfbeuv/attackipa)** |
2025-04-08 | Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing | Tianchi Liu et.al. | 2504.05657 | **[link](https://github.com/liu-tianchi/nes2net)** |
2025-04-07
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-07 | SINCon: Mitigate LLM-Generated Malicious Message Injection Attack for Rumor Detection | Mingqing Zhang et.al. | 2504.07135 | null |
2025-04-07 | Secure Diagnostics: Adversarial Robustness Meets Clinical Interpretability | Mohammad Hossein Najafi et.al. | 2504.05483 | null |
2025-04-07 | Adversarial KA | Sviatoslav Dzhenzher et.al. | 2504.05255 | null |
2025-04-07 | Security Risks in Vision-Based Beam Prediction: From Spatial Proxy Attacks to Feature Refinement | Avi Deb Raha et.al. | 2504.05222 | null |
2025-04-07 | Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models | Yoojin Jung et.al. | 2504.04747 | null |
2025-04-07 | On the Robustness of GUI Grounding Models Against Image Attacks | Haoren Zhao et.al. | 2504.04716 | null |
2025-04-07 | Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models | Zhaochen Wang et.al. | 2504.01589 | null |
2025-04-06
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-06 | Systematic Literature Review on Vehicular Collaborative Perception -- A Computer Vision Perspective | Lei Wan et.al. | 2504.04631 | null |
2025-04-06 | Selective Masking Adversarial Attack on Automatic Speech Recognition Systems | Zheng Fang et.al. | 2504.04394 | null |
2025-04-06 | WeiDetect: Weibull Distribution-Based Defense against Poisoning Attacks in Federated Learning for Network Intrusion Detection Systems | Sameera K. M. et.al. | 2504.04367 | null |
2025-04-03
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-03 | SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections | Prashant Kumar et.al. | 2504.03089 | null |
2025-04-03 | Moving Target Defense Against Adversarial False Data Injection Attacks In Power Grids | Yexiang Chen et.al. | 2504.03065 | null |
2025-04-03 | ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization | Kehua Feng et.al. | 2504.02725 | null |
2025-04-03 | Evaluating and Enhancing Segmentation Model Robustness with Metamorphic Testing | Seif Mzoughi et.al. | 2504.02335 | null |
2025-04-03 | Robust Unsupervised Domain Adaptation for 3D Point Cloud Segmentation Under Source Adversarial Attacks | Haosheng Li et.al. | 2504.01659 | null |
2025-04-03 | No Free Lunch with Guardrails | Divyanshu Kumar et.al. | 2504.00441 | null |
2025-04-02
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-02 | Watermarking for AI Content Detection: A Review on Text, Visual, and Audio Modalities | Lele Cao et.al. | 2504.03765 | null |
2025-04-02 | AdPO: Enhancing the Adversarial Robustness of Large Vision-Language Models with Preference Optimization | Chaohu Liu et.al. | 2504.01735 | null |
2025-04-02 | Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation | Junjie Chen et.al. | 2504.01668 | null |
2025-04-02 | Representation Bending for Large Language Model Safety | Ashkan Yousefpour et.al. | 2504.01550 | null |
2025-04-02 | Leveraging Generalizability of Image-to-Image Translation for Enhanced Adversarial Defense | Haibo Zhang et.al. | 2504.01399 | null |
2025-04-02 | Breaking BERT: Gradient Attack on Twitter Sentiment Analysis for Targeted Misclassification | Akil Raj Subedi et.al. | 2504.01345 | **[link](https://github.com/akil003/bert-attack)** |
2025-04-01
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-01 | Towards Resilient Federated Learning in CyberEdge Networks: Recent Advances and Future Trends | Kai Li et.al. | 2504.01240 | null |
2025-04-01 | TenAd: A Tensor-based Low-rank Black Box Adversarial Attack for Video Classification | Kimia Haghjooei et.al. | 2504.01228 | null |
2025-04-01 | S3C2 Summit 2024-08: Government Secure Supply Chain Summit | Courtney Miller et.al. | 2504.00924 | null |
2025-04-01 | Whispering Under the Eaves: Protecting User Privacy Against Commercial and LLM-powered Automatic Speech Recognition Systems | Weifei Jin et.al. | 2504.00858 | **[link](https://github.com/WeifeiJin/AudioShield)** |
2025-04-01 | Alleviating Performance Disparity in Adversarial Spatiotemporal Graph Learning Under Zero-Inflated Distribution | Songran Bai et.al. | 2504.00721 | null |
2025-04-01 | Impact of Data Duplication on Deep Neural Network-Based Image Classifiers: Robust vs. Standard Models | Alireza Aghabagherloo et.al. | 2504.00638 | null |
2025-04-01 | Unleashing the Power of Pre-trained Encoders for Universal Adversarial Attack Detection | Yinghe Zhang et.al. | 2504.00429 | null |
2025-04-01 | A Survey on Unlearnable Data | Jiahao Li et.al. | 2503.23536 | **[link](https://github.com/LiJiahao-Alex/Awesome-UnLearnable-Data)** |
2025-03-31
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-31 | System Identification from Partial Observations under Adversarial Attacks | Jihun Kim et.al. | 2504.00244 | null |
2025-03-31 | Towards Benchmarking and Assessing the Safety and Robustness of Autonomous Driving on Safety-critical Scenarios | Jingzheng Li et.al. | 2503.23708 | null |
2025-03-30
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-30 | Towards Trustworthy GUI Agents: A Survey | Yucheng Shi et.al. | 2503.23434 | **[link](https://github.com/sycny/Awesome-Trustworthy-GUI-Agents)** |
2025-03-28
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-28 | Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models | YangTian Yan et.al. | 2503.22205 | **[link](https://github.com/yyt0718/Intri_Attack)** |
2025-03-27
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-27 | Learning to Lie: Reinforcement Learning Attacks Damage Human-AI Teams and Teams of LLMs | Abed Kareem Musaffar et.al. | 2503.21983 | null |
2025-03-27 | Adversarial Wear and Tear: Exploiting Natural Damage for Generating Physical-World Adversarial Examples | Samra Irshad et.al. | 2503.21164 | null |
2025-03-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-26 | Robust Deep Reinforcement Learning in Robotics via Adaptive Gradient-Masked Adversarial Attacks | Zongyuan Zhang et.al. | 2503.20844 | null |
2025-03-26 | State-Aware Perturbation Optimization for Robust Deep Reinforcement Learning | Zongyuan Zhang et.al. | 2503.20613 | null |
2025-03-26 | Feature Statistics with Uncertainty Help Adversarial Robustness | Ran Wang et.al. | 2503.20583 | **[link](https://github.com/techtrekkerz/fsu)** |
2025-03-26 | Lipschitz Constant Meets Condition Number: Learning Robust and Compact Deep Neural Networks | Yangqi Feng et.al. | 2503.20454 | null |
2025-03-26 | Enabling Heterogeneous Adversarial Transferability via Feature Permutation Attacks | Tao Wu et.al. | 2503.20310 | null |
2025-03-26 | Are We There Yet? Unraveling the State-of-the-Art Graph Network Intrusion Detection Systems | Chenglong Wang et.al. | 2503.20281 | null |
2025-03-26 | How Secure is Forgetting? Linking Machine Unlearning to Machine Learning Attacks | Muhammed Shafi K. P. et.al. | 2503.20257 | null |
2025-03-26 | Hi-ALPS -- An Experimental Robustness Quantification of Six LiDAR-based Object Detection Systems for Autonomous Driving | Alexandra Arzberger et.al. | 2503.17168 | null |
2025-03-25
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-25 | Bitstream Collisions in Neural Image Compression via Adversarial Perturbations | Jordan Madden et.al. | 2503.19817 | **[link](https://github.com/neddamj/nicsec)** |
2025-03-25 | SITA: Structurally Imperceptible and Transferable Adversarial Attacks for Stylized Image Generation | Jingdan Kang et.al. | 2503.19791 | **[link](https://github.com/a-raniy-day/sita)** |
2025-03-25 | Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization | Weifei Jin et.al. | 2503.19591 | null |
2025-03-25 | Towards Imperceptible Adversarial Attacks for Time Series Classification with Local Perturbations and Frequency Analysis | Wenwei Gu et.al. | 2503.19519 | null |
2025-03-25 | Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent | Philip Doldo et.al. | 2503.19347 | null |
2025-03-25 | Efficient Adversarial Detection Frameworks for Vehicle-to-Microgrid Services in Edge Computing | Ahmed Omara et.al. | 2503.19318 | null |
2025-03-25 | Robustness of Proof of Team Sprint (PoTS) Against Attacks: A Simulation-Based Analysis | Naoki Yonezawa et.al. | 2503.19293 | null |
2025-03-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-24 | CEFW: A Comprehensive Evaluation Framework for Watermark in Large Language Models | Shuhao Zhang et.al. | 2503.20802 | **[link](https://github.com/drankxs/balancedwatermark)** |
2025-03-22
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-22 | Safe RLHF-V: Safe Reinforcement Learning from Human Feedback in Multimodal Large Language Models | Jiaming Ji et.al. | 2503.17682 | null |
2025-03-21
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-21 | Debugging and Runtime Analysis of Neural Networks with VLMs (A Case Study) | Boyue Caroline Hu et.al. | 2503.17416 | null |
2025-03-21 | Robustness of deep learning classification to adversarial input on GPUs: asynchronous parallel accumulation is a source of vulnerability | Sanjif Shanmugavelu et.al. | 2503.17173 | null |
2025-03-21 | EasyRobust: A Comprehensive and Easy-to-use Toolkit for Robust and Generalized Vision | Xiaofeng Mao et.al. | 2503.16975 | null |
2025-03-20
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-20 | REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models | Jie Zhang et.al. | 2503.16566 | null |
2025-03-20 | Narrowing Class-Wise Robustness Gaps in Adversarial Training | Fatemeh Amerehi et.al. | 2503.16179 | null |
2025-03-20 | SAUCE: Selective Concept Unlearning in Vision-Language Models with Sparse Autoencoders | Qing Li et.al. | 2503.14530 | null |
2025-03-19
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-19 | Improving Adversarial Transferability on Vision Transformers via Forward Propagation Refinement | Yuchen Ren et.al. | 2503.15404 | **[link](https://github.com/ryc-98/fpr)** |
2025-03-19 | Improving Generalization of Universal Adversarial Perturbation via Dynamic Maximin Optimization | Yechao Zhang et.al. | 2503.12793 | **[link](https://github.com/yechao-zhang/dm-uap)** |
2025-03-18
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-18 | Unveiling the Role of Randomization in Multiclass Adversarial Classification: Insights from Graph Theory | Lucas Gnecco-Heredia et.al. | 2503.14299 | null |
2025-03-18 | Survey of Adversarial Robustness in Multimodal Large Language Models | Chengze Jiang et.al. | 2503.13962 | null |
2025-03-18 | Make the Most of Everything: Further Considerations on Disrupting Diffusion-based Customization | Long Tang et.al. | 2503.13945 | null |
2025-03-18 | Evolution-based Region Adversarial Prompt Learning for Robustness Enhancement in Vision-Language Models | Xiaojun Jia et.al. | 2503.12874 | null |
2025-03-18 | GSBA$^K$: $top$-$K$ Geometric Score-based Black-box Attack | Md Farhamdur Reza et.al. | 2503.12827 | null |
2025-03-17
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-17 | Securing Virtual Reality Experiences: Unveiling and Tackling Cybersickness Attacks with Explainable AI | Ripan Kumar Kundu et.al. | 2503.13419 | null |
2025-03-17 | How Good is my Histopathology Vision-Language Foundation Model? A Holistic Benchmark | Roba Al Majzoub et.al. | 2503.12990 | **[link](https://github.com/musk007/Histopathology_Benchmark)** |
2025-03-16
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-16 | Towards Privacy-Preserving Data-Driven Education: The Potential of Federated Learning | Mohammad Khalil et.al. | 2503.13550 | null |
2025-03-16 | Algebraic Adversarial Attacks on Explainability Models | Lachlan Simpson et.al. | 2503.12683 | null |
2025-03-16 | GAN-Based Single-Stage Defense for Traffic Sign Classification Under Adversarial Patch Attack | Abyad Enan et.al. | 2503.12567 | null |
2025-03-16 | Augmented Adversarial Trigger Learning | Zhe Wang et.al. | 2503.12339 | null |
2025-03-15
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-15 | Robust Dataset Distillation by Matching Adversarial Trajectories | Wei Lai et.al. | 2503.12069 | null |
Poisoning attacks
2025-06-26
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-26 | E-FreeM2: Efficient Training-Free Multi-Scale and Cross-Modal News Verification via MLLMs | Van-Hoang Phan et.al. | 2506.20944 | null |
2025-06-26 | SPA: Towards More Stealth and Persistent Backdoor Attacks in Federated Learning | Chengcheng Zhu et.al. | 2506.20931 | null |
2025-06-25
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-25 | Client Clustering Meets Knowledge Sharing: Enhancing Privacy and Robustness in Personalized Peer-to-Peer Learning | Mohammad Mahdi Maheri et.al. | 2506.20413 | null |
2025-06-25 | Don't Hash Me Like That: Exposing and Mitigating Hash-Induced Unfairness in Local Differential Privacy | Berkay Kemal Balioglu et.al. | 2506.20290 | null |
2025-06-25 | Screen Hijack: Visual Poisoning of VLM Agents in Mobile Environments | Xuan Wang et.al. | 2506.13205 | null |
2025-06-24
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-24 | Identifying Physically Realizable Triggers for Backdoored Face Recognition Networks | Ankita Raj et.al. | 2506.19533 | null |
2025-06-22
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-22 | Generalization under Byzantine & Poisoning Attacks: Tight Stability Bounds in Robust Distributed Learning | Thomas Boudou et.al. | 2506.18020 | null |
2025-06-20
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-20 | CUBA: Controlled Untargeted Backdoor Attack against Deep Neural Networks | Yinghao Wu et.al. | 2506.17350 | null |
2025-06-19
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-19 | SecureFed: A Two-Phase Framework for Detecting Malicious Clients in Federated Learning | Likhitha Annapurna Kavuri et.al. | 2506.16458 | null |
2025-06-19 | Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models | Biao Yi et.al. | 2506.16447 | null |
2025-06-18
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-18 | PDLRecover: Privacy-preserving Decentralized Model Recovery with Machine Unlearning | Xiangman Li et.al. | 2506.15112 | null |
2025-06-17
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-17 | Winter Soldier: Backdooring Language Models at Pre-Training with Indirect Data Poisoning | Wassim Bouaziz et.al. | 2506.14913 | null |
2025-06-16
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-16 | EBS-CFL: Efficient and Byzantine-robust Secure Clustered Federated Learning | Zhiqiang Li et.al. | 2506.13612 | **[link](https://github.com/lee-va/ebs-cfl)** |
2025-06-16 | Unlearning-Enhanced Website Fingerprinting Attack: Against Backdoor Poisoning in Anonymous Networks | Yali Yuan et.al. | 2506.13563 | null |
2025-06-16 | Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs | Houcheng Jiang et.al. | 2506.13285 | null |
2025-06-16 | Detecting Hard-Coded Credentials in Software Repositories via LLMs | Chidera Biringa et.al. | 2506.13090 | null |
2025-06-16 | Data Shifts Hurt CoT: A Theoretical Study | Lang Yin et.al. | 2506.10647 | null |
2025-06-15
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-15 | TrojanTO: Action-Level Backdoor Attacks against Trajectory Optimization Models | Yang Dai et.al. | 2506.12815 | null |
2025-06-14
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-14 | When Forgetting Triggers Backdoors: A Clean Unlearning Attack | Marco Arazzi et.al. | 2506.12522 | null |
2025-06-14 | InverTune: Removing Backdoors from Multimodal Contrastive Learning Models via Trigger Inversion and Activation Tuning | Mengyuan Sun et.al. | 2506.12411 | null |
2025-06-13
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-13 | Investigating Vulnerabilities and Defenses Against Audio-Visual Attacks: A Comprehensive Survey Emphasizing Multimodal Models | Jinming Wen et.al. | 2506.11521 | null |
2025-06-13 | Bias Amplification in RAG: Poisoning Knowledge Retrieval to Steer LLMs | Linlin Wang et.al. | 2506.11415 | null |
2025-06-12
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-12 | Collapsing Sequence-Level Data-Policy Coverage via Poisoning Attack in Offline Reinforcement Learning | Xue Zhou et.al. | 2506.11172 | null |
2025-06-12 | ME: Trigger Element Combination Backdoor Attack on Copyright Infringement | Feiyu Yang et.al. | 2506.10776 | null |
2025-06-12 | TED-LaST: Towards Robust Backdoor Defense Against Adaptive Attacks | Xiaoxing Mo et.al. | 2506.10722 | null |
2025-06-12 | TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning | Songze Li et.al. | 2506.09562 | null |
2025-06-11
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-11 | FedMLAC: Mutual Learning Driven Heterogeneous Federated Audio Classification | Jun Bai et.al. | 2506.10207 | null |
2025-06-11 | Devil's Hand: Data Poisoning Attacks to Locally Private Graph Learning Protocols | Longzhu He et.al. | 2506.09803 | null |
2025-06-11 | Evasion Attacks Against Bayesian Predictive Models | Pablo G. Arce et.al. | 2506.09640 | **[link](https://github.com/pablogarciarce/advreg)** |
2025-06-11 | Your Agent Can Defend Itself against Backdoor Attacks | Li Changjiang et.al. | 2506.08336 | null |
2025-06-10
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-10 | WGLE:Backdoor-free and Multi-bit Black-box Watermarking for Graph Neural Networks | Tingzhi Li et.al. | 2506.08602 | null |
2025-06-10 | Single-Node Trigger Backdoor Attacks in Graph-Based Recommendation Systems | Runze Li et.al. | 2506.08401 | null |
2025-06-10 | SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models | Wenhan Yao et.al. | 2506.08346 | null |
2025-06-09
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-09 | Circumventing Backdoor Space via Weight Symmetry | Jie Peng et.al. | 2506.07467 | **[link](https://github.com/jiepeng104/tsc)** |
2025-06-08
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-08 | Backdoor Attack on Vision Language Models with Stealthy Semantic Manipulation | Zhiyuan Zhong et.al. | 2506.07214 | null |
2025-06-07
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-07 | Can In-Context Reinforcement Learning Recover From Reward Poisoning Attacks? | Paulius Sasnauskas et.al. | 2506.06891 | null |
2025-06-07 | Rescaled Influence Functions: Accurate Data Attribution in High Dimension | Ittai Rubinstein et.al. | 2506.06656 | null |
2025-06-06
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-06 | Securing Traffic Sign Recognition Systems in Autonomous Vehicles | Thushari Hapuarachchi et.al. | 2506.06563 | null |
2025-06-06 | A Systematic Review of Poisoning Attacks Against Large Language Models | Neil Fendley et.al. | 2506.06518 | null |
2025-06-06 | Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems | Haowei Wang et.al. | 2506.06151 | null |
2025-06-06 | SATversary: Adversarial Attacks on Satellite Fingerprinting | Joshua Smailes et.al. | 2506.06119 | null |
2025-06-06 | What Really is a Member? Discrediting Membership Inference via Poisoning | Neal Mangaokar et.al. | 2506.06003 | null |
2025-06-05
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-05 | Invisible Backdoor Triggers in Image Editing Model via Deep Watermarking | Yu-Feng Chen et.al. | 2506.04879 | null |
2025-06-05 | SRD: Reinforcement-Learned Semantic Perturbation for Backdoor Defense in VLMs | Shuhan Xu et.al. | 2506.04743 | null |
2025-06-05 | Beyond the Protocol: Unveiling Attack Vectors in the Model Context Protocol Ecosystem | Hao Song et.al. | 2506.02040 | null |
2025-06-04
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-04 | Robust Anti-Backdoor Instruction Tuning in LVLMs | Yuan Xun et.al. | 2506.05401 | null |
2025-06-04 | VLMs Can Aggregate Scattered Training Patches | Zhanhui Zhou et.al. | 2506.03614 | **[link](https://github.com/zhziszz/visual-stitching)** |
2025-06-04 | DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors | Yize Cheng et.al. | 2505.23001 | **[link](https://github.com/chengez/DyePack)** |
2025-06-03
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-03 | BadReward: Clean-Label Poisoning of Reward Models in Text-to-Image RLHF | Kaiwen Duan et.al. | 2506.03234 | null |
2025-06-03 | Agnostic Learning under Targeted Poisoning: Optimal Rates and the Role of Randomness | Bogdan Chornomaz et.al. | 2506.03075 | null |
2025-06-02
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-02 | Mitigating Data Poisoning Attacks to Local Differential Privacy | Xiaolin Li et.al. | 2506.02156 | null |
2025-06-02 | Which Factors Make Code LLMs More Vulnerable to Backdoor Attacks? A Systematic Study | Chenyu Wang et.al. | 2506.01825 | null |
2025-06-02 | Variance-Based Defense Against Blended Backdoor Attacks | Sujeevan Aseervatham et.al. | 2506.01444 | null |
2025-05-31
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-31 | Security Concerns for Large Language Models: A Survey | Miles Q. Li et.al. | 2505.18889 | null |
2025-05-30
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-30 | Adversarial Threat Vectors and Risk Mitigation for Retrieval-Augmented Generation Systems | Chris M. Ward et.al. | 2506.00281 | null |
2025-05-30 | Heterogeneous Graph Backdoor Attack | Jiawei Chen et.al. | 2506.00191 | null |
2025-05-30 | Cascading Adversarial Bias from Injection to Distillation in Language Models | Harsh Chaudhari et.al. | 2505.24842 | null |
2025-05-29
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-29 | Distributed Federated Learning for Vehicular Network Security: Anomaly Detection Benefits and Multi-Domain Attack Threats | Utku Demir et.al. | 2505.23706 | null |
2025-05-29 | Merge Hijacking: Backdoor Attacks to Model Merging of Large Language Models | Zenghui Yuan et.al. | 2505.23561 | null |
2025-05-29 | Performance Guaranteed Poisoning Attacks in Federated Learning: A Sliding Mode Approach | Huazi Pan et.al. | 2505.16403 | null |
2025-05-28
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-28 | Spa-VLM: Stealthy Poisoning Attacks on RAG-based VLM | Lei Yu et.al. | 2505.23828 | null |
2025-05-28 | Wolf Hidden in Sheep's Conversations: Toward Harmless Data-Based Backdoor Attacks for Jailbreaking Large Language Models | Jiawei Kong et.al. | 2505.17601 | null |
2025-05-27
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-27 | GUARD:Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation | Naizhu Jin et.al. | 2505.21425 | null |
2025-05-27 | HeteroBA: A Structure-Manipulating Backdoor Attack on Heterogeneous Graphs | Honglin Gao et.al. | 2505.21140 | null |
2025-05-27 | Multi-level Certified Defense Against Poisoning Attacks in Offline Reinforcement Learning | Shijie Liu et.al. | 2505.20621 | null |
2025-05-27 | Backdoors in DRL: Four Environments Focusing on In-distribution Triggers | Chace Ashcraft et.al. | 2505.17248 | null |
2025-05-26
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-26 | CPA-RAG:Covert Poisoning Attacks on Retrieval-Augmented Generation in Large Language Models | Chunyang Li et.al. | 2505.19864 | null |
2025-05-26 | Poison in the Well: Feature Embedding Disruption in Backdoor Attacks | Zhou Feng et.al. | 2505.19821 | null |
2025-05-26 | Fox in the Henhouse: Supply-Chain Backdoor Attacks Against Reinforcement Learning | Shijie Liu et.al. | 2505.19532 | null |
2025-05-26 | Are Time-Series Foundation Models Deployment-Ready? A Systematic Study of Adversarial Robustness Across Domains | Jiawen Zhang et.al. | 2505.19397 | null |
2025-05-24
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-24 | Benchmarking Poisoning Attacks against Retrieval-Augmented Generation | Baolei Zhang et.al. | 2505.18543 | null |
2025-05-23
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-23 | Ranking Free RAG: Replacing Re-ranking with Selection in RAG for Sensitive Domains | Yash Saxena et.al. | 2505.16014 | null |
2025-05-23 | A Linear Approach to Data Poisoning | Diego Granziol et.al. | 2505.15175 | null |
2025-05-23 | Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-Powered Mobile GUI Agents | Pengzhou Cheng et.al. | 2505.14418 | null |
2025-05-22
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-22 | BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization | Xueyang Zhou et.al. | 2505.16640 | null |
2025-05-22 | Chain-of-Thought Poisoning Attacks against R1-based Retrieval-Augmented Generation Systems | Hongru Song et.al. | 2505.16367 | null |
2025-05-22 | BadDepth: Backdoor Attacks Against Monocular Depth Estimation in the Physical World | Ji Guo et.al. | 2505.16154 | null |
2025-05-22 | PoisonArena: Uncovering Competing Poisoning Attacks in Retrieval-Augmented Generation | Liuji Chen et.al. | 2505.12574 | **[link](https://github.com/yxf203/poisonarena)** |
2025-05-21
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-21 | BadSR: Stealthy Label Backdoor Attacks on Image Super-Resolution | Ji Guo et.al. | 2505.15308 | null |
2025-05-20
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-20 | SifterNet: A Generalized and Model-Agnostic Trigger Purification Approach | Shaoye Luo et.al. | 2505.14531 | null |
2025-05-20 | Capturing the Effects of Quantization on Trojans in Code LLMs | Aftab Hussain et.al. | 2505.14200 | null |
2025-05-20 | SVAFD: A Secure and Verifiable Co-Aggregation Protocol for Federated Distillation | Tian Wen et.al. | 2505.13319 | null |
2025-05-20 | One Shot Dominance: Knowledge Poisoning Attack on Retrieval-Augmented Generation Systems | Zhiyuan Chang et.al. | 2505.11548 | null |
2025-05-19
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-19 | Does Low Rank Adaptation Lead to Lower Robustness against Training-Time Attacks? | Zi Liang et.al. | 2505.12871 | **[link](https://github.com/liangzid/lora-ssecurity)** |
2025-05-17
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-17 | FIGhost: Fluorescent Ink-based Stealthy and Flexible Backdoor Attacks on Physical Traffic Sign Recognition | Shuai Yuan et.al. | 2505.12045 | null |
2025-05-17 | FL-PLAS: Federated Learning with Partial Layer Aggregation for Backdoor Defense Against High-Ratio Malicious Clients | Jianyi Zhang et.al. | 2505.12019 | **[link](https://github.com/besticsp/fl-plas)** |
2025-05-16
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-16 | PeerGuard: Defending Multi-Agent Systems Against Backdoor Attacks Through Mutual Reasoning | Falong Fan et.al. | 2505.11642 | **[link](https://github.com/anonymoususertech/defensecot)** |
2025-05-16 | The Ripple Effect: On Unforeseen Complications of Backdoor Attacks | Rui Zhang et.al. | 2505.11586 | **[link](https://github.com/zhangrui4041/backdoor_complications)** |
2025-05-16 | Towards Robust Spiking Neural Networks:Mitigating Heterogeneous Training Vulnerability via Dominant Eigencomponent Projection | Desong Zhang et.al. | 2505.11134 | null |
2025-05-16 | Task Reconstruction and Extrapolation for $π_0$ using Text Latent | Quanyi Li et.al. | 2505.03500 | null |
2025-05-15
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-15 | Defending the Edge: Representative-Attention for Mitigating Backdoor Attacks in Federated Learning | Chibueze Peace Obioma et.al. | 2505.10297 | null |
2025-05-15 | Sybil-based Virtual Data Poisoning Attacks in Federated Learning | Changxun Zhu et.al. | 2505.09983 | null |
2025-05-15 | Model-Targeted Data Poisoning Attacks against ITS Applications with Provable Convergence | Xin Wang et.al. | 2505.03966 | null |
2025-05-14
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-14 | Toward Malicious Clients Detection in Federated Learning | Zhihao Dou et.al. | 2505.09110 | null |
2025-05-13
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-13 | MUBox: A Critical Evaluation Framework of Deep Machine Unlearning | Xiang Li et.al. | 2505.08576 | null |
2025-05-12
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-12 | MixBridge: Heterogeneous Image-to-Image Backdoor Attack through Mixture of Schrödinger Bridges | Shixi Qin et.al. | 2505.08809 | **[link](https://github.com/qsx830/mixbridge)** |
2025-05-10
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-10 | POISONCRAFT: Practical Poisoning of Retrieval-Augmented Generation for Large Language Models | Yangguang Shao et.al. | 2505.06579 | **[link](https://github.com/andyshaw01/poisoncraft)** |
2025-05-09
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-09 | Natural Reflection Backdoor Attack on Vision Language Model for Autonomous Driving | Ming Liu et.al. | 2505.06413 | null |
2025-05-09 | Sparsification Under Siege: Defending Against Poisoning Attacks in Communication-Efficient Federated Learning | Zhiyong Jin et.al. | 2505.01454 | null |
2025-05-08
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-08 | KPI Poisoning: An Attack in Open RAN Near Real-Time Control Loop | Hamed Alimohammadi et.al. | 2505.05537 | null |
2025-05-08 | MTL-UE: Learning to Learn Nothing for Multi-Task Learning | Yi Yu et.al. | 2505.05279 | null |
2025-05-08 | Stealthy LLM-Driven Data Poisoning Attacks Against Embedding-Based Retrieval-Augmented Recommender Systems | Fatemeh Nazary et.al. | 2505.05196 | null |
2025-05-06
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-06 | BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models | Zihan Wang et.al. | 2505.03501 | null |
2025-05-06 | Mitigating Backdoor Triggered and Targeted Data Poisoning Attacks in Voice Authentication Systems | Alireza Mohammadi et.al. | 2505.03455 | null |
2025-05-06 | Framework GNN-AID: Graph Neural Network Analysis Interpretation and Defense | Kirill Lukyanov et.al. | 2505.03424 | **[link](https://github.com/ispras/gnn-aid)** |
2025-05-06 | A Chaos Driven Metric for Backdoor Attack Detection | Hema Karnam Surendrababu et.al. | 2505.03208 | null |
2025-05-06 | Adversarial Robustness of Deep Learning Models for Inland Water Body Segmentation from SAR Images | Siddharth Kothari et.al. | 2505.01884 | **[link](https://github.com/gvcl/iwseg-sar-poison)** |
2025-05-04
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-04 | Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents | Christian Schroeder de Witt et.al. | 2505.02077 | null |
2025-05-03
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-03 | Backdoor Attacks Against Patch-based Mixture of Experts | Cedric Chan et.al. | 2505.01811 | **[link](https://github.com/geefmegeld/pmoe-backdoor)** |
2025-05-02
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-02 | Explainable AI Based Diagnosis of Poisoning Attacks in Evolutionary Swarms | Mehrdad Asadi et.al. | 2505.01181 | null |
2025-05-01
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-01 | Protocol-agnostic and Data-free Backdoor Attacks on Pre-trained Models in RF Fingerprinting | Tianya Zhao et.al. | 2505.00881 | **[link](https://github.com/Tianyaz97/rf_backdoor)** |
2025-04-30
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-30 | Cert-SSB: Toward Certified Sample-Specific Backdoor Defense | Ting Qiao et.al. | 2504.21730 | **[link](https://github.com/ncepuqiaoting/cert-ssb)** |
2025-04-30 | Traceback of Poisoning Attacks to Retrieval-Augmented Generation | Baolei Zhang et.al. | 2504.21668 | null |
2025-04-30 | How to Backdoor the Knowledge Distillation | Chen Wu et.al. | 2504.21323 | null |
2025-04-29
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-29 | Erased but Not Forgotten: How Backdoors Compromise Concept Erasure | Jonas Henry Grebe et.al. | 2504.21072 | null |
2025-04-29 | FFCBA: Feature-based Full-target Clean-label Backdoor Attacks | Yangxu Yin et.al. | 2504.21054 | null |
2025-04-29 | SFIBA: Spatial-based Full-target Invisible Backdoor Attacks | Yangxu Yin et.al. | 2504.21052 | null |
2025-04-29 | GaussTrap: Stealthy Poisoning Attacks on 3D Gaussian Splatting for Targeted Scene Confusion | Jiaxin Hong et.al. | 2504.20829 | null |
2025-04-29 | Dynamic Attention Analysis for Backdoor Detection in Text-to-Image Diffusion Models | Zhongqi Wang et.al. | 2504.20518 | null |
2025-04-29 | BadMoE: Backdooring Mixture-of-Experts LLMs via Optimizing Routing Triggers and Infecting Dormant Experts | Qingyue Wang et.al. | 2504.18598 | null |
2025-04-28
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-28 | What's Pulling the Strings? Evaluating Integrity and Attribution in AI Training and Inference through Concept Shift | Jiamin Chang et.al. | 2504.21042 | null |
2025-04-25
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-25 | Intelligent Attacks and Defense Methods in Federated Learning-enabled Energy-Efficient Wireless Networks | Han Zhang et.al. | 2504.18519 | null |
2025-04-24
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-24 | Unsupervised Corpus Poisoning Attacks in Continuous Space for Dense Retrieval | Yongkang Li et.al. | 2504.17884 | null |
2025-04-24 | GRANITE : a Byzantine-Resilient Dynamic Gossip Learning Framework | Yacine Belal et.al. | 2504.17471 | null |
2025-04-24 | The Ultimate Cookbook for Invisible Poison: Crafting Subtle Clean-Label Text Backdoors with Style Attributes | Wencong You et.al. | 2504.17300 | null |
2025-04-23
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-23 | Robo-Troj: Attacking LLM-based Task Planners | Mohaiminul Al Nahian et.al. | 2504.17070 | null |
2025-04-23 | BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation | Ruotong Wang et.al. | 2504.16907 | null |
2025-04-22
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-22 | A Geometric Approach to Problems in Optimization and Data Science | Naren Sarayu Manoj et.al. | 2504.16270 | null |
2025-04-22 | OPUS-VFL: Incentivizing Optimal Privacy-Utility Tradeoffs in Vertical Federated Learning | Sindhuja Madabushi et.al. | 2504.15995 | null |
2025-04-22 | TrojanDam: Detection-Free Backdoor Defense in Federated Learning through Proactive Model Robustification utilizing OOD Data | Yanbo Dai et.al. | 2504.15674 | null |
2025-04-21
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-21 | Backdoor Defense in Diffusion Models via Spatial Attention Unlearning | Abha Jha et.al. | 2504.18563 | null |
2025-04-21 | BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models | Zhengxian Wu et.al. | 2504.13775 | null |
2025-04-21 | A Client-level Assessment of Collaborative Backdoor Poisoning in Non-IID Federated Learning | Phung Lai et.al. | 2504.12875 | null |
2025-04-20
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-20 | REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models | Chongye Guo et.al. | 2504.14554 | null |
2025-04-19
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-19 | WeiDetect: Weibull Distribution-Based Defense against Poisoning Attacks in Federated Learning for Network Intrusion Detection Systems | Sameera K. M. et.al. | 2504.04367 | null |
2025-04-17
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-17 | Strategic Planning of Stealthy Backdoor Attacks in Markov Decision Processes | Xinyi Wei et.al. | 2504.13276 | null |
2025-04-17 | ControlNET: A Firewall for RAG-based LLM System | Hongwei Yao et.al. | 2504.09593 | null |
2025-04-16
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-16 | Secure Transfer Learning: Training Clean Models Against Backdoor in (Both) Pre-trained Encoders and Downstream Datasets | Yechao Zhang et.al. | 2504.11990 | null |
2025-04-15
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-15 | Propaganda via AI? A Study on Semantic Backdoors in Large Language Models | Nay Myat Min et.al. | 2504.12344 | **[link](https://github.com/naymyatmin/raven)** |
2025-04-15 | Exploring Backdoor Attack and Defense for LLM-empowered Recommendations | Liangbo Ning et.al. | 2504.11182 | null |
2025-04-14
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-14 | Investigating cybersecurity incidents using large language models in latest-generation wireless networks | Leonid Legashev et.al. | 2504.13196 | null |
2025-04-14 | An Investigation of Large Language Models and Their Vulnerabilities in Spam Detection | Qiyao Tang et.al. | 2504.09776 | null |
2025-04-10
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-10 | Augmented Shuffle Protocols for Accurate and Robust Frequency Estimation under Differential Privacy | Takao Murakami et.al. | 2504.07362 | null |
2025-04-09
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-09 | Bridging the Gap Between Preference Alignment and Machine Unlearning | Xiaohua Feng et.al. | 2504.06659 | null |
2025-04-09 | Diversity-aware Dual-promotion Poisoning Attack on Sequential Recommendation | Yuchuan Zhao et.al. | 2504.06586 | null |
2025-04-08
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-08 | Exploiting Meta-Learning-based Poisoning Attacks for Graph Link Prediction | Mingchen Li et.al. | 2504.06492 | **[link](https://github.com/mingchenli/VGAE_Attack_Meta)** |
2025-04-08 | Defending Deep Neural Networks against Backdoor Attacks via Module Switching | Weijun Li et.al. | 2504.05902 | null |
2025-04-08 | Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models | Jiahao Chen et.al. | 2504.05815 | null |
2025-04-08 | ShadowCoT: Cognitive Hijacking for Stealthy Reasoning Backdoors in LLMs | Gejian Zhao et.al. | 2504.05605 | null |
2025-04-04
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-04 | Practical Poisoning Attacks against Retrieval-Augmented Generation | Baolei Zhang et.al. | 2504.03957 | null |
2025-04-04 | PPFPL: Cross-silo Privacy-preserving Federated Prototype Learning Against Data Poisoning Attacks on Non-IID Data | Hongliang Zhang et.al. | 2504.03173 | null |
2025-04-02
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-02 | One Pic is All it Takes: Poisoning Visual Document Retrieval Augmented Generation with a Single Image | Ezzeldin Shereen et.al. | 2504.02132 | null |
2025-04-02 | Sky of Unlearning (SoUL): Rewiring Federated Machine Unlearning via Selective Pruning | Md Mahabub Uz Zaman et.al. | 2504.01705 | null |
2025-03-31
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-31 | Backdoor Detection through Replicated Execution of Outsourced Training | Hengrui Jia et.al. | 2504.00170 | null |
2025-03-31 | A Channel-Triggered Backdoor Attack on Wireless Semantic Image Reconstruction | Jialin Wan et.al. | 2503.23866 | null |
2025-03-30
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-30 | Buffer is All You Need: Defending Federated Learning against Backdoor Attacks under Non-iids via Buffering | Xingyu Lyu et.al. | 2503.23511 | null |
2025-03-30 | Two Heads Are Better than One: Model-Weight and Latent-Space Analysis for Federated Learning on Non-iid Data against Poisoning Attacks | Xingyu Lyu et.al. | 2503.23288 | null |
2025-03-27
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-27 | Data Poisoning in Deep Learning: A Survey | Pinlong Zhao et.al. | 2503.22759 | **[link](https://github.com/pinlong-zhao/data-poisoning)** |
2025-03-27 | Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack | Cheng Wang et.al. | 2503.21315 | null |
2025-03-27 | DeBackdoor: A Deductive Framework for Detecting Backdoor Attacks on Deep Models with Limited Data | Dorde Popovic et.al. | 2503.21305 | null |
2025-03-27 | Clean Image May be Dangerous: Data Poisoning Attacks Against Deep Hashing | Shuai Li et.al. | 2503.21236 | null |
2025-03-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-26 | Robust Federated Learning Against Poisoning Attacks: A GAN-Based Defense Framework | Usama Zafar et.al. | 2503.20884 | **[link](https://github.com/SciML-FL/gan-filter)** |
2025-03-26 | How Secure is Forgetting? Linking Machine Unlearning to Machine Learning Attacks | Muhammed Shafi K. P. et.al. | 2503.20257 | null |
2025-03-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-24 | Deterministic Certification of Graph Neural Networks against Graph Poisoning Attacks with Arbitrary Perturbations | Jiate Li et.al. | 2503.18503 | null |
2025-03-22
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-22 | Towards Invisible Backdoor Attack on Text-to-Image Diffusion Model | Jie Zhang et.al. | 2503.17724 | **[link](https://github.com/robin-wzq/iba)** |
2025-03-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-21 | Large Language Models Can Verbatim Reproduce Long Malicious Sequences | Sharon Lin et.al. | 2503.17578 | null |
2025-03-20
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-20 | BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models | Zenghui Yuan et.al. | 2503.16023 | null |
2025-03-19
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-19 | Cyber Threats in Financial Transactions -- Addressing the Dual Challenge of AI and Quantum Computing | Ahmed M. Elmisery et.al. | 2503.15678 | null |
2025-03-19 | Test-Time Backdoor Detection for Object Detection Models | Hangtao Zhang et.al. | 2503.15293 | null |
2025-03-19 | OFL: Opportunistic Federated Learning for Resource-Heterogeneous and Privacy-Aware Devices | Yunlong Mao et.al. | 2503.15015 | null |
2025-03-19 | A Semantic and Clean-label Backdoor Attack against Graph Convolutional Networks | Jiazhu Dai et.al. | 2503.14922 | null |
2025-03-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-18 | Entente: Cross-silo Intrusion Detection on Network Log Graphs with Federated Learning | Jiacen Xu et.al. | 2503.14284 | null |
2025-03-18 | XOXO: Stealthy Cross-Origin Context Poisoning Attacks against AI Coding Assistants | Adam Štorek et.al. | 2503.14281 | null |
2025-03-17
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-17 | Optimizing ML Training with Metagradient Descent | Logan Engstrom et.al. | 2503.13751 | null |
2025-03-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-16 | One Goal, Many Challenges: Robust Preference Optimization Amid Content-Aware and Multi-Source Noise | Amirabbas Afzali et.al. | 2503.12301 | null |
2025-03-15
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-15 | Revisiting Training-Inference Trigger Intensity in Backdoor Attacks | Chenhao Lin et.al. | 2503.12058 | **[link](https://github.com/cv12ha0/TITIM)** |
2025-03-15 | Revisiting Backdoor Attacks on Time Series Classification in the Frequency Domain | Yuanmin Huang et.al. | 2503.09712 | null |
2025-03-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-14 | Trust Under Siege: Label Spoofing Attacks against Machine Learning for Android Malware Detection | Tianwei Lan et.al. | 2503.11841 | null |
2025-03-13
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-13 | Targeted Data Poisoning for Black-Box Audio Datasets Ownership Verification | Wassim Bouaziz et.al. | 2503.10269 | null |
2025-03-13 | Policy Teaching via Data Poisoning in Learning from Human Preferences | Andi Nika et.al. | 2503.10228 | null |
2025-03-12
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-12 | Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models | Sangwon Jang et.al. | 2503.09669 | null |
2025-03-12 | Stealthy Patch-Wise Backdoor Attack in 3D Point Cloud via Curvature Awareness | Yu Feng et.al. | 2503.09336 | null |
2025-03-12 | Detecting and Preventing Data Poisoning Attacks on AI Models | Halima I. Kure et.al. | 2503.09302 | null |
2025-03-12 | C^2 ATTACK: Towards Representation Backdoor on CLIP via Concept Confusion | Lijie Hu et.al. | 2503.09095 | null |
2025-03-12 | Adaptive Backdoor Attacks with Reasonable Constraints on Graph Neural Networks | Xuewen Dong et.al. | 2503.09049 | null |
2025-03-12 | Not All Edges are Equally Robust: Evaluating the Robustness of Ranking-Based Federated Learning | Zirui Gong et.al. | 2503.08976 | null |
Generative models safety
2025-06-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-18 | LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning | Gabrel J. Perin et.al. | 2506.15606 | **[link](https://github.com/vita-group/lox)** |
2025-06-17
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-17 | Safe-Child-LLM: A Developmental Benchmark for Evaluating LLM Safety in Child-LLM Interactions | Junfeng Jiao et.al. | 2506.13510 | **[link](https://github.com/the-responsible-ai-initiative/safe_child_llm_benchmark)** |
2025-06-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-16 | We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems | Junfeng Fang et.al. | 2506.13666 | **[link](https://github.com/littlelittlenine/safemcp)** |
2025-06-15
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-15 | SecurityLingua: Efficient Defense of LLM Jailbreak Attacks via Security-Aware Prompt Compression | Yucheng Li et.al. | 2506.12707 | null |
2025-06-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-14 | QGuard: Question-based Zero-shot Guard for Multi-modal LLM Safety | Taegyeong Lee et.al. | 2506.12299 | null |
2025-06-14 | Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors | Chen Yueh-Han et.al. | 2506.10949 | **[link](https://github.com/yuehhanchen/monitoring-decomposition-attack)** |
2025-06-09
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-09 | When Style Breaks Safety: Defending Language Models Against Superficial Style Alignment | Yuxin Xiao et.al. | 2506.07452 | **[link](https://github.com/xiaoyuxin1002/SafeStyle)** |
2025-06-09 | Beyond Jailbreaks: Revealing Stealthier and Broader LLM Security Risks Stemming from Alignment Failures | Yukai Zhou et.al. | 2506.07402 | null |
2025-06-09 | Refusal-Feature-guided Teacher for Safe Finetuning via Data Filtering and Alignment Distillation | Seokil Ham et.al. | 2506.07356 | null |
2025-06-08
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-08 | Quality-Diversity Red-Teaming: Automated Generation of High-Quality and Diverse Attackers for Large Language Models | Ren-Jian Wang et.al. | 2506.07121 | null |
2025-06-08 | AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint | Leheng Sheng et.al. | 2506.07022 | **[link](https://github.com/alphalab-ustc/alphasteer)** |
2025-06-07
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-07 | SafeLawBench: Towards Safe Alignment of Large Language Models | Chuxue Cao et.al. | 2506.06636 | null |
2025-06-06
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-06 | The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs | Songyang Liu et.al. | 2506.11094 | null |
2025-06-06 | Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance | Ruizhong Qiu et.al. | 2506.06444 | **[link](https://github.com/q-rz/saffron)** |
2025-06-05
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-05 | Interpretation Meets Safety: A Survey on Interpretation Methods and Tools for Improving LLM Safety | Seongmin Lee et.al. | 2506.05451 | null |
2025-06-05 | Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets | Lei Hsiung et.al. | 2506.05346 | null |
2025-06-05 | Evaluating Prompt-Driven Chinese Large Language Models: The Influence of Persona Assignment on Stereotypes and Safeguards | Geng Liu et.al. | 2506.04975 | null |
2025-06-04
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-04 | Should LLM Safety Be More Than Refusing Harmful Instructions? | Utsav Maskey et.al. | 2506.02442 | null |
2025-06-02
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-02 | ReGA: Representation-Guided Abstraction for Model-based Safeguarding of LLMs | Zeming Wei et.al. | 2506.01770 | **[link](https://github.com/weizeming/rega)** |
2025-05-31
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-31 | SafeTuneBed: A Toolkit for Benchmarking LLM Safety Alignment in Fine-Tuning | Saad Hossain et.al. | 2506.00676 | null |
2025-05-30
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-30 | Learning Safety Constraints for Large Language Models | Xin Chen et.al. | 2505.24445 | **[link](https://github.com/lasgroup/safetypolytope)** |
2025-05-30 | The State of Multilingual LLM Safety Research: From Measuring the Language Gap to Mitigating It | Zheng-Xin Yong et.al. | 2505.24119 | null |
2025-05-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-26 | Safety Through Reasoning: An Empirical Study of Reasoning Guardrail Models | Makesh Narsimhan Sreedhar et.al. | 2505.20087 | null |
2025-05-26 | PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks | Guobin Shen et.al. | 2505.13862 | **[link](https://github.com/beijing-aisi/panda-guard)** |
2025-05-25
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-25 | Benign Samples Matter! Fine-tuning On Outlier Benign Samples Severely Breaks Safety | Zihan Guan et.al. | 2505.06843 | **[link](https://github.com/guanzihan/benign-samples-matter)** |
2025-05-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-24 | Exploring the Vulnerability of the Content Moderation Guardrail in Large Language Models via Intent Manipulation | Jun Zhuang et.al. | 2505.18556 | null |
2025-05-23
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Chain-of-Lure: A Synthetic Narrative-Driven Approach to Compromise Large Language Models | Wenhan Chang et.al. | 2505.17519 | null |
2025-05-22
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-22 | Shape it Up! Restoring LLM Safety during Finetuning | ShengYun Peng et.al. | 2505.17196 | null |
2025-05-22 | Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models | Junjie Xiong et.al. | 2505.16957 | null |
2025-05-22 | MixAT: Combining Continuous and Discrete Adversarial Training for LLMs | Csaba Dékány et.al. | 2505.16947 | **[link](https://github.com/insait-institute/mixat)** |
2025-05-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-21 | Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering | Hwan Chang et.al. | 2505.15805 | **[link](https://github.com/hwanchang00/CoPriva)** |
2025-05-21 | Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval | Taiye Chen et.al. | 2505.15753 | null |
2025-05-20
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-20 | Evaluating the efficacy of LLM Safety Solutions: The Palit Benchmark Dataset | Sayon Palit et.al. | 2505.13028 | null |
2025-05-20 | MrGuard: A Multilingual Reasoning Guardrail for Universal LLM Safety | Yahan Yang et.al. | 2504.15241 | null |
2025-05-19
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-19 | Concept-Level Explainability for Auditing & Steering LLM Responses | Kenza Amara et.al. | 2505.07610 | **[link](https://github.com/k-amara/ConceptX)** |
2025-05-19 | A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment | Kun Wang et.al. | 2504.15585 | null |
2025-05-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-18 | Logic Jailbreak: Efficiently Unlocking LLM Safety Restrictions Through Formal Logical Expression | Jingyu Peng et.al. | 2505.13527 | null |
2025-05-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-16 | CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs | Sijia Chen et.al. | 2505.11413 | null |
2025-05-16 | PIG: Privacy Jailbreak Attack on LLMs via Gradient-based Iterative In-Context Optimization | Yidan Wang et.al. | 2505.09921 | **[link](https://github.com/redwyd/privacyjailbreak)** |
2025-05-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-14 | Understanding Gen Alpha Digital Language: Evaluation of LLM Safety Systems for Content Moderation | Manisha Mehta et.al. | 2505.10588 | null |
2025-05-03
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-03 | Cannot See the Forest for the Trees: Invoking Heuristics and Biases to Elicit Irrational Choices of LLMs | Haoming Yang et.al. | 2505.02862 | null |
2025-04-30
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-30 | Hoist with His Own Petard: Inducing Guardrails to Facilitate Denial-of-Service Attacks on Retrieval-Augmented Generation of LLMs | Pan Suo et.al. | 2504.21680 | null |
2025-04-28
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-28 | Prefill-Based Jailbreak: A Novel Approach of Bypassing LLM Safety Boundary | Yakai Li et.al. | 2504.21038 | **[link](https://github.com/star5o/Prefill-based-Jailbreak)** |
2025-04-28 | SAGE: A Generic Framework for LLM Safety Evaluation | Madhur Jindal et.al. | 2504.19674 | **[link](https://github.com/Madhur-1/SageLLMSafetyEval)** |
2025-04-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-26 | Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search | Andy Zhou et.al. | 2503.10619 | null |
2025-04-23
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-23 | Safety Pretraining: Toward the Next Generation of Safe AI | Pratyush Maini et.al. | 2504.16980 | null |
2025-04-23 | aiXamine: Simplified LLM Safety and Security | Fatih Deniz et.al. | 2504.14985 | null |
2025-04-22
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-22 | Red Team Diffuser: Exposing Toxic Continuation Vulnerabilities in Vision-Language Models via Reinforcement Learning | Ruofan Wang et.al. | 2503.06223 | null |
2025-04-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-21 | RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search | Quy-Anh Dang et.al. | 2504.15047 | **[link](https://github.com/knoveleng/rainbowplus)** |
2025-04-17
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-17 | GraphAttack: Exploiting Representational Blindspots in LLM Safety Mechanisms | Sinan He et.al. | 2504.13052 | null |
2025-04-13
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-13 | SaRO: Enhancing LLM Safety through Reasoning-based Alignment | Yutao Mou et.al. | 2504.09420 | null |
2025-04-11
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-11 | SAEs *Can* Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs | Aashiq Muhamed et.al. | 2504.08192 | null |
2025-03-31
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-31 | *Agents Under Siege*: Breaking Pragmatic Multi-Agent LLM Systems with Optimized Prompt Attacks | Rana Muhammad Shahroz Khan et.al. | 2504.00218 | null |
2025-03-31 | Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms | Shuoming Zhang et.al. | 2503.24191 | null |
2025-03-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-14 | LLMs for Translation: Historical, Low-Resourced Languages and Contemporary AI Models | Merve Tekgurler et.al. | 2503.11898 | null |
2025-03-14 | Align in Depth: Defending Jailbreak Attacks via Progressive Answer Detoxification | Yingjie Zhang et.al. | 2503.11185 | null |
2025-03-12
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-12 | How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation | Ruohao Guo et.al. | 2503.09598 | **[link](https://github.com/octaviaguo/EchoMist)** |
2025-03-10
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-10 | Safety Guardrails for LLM-Enabled Robots | Zachary Ravichandran et.al. | 2503.07885 | null |
2025-03-10 | Graphormer-Guided Task Planning: Beyond Static Rules with LLM Safety Perception | Wanjing Huang et.al. | 2503.06866 | **[link](https://github.com/hwj20/ggtp)** |
2025-03-06
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-06 | Safety is Not Only About Refusal: Reasoning-Enhanced Fine-tuning for Interpretable LLM Safety | Yuyou Zhang et.al. | 2503.05021 | null |
2025-03-06 | One-Shot is Enough: Consolidating Multi-Turn Attacks into Efficient Single-Turn Prompts for LLMs | Junwoo Ha et.al. | 2503.04856 | null |
2025-03-06 | Know Thy Judge: On the Robustness Meta-Evaluation of LLM Safety Judges | Francisco Eiras et.al. | 2503.04474 | null |
2025-03-05
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-05 | Improving LLM Safety Alignment with Dual-Objective Optimization | Xuandong Zhao et.al. | 2503.03710 | **[link](https://github.com/wicai24/door-alignment)** |
2025-03-04
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-04 | LLM-Safety Evaluations Lack Robustness | Tim Beyer et.al. | 2503.02574 | null |
2025-03-04 | Adversarial Tokenization | Renato Lui Geh et.al. | 2503.02174 | null |
2025-03-02
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-02 | Output Length Effect on DeepSeek-R1's Safety in Forced Thinking | Xuying Li et.al. | 2503.01923 | null |
2025-02-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-26 | JailBench: A Comprehensive Chinese Security Assessment Benchmark for Large Language Models | Shuyi Liu et.al. | 2502.18935 | null |
2025-02-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-24 | LongSafety: Evaluating Long-Context Safety of Large Language Models | Yida Lu et.al. | 2502.16971 | **[link](https://github.com/thu-coai/longsafety)** |
2025-02-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-21 | SafeInt: Shielding Large Language Models from Jailbreak Attacks via Safety-Aware Representation Intervention | Jiaqi Wu et.al. | 2502.15594 | null |
2025-02-21 | Attention Eclipse: Manipulating Attention to Bypass LLM Safety-Alignment | Pedram Zaree et.al. | 2502.15334 | null |
2025-02-20
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-20 | Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models | Yeonjun In et.al. | 2502.15086 | **[link](https://github.com/yeonjun-in/u-safebench)** |
2025-02-19
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-19 | Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region | Chak Tou Leong et.al. | 2502.13946 | null |
2025-02-19 | Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts | Maiya Goloburda et.al. | 2502.13640 | null |
Data privacy
2025-06-25
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-25 | WallStreetFeds: Client-Specific Tokens as Investment Vehicles in Federated Learning | Arno Geimer et.al. | 2506.20518 | null |
2025-06-25 | Client Clustering Meets Knowledge Sharing: Enhancing Privacy and Robustness in Personalized Peer-to-Peer Learning | Mohammad Mahdi Maheri et.al. | 2506.20413 | null |
2025-06-25 | FedBKD: Distilled Federated Learning to Embrace Gerneralization and Personalization on Non-IID Data | Yushan Zhao et.al. | 2506.20245 | null |
2025-06-25 | Personalized Mental State Evaluation in Human-Robot Interaction using Federated Learning | Andrea Bussolan et.al. | 2506.20212 | null |
2025-06-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-24 | Progressive Size-Adaptive Federated Learning: A Comprehensive Framework for Heterogeneous Multi-Modal Data Systems | Sajid Hussain et.al. | 2506.20685 | null |
2025-06-24 | ReBoot: Encrypted Training of Deep Neural Networks with CKKS Bootstrapping | Alberto Pirillo et.al. | 2506.19693 | null |
2025-06-24 | Automated Detection of Pre-training Text in Black-box LLMs | Ruihan Hu et.al. | 2506.19399 | null |
2025-06-24 | Behavioral Anomaly Detection in Distributed Systems via Federated Contrastive Learning | Renzi Meng et.al. | 2506.19246 | null |
2025-06-24 | FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models | Xinting Liao et.al. | 2506.16218 | null |
2025-06-23
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-23 | SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications | Jinyang Li et.al. | 2506.18951 | null |
2025-06-23 | Federated Learning from Molecules to Processes: A Perspective | Jan G. Rittig et.al. | 2506.18525 | null |
2025-06-22
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-22 | Federated Learning-Based Data Collaboration Method for Enhancing Edge Cloud AI System Security Using Large Language Models | Huaiying Luo et.al. | 2506.18087 | null |
2025-06-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-21 | Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs | Yiwei Chen et.al. | 2506.14003 | **[link](https://github.com/optml-group/unlearn-trace)** |
2025-06-20
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | AI based Content Creation and Product Recommendation Applications in E-commerce: An Ethical overview | Aditi Madhusudan Jain et.al. | 2506.17370 | null |
2025-06-19
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-19 | SecureFed: A Two-Phase Framework for Detecting Malicious Clients in Federated Learning | Likhitha Annapurna Kavuri et.al. | 2506.16458 | null |
2025-06-19 | Leveraging Optimal Transport for Distributed Two-Sample Testing: An Integrated Transportation Distance-based Framework | Zhengqi Lin et.al. | 2506.16047 | null |
2025-06-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-18 | Tracking GPTs Third Party Service: Automation, Analysis, and Insights | Chuan Yan et.al. | 2506.17315 | null |
2025-06-18 | PNCS: Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning | Liangyan Li et.al. | 2506.15923 | null |
2025-06-18 | Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for Alzheimer's and Dementia Caregivers | Jiayue Melissa Shi et.al. | 2506.15047 | null |
2025-06-17
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-17 | SoK: Privacy-Enhancing Technologies in Artificial Intelligence | Nouha Oualha et.al. | 2506.14576 | null |
2025-06-17 | CausalDiffTab: Mixed-Type Causal-Aware Diffusion for Tabular Data Generation | Jia-Chen Zhang et.al. | 2506.14206 | null |
2025-06-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-16 | ReinDSplit: Reinforced Dynamic Split Learning for Pest Recognition in Precision Agriculture | Vishesh Kumar Tanwar et.al. | 2506.13935 | null |
2025-06-15
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-15 | Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption | Nina Cai et.al. | 2506.12846 | null |
2025-06-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-14 | Real-Time, Low-Latency Surveillance Using Entropy-Based Adaptive Buffering and MobileNetV2 on Edge Devices | Poojashree Chandrashekar Pankaj M Sajjanar et.al. | 2506.14833 | null |
2025-06-14 | OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics | Vineeth Dorna et.al. | 2506.12618 | **[link](https://github.com/locuslab/open-unlearning)** |
2025-06-13
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-13 | MRI-CORE: A Foundation Model for Magnetic Resonance Imaging | Haoyu Dong et.al. | 2506.12186 | null |
2025-06-13 | AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction | Syeda Kisaa Fatima et.al. | 2506.11475 | null |
2025-06-12
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-12 | Federated Learning within Global Energy Budget over Heterogeneous Edge Accelerators | Roopkatha Banerjee et.al. | 2506.10413 | null |
2025-06-11
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-11 | Knockoffs Inference under Privacy Constraints | Zhanrui Cai et.al. | 2506.09690 | null |
2025-06-11 | Wavelet Scattering Transform and Fourier Representation for Offline Detection of Malicious Clients in Federated Learning | Alessandro Licciardi et.al. | 2506.09674 | null |
2025-06-11 | A Survey on the Role of Artificial Intelligence and Machine Learning in 6G-V2X Applications | Donglin Wang et.al. | 2506.09512 | null |
2025-06-10
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-10 | DIsoN: Decentralized Isolation Networks for Out-of-Distribution Detection in Medical Imaging | Felix Wagner et.al. | 2506.09024 | null |
2025-06-10 | A Privacy-Preserving Federated Learning Framework for Generalizable CBCT to Synthetic CT Translation in Head and Neck | Ciro Benito Raggio et.al. | 2506.08654 | null |
2025-06-09
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-09 | Private Memorization Editing: Turning Memorization into a Defense to Strengthen Data Privacy in Large Language Models | Elena Sofia Ruzzetti et.al. | 2506.10024 | **[link](https://github.com/elenasofia98/pme)** |
2025-06-09 | FedGA-Tree: Federated Decision Tree using Genetic Algorithm | Anh V Nguyen et.al. | 2506.08176 | null |
2025-06-09 | A Systematic Literature Review on Continuous Integration and Deployment (CI/CD) for Secure Cloud Computing | Sabbir M. Saleh et.al. | 2506.08055 | null |
2025-06-09 | Understanding the Error Sensitivity of Privacy-Aware Computing | Matías Mazzanti et.al. | 2506.07957 | null |
2025-06-09 | Federated In-Context Learning: Iterative Refinement for Improved Answer Quality | Ruhan Wang et.al. | 2506.07440 | null |
2025-06-08
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-08 | Patient Similarity Computation for Clinical Decision Support: An Efficient Use of Data Transformation, Combining Static and Time Series Data | Joydeb Kumar Sana et.al. | 2506.07092 | null |
2025-06-07
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-07 | In-Sensor Motion Recognition with Memristive System and Light Sensing Surfaces | Hritom Das et.al. | 2506.06829 | null |
2025-06-07 | LADSG: Label-Anonymized Distillation and Similar Gradient Substitution for Label Privacy in Vertical Federated Learning | Zeyu Yan et.al. | 2506.06742 | null |
2025-06-07 | Fuse and Federate: Enhancing EV Charging Station Security with Multimodal Fusion and Federated Learning | Rabah Rahal et.al. | 2506.06730 | null |
2025-06-06
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-06 | Privacy Perspectives and Practices of Chinese Smart Home Product Teams | Shijing He et.al. | 2506.06591 | null |
2025-06-06 | Reinforcement Learning for Autonomous Warehouse Orchestration in SAP Logistics Execution: Redefining Supply Chain Agility | Sumanth Pillella et.al. | 2506.06523 | null |
2025-06-06 | A Certified Unlearning Approach without Access to Source Data | Umit Yigit Basaran et.al. | 2506.06486 | null |
2025-06-06 | Direct Behavior Optimization: Unlocking the Potential of Lightweight LLMs | Hongming Yang et.al. | 2506.06401 | null |
2025-06-06 | Towards Lifecycle Unlearning Commitment Management: Measuring Sample-level Unlearning Completeness | Cheng-Long Wang et.al. | 2506.06112 | **[link](https://github.com/happy2git/unlearning_inference_iam)** |
2025-06-06 | Simple Yet Effective: Extracting Private Data Across Clients in Federated Fine-Tuning of Large Language Models | Yingqi Hu et.al. | 2506.06060 | null |
2025-06-06 | Mitigating Catastrophic Forgetting with Adaptive Transformer Block Expansion in Federated Fine-Tuning | Yujia Huo et.al. | 2506.05977 | null |
2025-06-06 | Small Models, Big Support: A Local LLM Framework for Teacher-Centric Content Creation and Assessment using RAG and CAG | Zarreen Reza et.al. | 2506.05925 | null |
2025-06-06 | When Better Features Mean Greater Risks: The Performance-Privacy Trade-Off in Contrastive Learning | Ruining Sun et.al. | 2506.05743 | null |
2025-06-06 | FedShield-LLM: A Secure and Scalable Federated Fine-Tuned Large Language Model | Md Jueal Mia et.al. | 2506.05640 | null |
2025-06-06 | Rethinking Machine Unlearning in Image Generation Models | Renyang Liu et.al. | 2506.02761 | **[link](https://github.com/ryliu68/igmu)** |
2025-06-06 | Federated Foundation Model for GI Endoscopy Images | Alina Devkota et.al. | 2505.24108 | null |
2025-06-05
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-05 | Federated Isolation Forest for Efficient Anomaly Detection on Edge IoT Systems | Pavle Vasiljevic et.al. | 2506.05138 | null |
2025-06-05 | Software Bill of Materials in Software Supply Chain Security A Systematic Literature Review | Eric O'Donoghue et.al. | 2506.03507 | null |
2025-06-04
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-04 | Privacy and Security Threat for OpenAI GPTs | Wei Wenying et.al. | 2506.04036 | null |
2025-06-04 | PC-MoE: Memory-Efficient and Privacy-Preserving Collaborative Training for Mixture-of-Experts LLMs | Ze Yu Zhang et.al. | 2506.02965 | null |
2025-06-03
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-03 | An Algorithmic Pipeline for GDPR-Compliant Healthcare Data Anonymisation: Moving Toward Standardisation | Hamza Khan et.al. | 2506.02942 | null |
2025-06-03 | ThinkTank: A Framework for Generalizing Domain-Specific AI Agent Systems into Universal Collaborative Intelligence Platforms | Praneet Sai Madhu Surabhi et.al. | 2506.02931 | **[link](https://github.com/taugroup/thinktank)** |
2025-06-03 | CLONE: Customizing LLMs for Efficient Latency-Aware Inference at the Edge | Chunlin Tian et.al. | 2506.02847 | null |
2025-06-03 | Decentralized COVID-19 Health System Leveraging Blockchain | Lingsheng Chen et.al. | 2506.02674 | null |
2025-06-03 | From Prompts to Protection: Large Language Model-Enabled In-Context Learning for Smart Public Safety UAV | Yousef Emami et.al. | 2506.02649 | null |
2025-06-03 | State Similarity in Modular Superconducting Quantum Processors with Classical Communications | Bujiao Wu et.al. | 2506.01657 | null |
2025-06-02
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-02 | Fingerprinting Deep Learning Models via Network Traffic Patterns in Federated Learning | Md Nahid Hasan Shuvo et.al. | 2506.03207 | null |
2025-06-01
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-01 | Addressing the Collaboration Dilemma in Low-Data Federated Learning via Transient Sparsity | Qiao Xiao et.al. | 2506.00932 | null |
2025-05-31
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-31 | Blockchain Powered Edge Intelligence for U-Healthcare in Privacy Critical and Time Sensitive Environment | Anum Nawaz et.al. | 2506.02038 | null |
2025-05-31 | PSI-PFL: Population Stability Index for Client Selection in non-IID Personalized Federated Learning | Daniel-M. Jimenez-Gutierrez et.al. | 2506.00440 | null |
2025-05-31 | Hybrid Cloud Security: Balancing Performance, Cost, and Compliance in Multi-Cloud Deployments | Anjani kumar Polinati et.al. | 2506.00426 | null |
2025-05-31 | SHARE: An SLM-based Hierarchical Action CorREction Assistant for Text-to-SQL | Ge Qu et.al. | 2506.00391 | null |
2025-05-30
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-30 | Structuring Radiology Reports: Challenging LLMs with Lightweight Models | Johannes Moll et.al. | 2506.00200 | null |
2025-05-30 | Position: Federated Foundation Language Model Post-Training Should Focus on Open-Source Models | Nikita Agrawal et.al. | 2505.23593 | null |
2025-05-29
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-29 | Adaptive Deadline and Batch Layered Synchronized Federated Learning | Asaf Goren et.al. | 2505.23973 | null |
2025-05-29 | Does Machine Unlearning Truly Remove Model Knowledge? A Framework for Auditing Unlearning in LLMs | Haokun Chen et.al. | 2505.23270 | null |
2025-05-29 | Loss-Guided Model Sharing and Local Learning Correction in Decentralized Federated Learning for Crop Disease Classification | Denis Mamba Kabala et.al. | 2505.23063 | null |
2025-05-29 | Deep Modeling and Optimization of Medical Image Classification | Yihang Wu et.al. | 2505.23040 | **[link](https://github.com/aipmlab/skincancersimulation)** |
2025-05-29 | EL4NER: Ensemble Learning for Named Entity Recognition via Multiple Small-Parameter Large Language Models | Yuzhen Xiao et.al. | 2505.23038 | null |
2025-05-28
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-28 | TensorShield: Safeguarding On-Device Inference by Shielding Critical DNN Tensors with TEE | Tong Sun et.al. | 2505.22735 | null |
2025-05-28 | Evolution of repositories and privacy laws: commit activities in the GDPR and CCPA era | Georgia M. Kapitsaki et.al. | 2505.22234 | null |
2025-05-27
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-27 | DP-RTFL: Differentially Private Resilient Temporal Federated Learning for Trustworthy AI in Regulated Industries | Abhijit Talluri et.al. | 2505.23813 | null |
2025-05-27 | MedOrchestra: A Hybrid Cloud-Local LLM Approach for Clinical Data Interpretation | Sihyeon Lee et.al. | 2505.23806 | null |
2025-05-27 | StreamLink: Large-Language-Model Driven Distributed Data Engineering System | Dawei Feng et.al. | 2505.21575 | null |
2025-05-27 | Federated Instrumental Variable Analysis via Federated Generalized Method of Moments | Geetika et.al. | 2505.21012 | null |
2025-05-27 | Facial Attribute Based Text Guided Face Anonymization | Mustafa İzzet Muştu et.al. | 2505.21002 | null |
2025-05-27 | Generalized and Personalized Federated Learning with Foundation Models via Orthogonal Transformations | Eun Gyung Kong et.al. | 2505.19888 | null |
2025-05-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-26 | SEMFED: Semantic-Aware Resource-Efficient Federated Learning for Heterogeneous NLP Tasks | Sajid Hussain et.al. | 2505.23801 | null |
2025-05-26 | LAPA-based Dynamic Privacy Optimization for Wireless Federated Learning in Heterogeneous Environments | Pengcheng Sun et.al. | 2505.19823 | null |
2025-05-26 | Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments | Junming Liu et.al. | 2505.19699 | null |
2025-05-25
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-25 | Cellular Traffic Prediction via Byzantine-robust Asynchronous Federated Learning | Hui Ma et.al. | 2505.19263 | **[link](https://github.com/maggiemh/bafdp)** |
2025-05-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-24 | Understanding the Relationship Between Personal Data Privacy Literacy and Data Privacy Information Sharing by University Students | Brady D. Lund et.al. | 2505.18870 | null |
2025-05-24 | Anonymity-washing | Szilvia Lestyán et.al. | 2505.18627 | null |
2025-05-23
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Temporal Restoration and Spatial Rewiring for Source-Free Multivariate Time Series Domain Adaptation | Peiliang Gong et.al. | 2505.21525 | null |
2025-05-23 | Retrieval Augmented Generation-based Large Language Models for Bridging Transportation Cybersecurity Legal Knowledge Gaps | Khandakar Ashrafi Akbar et.al. | 2505.18426 | null |
2025-05-23 | RedactOR: An LLM-Powered Framework for Automatic Clinical Data De-Identification | Praphul Singh et.al. | 2505.18380 | null |
2025-05-23 | WiNGPT-3.0 Technical Report | Boqin Zhuang et.al. | 2505.17387 | null |
2025-05-22
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-22 | LLM Access Shield: Domain-Specific LLM Framework for Privacy Policy Compliance | Yu Wang et.al. | 2505.17145 | null |
2025-05-22 | Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks | Hongyuan Tao et.al. | 2505.16901 | null |
2025-05-22 | ATR-Bench: A Federated Learning Benchmark for Adaptation, Trust, and Reasoning | Tajamul Ashraf et.al. | 2505.16850 | **[link](https://github.com/tajamul21/atr-bench)** |
2025-05-22 | From Local Patterns to Global Understanding: Cross-Stock Trend Integration for Enhanced Predictive Modeling | Yi Hu et.al. | 2505.16573 | null |
2025-05-22 | A Two-Stage Data Selection Framework for Data-Efficient Model Training on Edge Devices | Chen Gong et.al. | 2505.16563 | null |
2025-05-22 | Interpretable Anomaly Detection in Encrypted Traffic Using SHAP with Machine Learning Models | Kalindi Singh et.al. | 2505.16261 | null |
2025-05-22 | Enhancing Federated Survival Analysis through Peer-Driven Client Reputation in Healthcare | Navid Seidi et.al. | 2505.16190 | null |
2025-05-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-21 | Are LLMs reliable? An exploration of the reliability of large language models in clinical note generation | Kristine Ann M. Carandang et.al. | 2505.17095 | null |
2025-05-21 | Federated Learning-Enhanced Blockchain Framework for Privacy-Preserving Intrusion Detection in Industrial IoT | Anas Ali et.al. | 2505.15376 | null |
2025-05-21 | EC-LDA : Label Distribution Inference Attack against Federated Graph Learning with Embedding Compression | Tong Cheng et.al. | 2505.15140 | null |
2025-05-21 | A Survey On Secure Machine Learning | Taobo Liao et.al. | 2505.15124 | null |
2025-05-20
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-20 | Listen, Analyze, and Adapt to Learn New Attacks: An Exemplar-Free Class Incremental Learning Method for Audio Deepfake Source Tracing | Yang Xiao et.al. | 2505.14601 | null |
2025-05-20 | Federated prediction for scalable and privacy-preserved knowledge-based planning in radiotherapy | Jingyun Chen et.al. | 2505.14507 | null |
2025-05-20 | CE-LSLM: Efficient Large-Small Language Model Inference and Communication via Cloud-Edge Collaboration | Pengyan Zhu et.al. | 2505.14085 | null |
2025-05-20 | FedGraM: Defending Against Untargeted Attacks in Federated Learning via Embedding Gram Matrix | Di Wu et.al. | 2505.14024 | null |
2025-05-20 | Zk-SNARK for String Match | Taoran Li et.al. | 2505.13964 | **[link](https://github.com/taobol2/CS407_Project)** |
2025-05-19
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-19 | Advancing Software Quality: A Standards-Focused Review of LLM-Based Assurance Techniques | Avinash Patil et.al. | 2505.13766 | null |
2025-05-19 | FedCTTA: A Collaborative Approach to Continual Test-Time Adaptation in Federated Learning | Rakibul Hasan Rajib et.al. | 2505.13643 | null |
2025-05-19 | Exploring Federated Pruning for Large Language Models | Pengxin Guo et.al. | 2505.13547 | **[link](https://github.com/pengxin-guo/fedprllm)** |
2025-05-19 | DynaNoise: Dynamic Probabilistic Noise Injection for Defending Against Membership Inference Attacks | Javad Forough et.al. | 2505.13362 | null |
2025-05-19 | Cross-Cloud Data Privacy Protection: Optimizing Collaborative Mechanisms of AI Systems by Integrating Federated Learning and LLMs | Huaiying Luo et.al. | 2505.13292 | null |
2025-05-19 | Unlearning for Federated Online Learning to Rank: A Reproducibility Study | Yiling Tao et.al. | 2505.12791 | **[link](https://github.com/iris1026/unlearning-for-foltr)** |
2025-05-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-18 | A Comprehensive Review of Techniques, Algorithms, Advancements, Challenges, and Clinical Applications of Multi-modal Medical Image Fusion for Improved Diagnosis | Muhammad Zubair et.al. | 2505.14715 | null |
2025-05-18 | PoLO: Proof-of-Learning and Proof-of-Ownership at Once with Chained Watermarking | Haiyu Deng et.al. | 2505.12296 | null |
2025-05-18 | ACU: Analytic Continual Unlearning for Efficient and Exact Forgetting with Privacy Preservation | Jianheng Tang et.al. | 2505.12239 | null |
2025-05-18 | Enhancing the Performance of Global Model by Improving the Adaptability of Local Models in Federated Learning | Wujun Zhou et.al. | 2505.10125 | null |
2025-05-17
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-17 | Federated Deep Reinforcement Learning for Privacy-Preserving Robotic-Assisted Surgery | Sana Hafeez et.al. | 2505.12153 | null |
2025-05-17 | FedHQ: Hybrid Runtime Quantization for Federated Learning | Zihao Zheng et.al. | 2505.11982 | null |
2025-05-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-16 | Federated Low-Rank Adaptation for Foundation Models: A Survey | Yiyuan Yang et.al. | 2505.13502 | null |
2025-05-16 | Verifiably Forgotten? Gradient Differences Still Enable Data Reconstruction in Federated Unlearning | Fuyao Zhang et.al. | 2505.11097 | null |
2025-05-16 | Nosy Layers, Noisy Fixes: Tackling DRAs in Federated Learning Systems using Explainable AI | Meghali Nandi et.al. | 2505.10942 | null |
2025-05-16 | Privacy-Aware Lifelong Learning | Ozan Özdenizci et.al. | 2505.10941 | null |
2025-05-16 | Convergence Analysis of the Last Iterate in Distributed Stochastic Gradient Descent with Momentum | Difei Cheng et.al. | 2505.10889 | null |
2025-05-15
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-15 | Locally Differentially Private Frequency Estimation via Joint Randomized Response | Ye Zheng et.al. | 2505.10349 | **[link](https://github.com/ZhengYeah/JRR)** |
2025-05-15 | Private Transformer Inference in MLaaS: A Survey | Yang Li et.al. | 2505.10315 | null |
2025-05-15 | Cutting Through Privacy: A Hyperplane-Based Data Reconstruction Attack in Federated Learning | Francesco Diana et.al. | 2505.10264 | null |
2025-05-15 | Robust Federated Learning on Edge Devices with Domain Heterogeneity | Huy Q. Le et.al. | 2505.10128 | null |
2025-05-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-14 | Robust Federated Learning with Confidence-Weighted Filtering and GAN-Based Completion under Noisy and Incomplete Data | Alpaslan Gokcen et.al. | 2505.09733 | null |
2025-05-14 | FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization | Xiaoyang Yu et.al. | 2505.09385 | null |
2025-05-13
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-13 | AI and Generative AI Transforming Disaster Management: A Survey of Damage Assessment and Response Techniques | Aman Raj et.al. | 2505.08202 | null |
2025-05-12
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-12 | A Federated Random Forest Solution for Secure Distributed Machine Learning | Alexandre Cotorobai et.al. | 2505.08085 | **[link](https://github.com/ieeta-pt/fed_rf)** |
2025-05-12 | Privacy-Preserving Real-Time Vietnamese-English Translation on iOS using Edge AI | Cong Le et.al. | 2505.07583 | null |
2025-05-12 | FedIFL: A federated cross-domain diagnostic framework for motor-driven systems with inconsistent fault modes | Zexiao Wang et.al. | 2505.07315 | null |
2025-05-12 | Empowering the Grid: Collaborative Edge Artificial Intelligence for Decentralized Energy Systems | Eddie de Paula Jr et.al. | 2505.07170 | null |
2025-05-11
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-11 | Source Anonymity for Private Random Walk Decentralized Learning | Maximilian Egger et.al. | 2505.07011 | null |
2025-05-10
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-10 | A Contrastive Federated Semi-Supervised Learning Intrusion Detection Framework for Internet of Robotic Things | Yifan Zeng et.al. | 2505.06636 | null |
2025-05-09
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-09 | Remote Rowhammer Attack using Adversarial Observations on Federated Learning Clients | Jinsheng Yuan et.al. | 2505.06335 | null |
2025-05-09 | Self-Supervised Federated GNSS Spoofing Detection with Opportunistic Data | Wenjie Liu et.al. | 2505.06171 | null |
2025-05-09 | Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation | Stefan Vasilev et.al. | 2505.06027 | null |
2025-05-09 | Efficient Full-Stack Private Federated Deep Learning with Post-Quantum Security | Yiwei Zhang et.al. | 2505.05751 | null |
2025-05-08
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-08 | Optimal Regret of Bernoulli Bandits under Global Differential Privacy | Achraf Azize et.al. | 2505.05613 | null |
2025-05-08 | Adaptive Biased User Scheduling for Heterogeneous Wireless Federate Learning Network | Changxiang Wu et.al. | 2505.05231 | null |
2025-05-08 | FedTDP: A Privacy-Preserving and Unified Framework for Trajectory Data Preparation via Federated Learning | Zhihao Zeng et.al. | 2505.05155 | null |
2025-05-08 | CacheFL: Efficient Federated Cache Model Fine-Tuning for Vision-Language Models | Mengjun Yi et.al. | 2505.05130 | null |
2025-05-08 | Balancing Client Participation in Federated Learning Using AoI | Alireza Javani et.al. | 2505.05099 | null |
2025-05-08 | Federated Learning for Cyber Physical Systems: A Comprehensive Survey | Minh K. Quan et.al. | 2505.04873 | null |
2025-05-07
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-07 | Privacy-preserving neutral atom-based quantum classifier towards real healthcare applications | Ettore Canonici et.al. | 2505.04570 | null |
2025-05-07 | RDPP-TD: Reputation and Data Privacy-Preserving based Truth Discovery Scheme in Mobile Crowdsensing | Lijian Wu et.al. | 2505.04361 | null |
2025-05-07 | A Framework to Prevent Biometric Data Leakage in the Immersive Technologies Domain | Keshav Sood et.al. | 2505.04123 | null |
2025-05-06
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-06 | An Overview of the Prospects and Challenges of Using Artificial Intelligence for Energy Management Systems in Microgrids | Noor ul Misbah Khanum et.al. | 2505.05498 | null |
2025-05-06 | AI-Driven Security in Cloud Computing: Enhancing Threat Detection, Automated Response, and Cyber Resilience | Shamnad Mohamed Shaffi et.al. | 2505.03945 | null |
2025-05-06 | Event-Triggered GAT-LSTM Framework for Attack Detection in Heating, Ventilation, and Air Conditioning Systems | Zhenan Feng et.al. | 2505.03559 | null |
2025-05-06 | SKALD: Scalable K-Anonymisation for Large Datasets | Kailash Reddy et.al. | 2505.03529 | null |
2025-05-06 | SemSpaceFL: A Collaborative Hierarchical Federated Learning Framework for Semantic Communication in 6G LEO Satellites | Loc X. Nguyen et.al. | 2505.00966 | null |
2025-05-05
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-05 | Memorization or Interpolation ? Detecting LLM Memorization through Input Perturbation Analysis | Albérick Euraste Djiré et.al. | 2505.03019 | null |
2025-05-05 | Navigating Privacy and Trust: AI Assistants as Social Support for Older Adults | Karina LaRubbio et.al. | 2505.02975 | null |
2025-05-05 | Unlearning vs. Obfuscation: Are We Truly Removing Knowledge? | Guangzhi Sun et.al. | 2505.02884 | null |
2025-05-05 | Encrypted Federated Search Using Homomorphic Encryption | Om Rathod et.al. | 2505.02409 | null |
2025-05-05 | Quantitative Analysis of Performance Drop in DeepSeek Model Quantization | Enbo Zhao et.al. | 2505.02390 | **[link](https://github.com/unicomai/deepseek-eval)** |
2025-05-04
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-04 | Student Perspectives on the Benefits and Risks of AI in Education | Griffin Pitts et.al. | 2505.02198 | null |
2025-05-03
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-03 | Towards Trustworthy Federated Learning with Untrusted Participants | Youssef Allouah et.al. | 2505.01874 | null |
2025-05-03 | PQS-BFL: A Post-Quantum Secure Blockchain-based Federated Learning Framework | Daniel Commey et.al. | 2505.01866 | null |
2025-05-03 | Privacy Preserving Machine Learning Model Personalization through Federated Personalized Learning | Md. Tanzib Hosain et.al. | 2505.01788 | null |
2025-05-03 | Enhanced Flexibility Aggregation Using LinDistFlow Model with Loss Compensation | Yanlin Jiang et.al. | 2505.01715 | null |
2025-05-02
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-02 | Securing the Future of IVR: AI-Driven Innovation with Agile Security, Data Regulation, and Ethical AI Integration | Khushbu Mehboob Shaikh et.al. | 2505.01514 | null |
2025-05-01
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-01 | Distributed Retrieval-Augmented Generation | Chenhao Xu et.al. | 2505.00443 | null |
2025-04-30
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-30 | Sparsification Under Siege: Defending Against Poisoning Attacks in Communication-Efficient Federated Learning | Zhiyong Jin et.al. | 2505.01454 | null |
2025-04-30 | VDDP: Verifiable Distributed Differential Privacy under the Client-Server-Verifier Setup | Haochen Sun et.al. | 2504.21752 | null |
2025-04-30 | Bilateral Differentially Private Vertical Federated Boosted Decision Trees | Bokang Zhang et.al. | 2504.21739 | null |
2025-04-29
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-29 | FedHERO: A Federated Learning Approach for Node Classification Task on Heterophilic Graphs | Zihan Chen et.al. | 2504.21206 | null |
2025-04-29 | Federated One-Shot Learning with Data Privacy and Objective-Hiding | Maximilian Egger et.al. | 2504.21182 | null |
2025-04-29 | A Brief Review for Compression and Transfer Learning Techniques in DeepFake Detection | Andreas Karathanasis et.al. | 2504.21066 | null |
2025-04-29 | Bipartite Randomized Response Mechanism for Local Differential Privacy | Shun Zhang et.al. | 2504.20926 | null |
2025-04-29 | ReCIT: Reconstructing Full Private Data from Gradient in Parameter-Efficient Fine-Tuning of Large Language Models | Jin Xie et.al. | 2504.20570 | null |
2025-04-29 | Clustering-Based Evolutionary Federated Multiobjective Optimization and Learning | Chengui Xiao et.al. | 2504.20346 | null |
2025-04-28
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-28 | FedCCL: Federated Clustered Continual Learning Framework for Privacy-focused Energy Forecasting | Michael A. Helcig et.al. | 2504.20282 | null |
2025-04-28 | Financial Data Analysis with Robust Federated Logistic Regression | Kun Yang et.al. | 2504.20250 | null |
2025-04-28 | Federated Out-of-Distribution Generalization: A Causal Augmentation View | Runhui Zhang et.al. | 2504.19882 | null |
2025-04-28 | Dynamic Tsetlin Machine Accelerators for On-Chip Training at the Edge using FPGAs | Gang Mao et.al. | 2504.19797 | null |
2025-04-27
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-27 | Privacy-Preserving Federated Embedding Learning for Localized Retrieval-Augmented Generation | Qianren Mao et.al. | 2504.19101 | null |
2025-04-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-26 | SONNI: Secure Oblivious Neural Network Inference | Luke Sperling et.al. | 2504.18974 | null |
2025-04-25
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-25 | The Rise of Small Language Models in Healthcare: A Comprehensive Survey | Muskan Garg et.al. | 2504.17119 | null |
2025-04-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-24 | Federated Learning: A Survey on Privacy-Preserving Collaborative Intelligence | Edward Collins et.al. | 2504.17703 | null |
2025-04-24 | From Randomized Response to Randomized Index: Answering Subset Counting Queries with Local Differential Privacy | Qingqing Ye et.al. | 2504.17523 | null |
2025-04-24 | PatientDx: Merging Large Language Models for Protecting Data-Privacy in Healthcare | Jose G. Moreno et.al. | 2504.17360 | null |
2025-04-23
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-23 | AI for Accessible Education: Personalized Audio-Based Learning for Blind Students | Crystal Yang et.al. | 2504.17117 | null |
2025-04-23 | Simplified Swarm Learning Framework for Robust and Scalable Diagnostic Services in Cancer Histopathology | Yanjie Wu et.al. | 2504.16732 | null |
2025-04-23 | DP2FL: Dual Prompt Personalized Federated Learning in Foundation Models | Ying Chang et.al. | 2504.16357 | null |
2025-04-23 | aiXamine: Simplified LLM Safety and Security | Fatih Deniz et.al. | 2504.14985 | null |
2025-04-22
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-22 | Towards a Distributed Federated Learning Aggregation Placement using Particle Swarm Intelligence | Amir Ali-Pour et.al. | 2504.16227 | null |
2025-04-22 | LLMs meet Federated Learning for Scalable and Secure IoT Management | Yazan Otoum et.al. | 2504.16032 | null |
2025-04-22 | Federated Latent Factor Learning for Recovering Wireless Sensor Networks Signal with Privacy-Preserving | Chengjun Yu et.al. | 2504.15525 | null |
2025-04-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-21 | From Reviews to Dialogues: Active Synthesis for Zero-Shot LLM-based Conversational Recommender System | Rohan Surana et.al. | 2504.15476 | null |
2025-04-21 | Federated Latent Factor Model for Bias-Aware Recommendation with Privacy-Preserving | Junxiang Gao et.al. | 2504.15090 | null |
2025-04-21 | Aligning Beam with Imbalanced Multi-modality: A Generative Federated Learning Approach | Jiahui Liang et.al. | 2504.14835 | null |
2025-04-19
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-19 | WeiDetect: Weibull Distribution-Based Defense against Poisoning Attacks in Federated Learning for Network Intrusion Detection Systems | Sameera K. M. et.al. | 2504.04367 | null |
2025-04-17
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-17 | SHA256 at SemEval-2025 Task 4: Selective Amnesia -- Constrained Unlearning for Large Language Models via Knowledge Isolation | Saransh Agrawal et.al. | 2504.12996 | **[link](https://github.com/LAB-FLAIR/Constrained-Unlearning-for-LLM)** |
2025-04-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-16 | Enhanced Battery Capacity Estimation in Data-Limited Scenarios through Swarm Learning | Jiawei Zhang et.al. | 2504.12444 | null |
2025-04-16 | Federated Spectral Graph Transformers Meet Neural Ordinary Differential Equations for Non-IID Graphs | Kishan Gurumurthy et.al. | 2504.11808 | **[link](https://github.com/springwiz11/fed-gnodeformer)** |
2025-04-15
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-15 | Never Start from Scratch: Expediting On-Device LLM Personalization via Explainable Model Selection | Haoming Wang et.al. | 2504.13938 | null |
2025-04-15 | FLSSM: A Federated Learning Storage Security Model with Homomorphic Encryption | Yang Li et.al. | 2504.11088 | null |
2025-04-15 | Collaborative Bayesian Optimization via Wasserstein Barycenters | Donglin Zhan et.al. | 2504.10770 | null |
2025-04-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-14 | Optimising Intrusion Detection Systems in Cloud-Edge Continuum with Knowledge Distillation for Privacy-Preserving and Efficient Communication | Soad Almabdy et.al. | 2504.10698 | null |
2025-04-14 | VAE-based Feature Disentanglement for Data Augmentation and Compression in Generalized GNSS Interference Classification | Lucas Heublein et.al. | 2504.10556 | null |
2025-04-14 | Privacy-Preserving Distributed Link Predictions Among Peers in Online Classrooms Using Federated Learning | Anurata Prabha Hridi et.al. | 2504.10456 | null |
2025-04-14 | Understanding the Impact of Data Domain Extraction on Synthetic Data Privacy | Georgi Ganev et.al. | 2504.08254 | null |
2025-04-12
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-12 | Deploying Large AI Models on Resource-Limited Devices with Split Federated Learning | Xianke Qiang et.al. | 2504.09114 | null |
2025-04-12 | Large Language Models integration in Smart Grids | Seyyedreza Madani et.al. | 2504.09059 | null |
2025-04-11
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-11 | DataMap: A Portable Application for Visualizing High-Dimensional Data | Xijin Ge et.al. | 2504.08875 | **[link](https://github.com/gexijin/datamap)** |
2025-04-11 | Personalizing Federated Learning for Hierarchical Edge Networks with Non-IID Data | Seunghyun Lee et.al. | 2504.08872 | null |
2025-04-10
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-10 | Traversal Learning Coordination For Lossless And Efficient Distributed Learning | Erdenebileg Batbaatar et.al. | 2504.07471 | null |
2025-04-10 | FAST: Federated Active Learning with Foundation Models for Communication-efficient Sampling and Training | Haoyuan Li et.al. | 2504.03783 | null |
2025-04-05
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-05 | Unmasking the Reality of PII Masking Models: Performance Gaps and the Call for Accountability | Devansh Singh et.al. | 2504.12308 | null |
2025-04-05 | Multi-Agent Reinforcement Learning for Graph Discovery in D2D-Enabled Federated Learning | Satyavrat Wagle et.al. | 2503.23218 | null |
2025-04-04
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-04 | Secure Federated XGBoost with CUDA-accelerated Homomorphic Encryption via NVIDIA FLARE | Ziyue Xu et.al. | 2504.03909 | null |
2025-04-04 | An Intelligent and Privacy-Preserving Digital Twin Model for Aging-in-Place | Yongjie Wang et.al. | 2504.03798 | null |
2025-04-04 | Hierarchical Knowledge Structuring for Effective Federated Learning in Heterogeneous Environments | Wai Fong Tam et.al. | 2504.03505 | null |
2025-04-03
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-03 | Enhancing Air Quality Monitoring: A Brief Review of Federated Learning Advances | Sara Yarham et.al. | 2504.02909 | null |
2025-04-03 | Web3DB: Web 3.0 RDBMS for Individual Data Ownership | Shankha Shubhra Mukherjee et.al. | 2504.02713 | null |
2025-04-02
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-02 | Efficient Federated Learning Tiny Language Models for Mobile Network Feature Prediction | Daniel Becking et.al. | 2504.01947 | null |
2025-04-02 | CO-DEFEND: Continuous Decentralized Federated Learning for Secure DoH-Based Threat Detection | Diego Cajaraville-Aboy et.al. | 2504.01882 | **[link](https://gitlab.com/compromise3/co-defend)** |
2025-04-02 | A Two-Timescale Approach for Wireless Federated Learning with Parameter Freezing and Power Control | Jinhao Ouyang et.al. | 2504.01752 | null |
2025-04-02 | Sky of Unlearning (SoUL): Rewiring Federated Machine Unlearning via Selective Pruning | Md Mahabub Uz Zaman et.al. | 2504.01705 | null |
2025-04-02 | Split Federated Learning for UAV-Enabled Integrated Sensing, Computation, and Communication | Xiangwang Hou et.al. | 2504.01443 | null |
2025-04-02 | TeleAntiFraud-28k: An Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection | Zhiming Ma et.al. | 2503.24115 | **[link](https://github.com/jimmyma99/teleantifraud)** |
2025-04-02 | SimDC: A High-Fidelity Device Simulation Platform for Device-Cloud Collaborative Computing | Ruiguang Pei et.al. | 2503.22288 | **[link](https://github.com/opas-lab/olearning-sim)** |
2025-04-01
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-01 | AI Regulation and Capitalist Growth: Balancing Innovation, Ethics, and Global Governance | Vikram Kulothungan et.al. | 2504.02000 | null |
2025-04-01 | Benchmarking Federated Machine Unlearning methods for Tabular Data | Chenguang Xiao et.al. | 2504.00921 | null |
2025-04-01 | A Survey on Unlearnable Data | Jiahao Li et.al. | 2503.23536 | **[link](https://github.com/LiJiahao-Alex/Awesome-UnLearnable-Data)** |
2025-03-31
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-31 | Federated Learning for Cross-Domain Data Privacy: A Distributed Approach to Secure Collaboration | Yiwei Zhang et.al. | 2504.00282 | null |
2025-03-31 | Communication-Efficient and Personalized Federated Foundation Model Fine-Tuning via Tri-Matrix Adaptation | Yongle Li et.al. | 2503.23869 | null |
2025-03-31 | VIDEX: A Disaggregated and Extensible Virtual Index for the Cloud and AI Era | Rong Kang et.al. | 2503.23776 | **[link](https://github.com/bytedance/videx)** |
2025-03-29
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-29 | Towards Secure Semantic Communications in the Presence of Intelligent Eavesdroppers | Shunpu Tang et.al. | 2503.23103 | null |
2025-03-29 | Disentangled Source-Free Personalization for Facial Expression Recognition with Neutral Target Data | Masoumeh Sharafi et.al. | 2503.20771 | **[link](https://github.com/MasoumehSharafi/DSFDA-for-Pain-Assessment)** |
2025-03-28
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-28 | Efficient Verified Machine Unlearning For Distillation | Yijun Quan et.al. | 2503.22539 | null |
2025-03-27
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-27 | Adaptive Clipping for Privacy-Preserving Few-Shot Learning: Enhancing Generalization with Limited Data | Kanishka Ranaweera et.al. | 2503.22749 | null |
2025-03-27 | Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation | Reza Qorbani et.al. | 2503.21780 | **[link](https://github.com/rezaqorbani/semla)** |
2025-03-27 | Federated Intelligence: When Large AI Models Meet Federated Fine-Tuning and Collaborative Reasoning at the Network Edge | Wanli Ni et.al. | 2503.21412 | null |
2025-03-27 | Improving $(α, f)$-Byzantine Resilience in Federated Learning via layerwise aggregation and cosine distance | Mario García-Márquez et.al. | 2503.21244 | **[link](https://github.com/ari-dasci/S-layerwise_cosine_aggregation)** |
2025-03-27 | Federated Learning with Differential Privacy: An Utility-Enhanced Approach | Kanishka Ranaweera et.al. | 2503.21154 | null |
2025-03-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-26 | Privacy in Immersive Extended Reality: Exploring User Perceptions, Concerns, and Coping Strategies | Hilda Hadan et.al. | 2503.21010 | null |
2025-03-26 | MedSegNet10: A Publicly Accessible Network Repository for Split Federated Medical Image Segmentation | Chamani Shiranthika et.al. | 2503.20830 | null |
2025-03-26 | Continual learning via probabilistic exchangeable sequence modelling | Hanwen Xing et.al. | 2503.20725 | null |
2025-03-26 | How Secure is Forgetting? Linking Machine Unlearning to Machine Learning Attacks | Muhammed Shafi K. P. et.al. | 2503.20257 | null |
2025-03-25
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-25 | Federated Learning: A new frontier in the exploration of multi-institutional medical imaging data | Dominika Ciupek et.al. | 2503.20107 | null |
2025-03-25 | Distributed Stochastic Zeroth-Order Optimization with Compressed Communication | Youqing Hua et.al. | 2503.17429 | null |
2025-03-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-24 | Two Types of Data Privacy Controls | Eman Alashwali et.al. | 2503.18729 | null |
2025-03-24 | The Role of Artificial Intelligence in Enhancing Insulin Recommendations and Therapy Outcomes | Maria Panagiotou et.al. | 2503.18592 | null |
2025-03-24 | Surgical Action Planning with Large Language Models | Mengya Xu et.al. | 2503.18296 | null |
2025-03-24 | Zero-Knowledge Federated Learning: A New Trustworthy and Privacy-Preserving Distributed Learning Paradigm | Yuxin Jin et.al. | 2503.15550 | null |
2025-03-23
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-23 | FROG: Fair Removal on Graphs | Ziheng Chen et.al. | 2503.18197 | null |
2025-03-23 | Active Inference for Energy Control and Planning in Smart Buildings and Communities | Seyyed Danial Nazemi et.al. | 2503.18161 | null |
2025-03-23 | Dynamic Gradient Sparse Update for Edge Training | I-Hsuan Li et.al. | 2503.17959 | null |
2025-03-22
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-22 | Feather-SQL: A Lightweight NL2SQL Framework with Dual-Model Collaboration Paradigm for Small Language Models | Wenqi Pei et.al. | 2503.17811 | null |
2025-03-22 | Decentralized Federated Dataset Dictionary Learning for Multi-Source Domain Adaptation | Rebecca Clain et.al. | 2503.17683 | null |
2025-03-22 | A Qualitative Study of User Perception of M365 AI Copilot | Muneera Bano et.al. | 2503.17661 | null |
2025-03-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-21 | Collaborative Value Function Estimation Under Model Mismatch: A Federated Temporal Difference Analysis | Ali Beikmohammadi et.al. | 2503.17454 | **[link](https://github.com/AliBeikmohammadi/FedRL)** |
2025-03-21 | On-Device Federated Continual Learning on RISC-V-based Ultra-Low-Power SoC for Intelligent Nano-Drone Swarms | Lars Kröger et.al. | 2503.17436 | null |
2025-03-21 | A Thorough Assessment of the Non-IID Data Impact in Federated Learning | Daniel M. Jimenez-Gutierrez et.al. | 2503.17070 | null |
2025-03-20
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-20 | Empirical Analysis of Privacy-Fairness-Accuracy Trade-offs in Federated Learning: A Step Towards Responsible AI | Dawood Wasif et.al. | 2503.16233 | null |
2025-03-20 | Privacy-Preserving Utilization of Distribution System Flexibility for Enhanced TSO-DSO Interoperability: A Novel Machine Learning-Based Optimal Power Flow Approach | Burak Dindar et.al. | 2503.15966 | null |
2025-03-19
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-19 | ChatGPT or A Silent Everywhere Helper: A Survey of Large Language Models | Azim Akhtarshenas et.al. | 2503.17403 | null |
2025-03-19 | AEJIM: A Real-Time AI Framework for Crowdsourced, Transparent, and Ethical Environmental Hazard Detection and Reporting | Torsten Tiltack et.al. | 2503.17401 | null |
2025-03-19 | Distributed Generalized Linear Models: A Privacy-Preserving Approach | Daniel Tinoco et.al. | 2503.15287 | **[link](https://github.com/dbtnc/distributed_glm)** |
2025-03-19 | Online federated learning framework for classification | Wenxing Guo et.al. | 2503.15210 | null |
2025-03-19 | Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices | Ziyao Wang et.al. | 2503.14932 | null |
2025-03-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-18 | Advanced Relay-Based Collaborative Framework for Optimizing Synchronization in Split Federated Learning over Wireless Networks | Haoran Gao et.al. | 2503.15559 | null |
2025-03-18 | Anomaly-Flow: A Multi-domain Federated Generative Adversarial Network for Distributed Denial-of-Service Detection | Leonardo Henrique de Melo et.al. | 2503.14618 | **[link](https://github.com/c2dc/anomaly-flow)** |
2025-03-18 | Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels | Yujia Tong et.al. | 2503.13917 | null |
2025-03-18 | Automatic MILP Model Construction for Multi-Robot Task Allocation and Scheduling Based on Large Language Models | Mingming Peng et.al. | 2503.13813 | null |
2025-03-17
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-17 | The Impact of Artificial Intelligence on Emergency Medicine: A Review of Recent Advances | Gustavo Correia et.al. | 2503.14546 | null |
2025-03-17 | Regulating Ai In Financial Services: Legal Frameworks And Compliance Challenges | Shahmar Mirishli et.al. | 2503.14541 | null |
2025-03-17 | SDFLMQ: A Semi-Decentralized Federated Learning Framework over MQTT | Amir Ali-Pour et.al. | 2503.13624 | null |
2025-03-17 | How Good is my Histopathology Vision-Language Foundation Model? A Holistic Benchmark | Roba Al Majzoub et.al. | 2503.12990 | **[link](https://github.com/musk007/Histopathology_Benchmark)** |
2025-03-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-16 | Synthetic Data for Robust AI Model Development in Regulated Enterprises | Aditi Godbole et.al. | 2503.12353 | null |
2025-03-16 | Synthesizing Privacy-Preserving Text Data via Finetuning without Finetuning Billion-Scale LLMs | Bowen Tan et.al. | 2503.12347 | null |
2025-03-15
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-15 | From Laboratory to Real World: A New Benchmark Towards Privacy-Preserved Visible-Infrared Person Re-Identification | Yan Jiang et.al. | 2503.12232 | null |
2025-03-15 | Efficient and Privacy-Preserved Link Prediction via Condensed Graphs | Yunbo Long et.al. | 2503.12156 | null |
2025-03-15 | A Survey on Federated Fine-tuning of Large Language Models | Yebo Wu et.al. | 2503.12016 | **[link](https://github.com/clin0212/awesome-federated-llm-learning)** |
Model Privacy
2025-06-03
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-03 | Privacy Leaks by Adversaries: Adversarial Iterations for Membership Inference Attack | Jing Xue et.al. | 2506.02711 | null |
2025-05-15
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-15 | Private Transformer Inference in MLaaS: A Survey | Yang Li et.al. | 2505.10315 | null |
2025-05-09
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-09 | From Models to Network Topologies: A Topology Inference Attack in Decentralized Federated Learning | Chao Feng et.al. | 2501.03119 | null |
2025-05-05
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-05 | Bayes-Nash Generative Privacy Against Membership Inference Attacks | Tao Zhang et.al. | 2410.07414 | null |
2025-04-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-18 | Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification | Yue Li et.al. | 2504.11793 | null |
2025-04-15
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-15 | FLSSM: A Federated Learning Storage Security Model with Homomorphic Encryption | Yang Li et.al. | 2504.11088 | null |
2025-04-12
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-12 | Footprints of Data in a Classifier: Understanding the Privacy Risks and Solution Strategies | Payel Sadhukhan et.al. | 2407.02268 | null |
2025-04-07
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-07 | Enhancing Trust in AI Marketplaces: Evaluating On-Chain Verification of Personalized AI models using zk-SNARKs | Nishant Jagannath et.al. | 2504.04794 | null |
2025-03-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-26 | Modelling Privacy Compliance in Cross-border Data Transfers with Bigraphs | Ebtihal Althubiti et.al. | 2503.20464 | null |
2025-03-19
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-19 | Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices | Ziyao Wang et.al. | 2503.14932 | null |
2025-03-11
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-11 | Multi-P$^2$A: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models | Jie Zhang et.al. | 2412.19496 | **[link](https://github.com/Xiangkui-Cao/Multi-P2A)** |
2025-02-27
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-27 | GOD model: Privacy Preserved AI School for Personal Assistant | PIN AI Team et.al. | 2502.18527 | **[link](https://github.com/pin-ai/god-model)** |
2025-02-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-21 | Model Privacy: A Unified Framework to Understand Model Stealing Attacks and Defenses | Ganghua Wang et.al. | 2502.15567 | null |
2025-02-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-14 | An Interactive Framework for Implementing Privacy-Preserving Federated Learning: Experiments on Large Language Models | Kasra Ahmadi et.al. | 2502.08008 | **[link](https://github.com/KasraAhmadi/FL-Privacy-LLM)** |
2025-02-08
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-08 | Dual Defense: Enhancing Privacy and Mitigating Poisoning Attacks in Federated Learning | Runhua Xu et.al. | 2502.05547 | **[link](https://github.com/irxyzzz/DualDefense)** |
2025-01-06
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-01-06 | Pathway to Secure and Trustworthy ZSM for LLMs: Attacks, Defense, and Opportunities | Sunder Ali Khowaja et.al. | 2408.00722 | null |
2025-01-04
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-01-04 | AdaMixup: A Dynamic Defense Framework for Membership Inference Attack Mitigation | Ying Chen et.al. | 2501.02182 | null |
2024-12-13
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-13 | ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression | Kai Yao et.al. | 2412.09812 | null |
2024-10-29
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-10-29 | PrivCirNet: Efficient Private Inference via Block Circulant Transformation | Tianshi Xu et.al. | 2405.14569 | **[link](https://github.com/tianshi-xu/privcirnet)** |
2024-10-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-10-24 | Does Differential Privacy Impact Bias in Pretrained NLP Models? | Md. Khairul Islam et.al. | 2410.18749 | **[link](https://github.com/khairulislam/dp-on-nlp-bias)** |
2024-10-01
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-10-01 | PrivTuner with Homomorphic Encryption and LoRA: A P3EFT Scheme for Privacy-Preserving Parameter-Efficient Fine-Tuning of AI Foundation Models | Yang Li et.al. | 2410.00433 | null |
2024-09-10
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-09-10 | A Pervasive, Efficient and Private Future: Realizing Privacy-Preserving Machine Learning Through Hybrid Homomorphic Encryption | Khoa Nguyen et.al. | 2409.06422 | **[link](https://github.com/khoaguin/pockethhe)** |
2024-07-23
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-07-23 | Representation Magnitude has a Liability to Privacy Vulnerability | Xingli Fang et.al. | 2407.16164 | **[link](https://github.com/jekimlab/aies2024_srcm)** |
2024-06-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-26 | Natural Language but Omitted? On the Ineffectiveness of Large Language Models' privacy policy from End-users' Perspective | Shuning Zhang et.al. | 2406.18100 | null |
2024-06-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-21 | A Survey on Intelligent Internet of Things: Applications, Security, Privacy, and Future Directions | Ons Aouedi et.al. | 2406.03820 | null |
2024-06-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-16 | Promoting Data and Model Privacy in Federated Learning through Quantized LoRA | JianHao Zhu et.al. | 2406.10976 | null |
2024-04-17
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-04-17 | OmniLytics+: A Secure, Efficient, and Affordable Blockchain Data Market for Machine Learning through Off-Chain Processing | Songze Li et.al. | 2406.06477 | null |
Forensics
2025-06-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-26 | Post-training for Deepfake Speech Detection | Wanying Ge et.al. | 2506.21090 | null |
2025-06-26 | IndieFake Dataset: A Benchmark Dataset for Audio Deepfake Detection | Abhay Kumar et.al. | 2506.19014 | null |
2025-06-25
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-25 | Pay Less Attention to Deceptive Artifacts: Robust Detection of Compressed Deepfakes on Online Social Networks | Manyi Li et.al. | 2506.20548 | null |
2025-06-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-21 | SELFI: Selective Fusion of Identity for Generalizable Deepfake Detection | Younghun Kim et.al. | 2506.17592 | null |
2025-06-20
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection | Yuchu Jiang et.al. | 2506.16819 | **[link](https://github.com/kamichanw/loupe)** |
2025-06-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-18 | I Know Which LLM Wrote Your Code Last Summer: LLM generated Code Stylometry for Authorship Attribution | Tamas Bisztray et.al. | 2506.17323 | null |
2025-06-17
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-17 | A Comparative Study on Proactive and Passive Detection of Deepfake Speech | Chia-Hua Wu et.al. | 2506.14398 | **[link](https://github.com/nii-yamagishilab/antispoofing-watermark)** |
2025-06-17 | Manipulated Regions Localization For Partially Deepfake Audio: A Survey | Jiayi He et.al. | 2506.14396 | null |
2025-06-15
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-15 | Governments Should Mandate Tiered Anonymity on Social-Media Platforms to Counter Deepfakes and LLM-Driven Mass Misinformation | David Khachaturov et.al. | 2506.12814 | null |
2025-06-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-14 | Towards Neural Audio Codec Source Parsing | Orchid Chetia Phukan et.al. | 2506.12627 | null |
2025-06-13
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-13 | From Sharpness to Better Generalization for Speech Deepfake Detection | Wen Huang et.al. | 2506.11532 | **[link](https://github.com/nii-yamagishilab/SAM-AntiSpoofing)** |
2025-06-13 | FAME: A Lightweight Spatio-Temporal Network for Model Attribution of Face-Swap Deepfakes | Wasim Ahmad et.al. | 2506.11477 | **[link](https://github.com/wasim004/FAME)** |
2025-06-13 | A Watermark for Auto-Regressive Image Generation Models | Yihan Wu et.al. | 2506.11371 | null |
2025-06-12
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-12 | Enhancing Deepfake Detection using SE Block Attention with CNN | Subhram Dasgupta et.al. | 2506.10683 | null |
2025-06-12 | LLMs Are Not Yet Ready for Deepfake Image Detection | Shahroz Tariq et.al. | 2506.10474 | null |
2025-06-12 | TikTok's Research API: Problems Without Explanations | Carlos Entrena-Serrano et.al. | 2506.09746 | null |
2025-06-11
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-11 | Unmasking real-world audio deepfakes: A data-centric approach | David Combei et.al. | 2506.09606 | **[link](https://github.com/davidcombei/ai4t)** |
2025-06-11 | TADA: Training-free Attribution and Out-of-Domain Detection of Audio Deepfakes | Adriana Stan et.al. | 2506.05802 | **[link](https://github.com/adrianastan/tada)** |
2025-06-10
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-10 | Risks & Benefits of LLMs & GenAI for Platform Integrity, Healthcare Diagnostics, Cybersecurity, Privacy & AI Safety: A Comprehensive Survey, Roadmap & Implementation Blueprint | Kiarash Ahi et.al. | 2506.12088 | null |
2025-06-10 | Context-aware TFL: A Universal Context-aware Contrastive Learning Framework for Temporal Forgery Localization | Qilin Yin et.al. | 2506.08493 | null |
2025-06-10 | Multimodal Zero-Shot Framework for Deepfake Hate Speech Detection in Low-Resource Languages | Rishabh Ranjan et.al. | 2506.08372 | null |
2025-06-10 | Towards Generalized Source Tracing for Codec-Based Deepfake Speech | Xuanjun Chen et.al. | 2506.07294 | null |
2025-06-09
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-09 | Lightweight Joint Audio-Visual Deepfake Detection via Single-Stream Multi-Modal Learning Framework | Kuiyuan Zhang et.al. | 2506.07358 | null |
2025-06-07
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-07 | Identity Deepfake Threats to Biometric Authentication Systems: Public and Expert Perspectives | Shijing He et.al. | 2506.06825 | null |
2025-06-07 | SynHate: Detecting Hate Speech in Synthetic Deepfake Audio | Rishabh Ranjan et.al. | 2506.06772 | null |
2025-06-06
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-06 | DeepFake Doctor: Diagnosing and Treating Audio-Video Fake Detection | Marcel Klemt et.al. | 2506.05851 | null |
2025-06-05
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-05 | SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms | Arnesh Batra et.al. | 2506.05538 | null |
2025-06-05 | Practical Manipulation Model for Robust Deepfake Detection | Benedikt Hopf et.al. | 2506.05119 | null |
2025-06-05 | STOPA: A Database of Systematic VariaTion Of DeePfake Audio for Open-Set Source Tracing and Attribution | Anton Firc et.al. | 2505.19644 | null |
2025-06-04
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-04 | AuthGuard: Generalizable Deepfake Detection via Language Guidance | Guangyu Shen et.al. | 2506.04501 | null |
2025-06-03
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-03 | A Data-Driven Diffusion-based Approach for Audio Deepfake Explanations | Petr Grinberg et.al. | 2506.03425 | null |
2025-06-03 | Towards Source Attribution of Singing Voice Deepfake with Multimodal Foundation Models | Orchid Chetia Phukan et.al. | 2506.03364 | null |
2025-06-03 | DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models | Jiarui Wang et.al. | 2506.03007 | null |
2025-06-03 | PartialEdit: Identifying Partial Deepfakes in the Era of Neural Speech Editing | You Zhang et.al. | 2506.02958 | null |
2025-06-03 | Enhancing Abnormality Identification: Robust Out-of-Distribution Strategies for Deepfake Detection | Luca Maiano et.al. | 2506.02857 | null |
2025-06-03 | Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection | Jiaxin Liu et.al. | 2505.16512 | null |
2025-06-02
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-02 | Unveiling Audio Deepfake Origins: A Deep Metric learning And Conformer Network Approach With Ensemble Fusion | Ajinkya Kulkarni et.al. | 2506.02085 | null |
2025-06-01
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-01 | Multiverse Through Deepfakes: The MultiFakeVerse Dataset of Person-Centric Visual and Conceptual Manipulations | Parul Gupta et.al. | 2506.00868 | **[link](https://github.com/parul-gupta/multifakeverse)** |
2025-06-01 | Replay Attacks Against Audio Deepfake Detection | Nicolas Müller et.al. | 2505.14862 | null |
2025-05-31
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-31 | XMAD-Bench: Cross-Domain Multilingual Audio Deepfake Benchmark | Ioan-Paul Ciobanu et.al. | 2506.00462 | **[link](https://github.com/ristea/xmad-bench)** |
2025-05-31 | RPRA-ADD: Forgery Trace Enhancement-Driven Audio Deepfake Detection | Ruibo Fu et.al. | 2506.00375 | null |
2025-05-30
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-30 | TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection | Xinqi Xiong et.al. | 2505.24866 | null |
2025-05-30 | Rehearsal with Auxiliary-Informed Sampling for Audio Deepfake Detection | Falih Gozi Febrinanto et.al. | 2505.24486 | null |
2025-05-30 | Benchmarking Foundation Models for Zero-Shot Biometric Tasks | Redwan Sony et.al. | 2505.24214 | null |
2025-05-29
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-29 | Few-Shot Speech Deepfake Detection Adaptation with Gaussian Processes | Neta Glazer et.al. | 2505.23619 | **[link](https://github.com/NetaGlazer/ADD-GP)** |
2025-05-28
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-28 | Speaking images. A novel framework for the automated self-description of artworks | Valentine Bernasconi et.al. | 2506.05368 | null |
2025-05-28 | Tell me Habibi, is it Real or Fake? | Kartik Kuckreja et.al. | 2505.22581 | null |
2025-05-27
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-27 | RoGA: Towards Generalizable Deepfake Detection through Robust Gradient Alignment | Lingyu Qiu et.al. | 2505.20653 | null |
2025-05-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-26 | ArVoice: A Multi-Speaker Dataset for Arabic Speech Synthesis | Hawau Olamide Toyin et.al. | 2505.20506 | null |
2025-05-26 | Guard Me If You Know Me: Protecting Specific Face-Identity from Deepfakes | Kaiqing Lin et.al. | 2505.19582 | null |
2025-05-25
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-25 | EnvSDD: Benchmarking Environmental Sound Deepfake Detection | Han Yin et.al. | 2505.19203 | null |
2025-05-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-24 | Think Twice before Adaptation: Improving Adaptability of DeepFake Detection via Online Test-Time Adaptation | Hong-Hanh Nguyen-Le et.al. | 2505.18787 | **[link](https://github.com/honghanh2104/t2a-think-twice-before-adaptation)** |
2025-05-24 | HyperFake: Hyperspectral Reconstruction and Attention-Guided Analysis for Advanced Deepfake Detection | Pavan C Shekar et.al. | 2505.18587 | null |
2025-05-24 | Preserving AUC Fairness in Learning with Noisy Protected Groups | Mingyang Wu et.al. | 2505.18532 | **[link](https://github.com/purdue-m2/auc_fairness_with_noisy_groups)** |
2025-05-23
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | CAMME: Adaptive Deepfake Image Detection with Multi-Modal Cross-Attention | Naseem Khan et.al. | 2505.18035 | **[link](https://github.com/magnet300/camme)** |
2025-05-23 | What You Read Isn't What You Hear: Linguistic Sensitivity in Deepfake Speech Detection | Binh Nguyen et.al. | 2505.17513 | null |
2025-05-22
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-22 | Do DeepFake Attribution Models Generalize? | Spiros Baxavanakis et.al. | 2505.21520 | null |
2025-05-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-21 | My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping | Hon Ming Yam et.al. | 2505.15336 | null |
2025-05-21 | CAD: A General Multimodal Framework for Video Deepfake Detection via Cross-Modal Alignment and Distillation | Yuxuan Du et.al. | 2505.15233 | null |
2025-05-21 | BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation | Haiquan Wen et.al. | 2505.12620 | **[link](https://github.com/l8cv/busterx)** |
2025-05-20
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-20 | Listen, Analyze, and Adapt to Learn New Attacks: An Exemplar-Free Class Incremental Learning Method for Audio Deepfake Source Tracing | Yang Xiao et.al. | 2505.14601 | null |
2025-05-20 | Source Verification for Speech Deepfakes | Viola Negroni et.al. | 2505.14188 | null |
2025-05-20 | Naturalness-Aware Curriculum Learning with Dynamic Temperature for Speech Deepfake Detection | Taewoo Kim et.al. | 2505.13976 | null |
2025-05-20 | BiCrossMamba-ST: Speech Deepfake Detection with Bidirectional Mamba Spectro-Temporal Cross-Attention | Yassine El Kheir et.al. | 2505.13930 | null |
2025-05-20 | Forensic deepfake audio detection using segmental speech features | Tianle Yang et.al. | 2505.13847 | null |
2025-05-20 | Benchmarking Unified Face Attack Detection via Hierarchical Prompt Tuning | Ajian Liu et.al. | 2505.13327 | null |
2025-05-19
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-19 | Codec-Based Deepfake Source Tracing via Neural Audio Codec Taxonomy | Xuanjun Chen et.al. | 2505.12994 | null |
2025-05-19 | Multiscale Adaptive Conflict-Balancing Model For Multimedia Deepfake Detection | Zihan Xiong et.al. | 2505.12966 | null |
2025-05-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-18 | Towards Open-world Generalized Deepfake Detection: General Feature Extraction via Unsupervised Domain Adaptation | Midou Guo et.al. | 2505.12339 | null |
2025-05-18 | Is Artificial Intelligence Generated Image Detection a Solved Problem? | Ziqiang Li et.al. | 2505.12335 | **[link](https://github.com/horizontel/aigibench)** |
2025-05-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-16 | X-Edit: Detecting and Localizing Edits in Images Altered by Text-Guided Diffusion Models | Valentina Bazyleva et.al. | 2505.11753 | null |
2025-05-16 | Deepfake Forensic Analysis: Source Dataset Attribution and Legal Implications of Synthetic Media Manipulation | Massimiliano Cassia et.al. | 2505.11110 | null |
2025-05-16 | MAVOS-DD: Multilingual Audio-Video Open-Set Deepfake Detection Benchmark | Florinel-Alin Croitoru et.al. | 2505.11109 | null |
2025-05-16 | $\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection | Hao Gu et.al. | 2505.11079 | null |
2025-05-16 | ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization | Bo Du et.al. | 2505.11003 | **[link](https://github.com/scu-zjz/forensichub)** |
2025-05-16 | BanglaFake: Constructing and Evaluating a Specialized Bengali Deepfake Audio Dataset | Istiaq Ahmed Fahad et.al. | 2505.10885 | **[link](https://github.com/KamruzzamanAsif/BanglaFake)** |
2025-05-16 | Visual Watermarking in the Era of Diffusion Models: Advances and Challenges | Junxian Duan et.al. | 2505.08197 | null |
2025-05-15
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-15 | Characterizing AI-Generated Misinformation on Social Media | Chiara Drolsbach et.al. | 2505.10266 | null |
2025-05-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-14 | WaveGuard: Robust Deepfake Detection and Source Tracing via Dual-Tree Complex Wavelet and Graph Neural Networks | Ziyuan He et.al. | 2505.08614 | **[link](https://github.com/vpsg-research/waveguard)** |
2025-05-13
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-13 | DFA-CON: A Contrastive Learning Approach for Detecting Copyright Infringement in DeepFake Art | Haroon Wahab et.al. | 2505.08552 | null |
2025-05-13 | TT-DF: A Large-Scale Diffusion-Based Dataset and Benchmark for Human Body Forgery Detection | Wenkui Yang et.al. | 2505.08437 | **[link](https://github.com/hashtag00002/tt-df)** |
2025-05-13 | FauForensics: Boosting Audio-Visual Deepfake Detection with Facial Action Units | Jian Wang et.al. | 2505.08294 | null |
2025-05-13 | Where the Devil Hides: Deepfake Detectors Can No Longer Be Trusted | Shuaiwei Yuan et.al. | 2505.08255 | null |
2025-05-11
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-11 | Multimodal Fake News Detection: MFND Dataset and Shallow-Deep Multitask Learning | Ye Zhu et.al. | 2505.06796 | **[link](https://github.com/yunan-wang33/sdml)** |
2025-05-10
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-10 | Beyond Identity: A Generalizable Approach for Deepfake Audio Detection | Yasaman Ahmadiadli et.al. | 2505.06766 | null |
2025-05-10 | Unmasking Deep Fakes: Leveraging Deep Learning for Video Authenticity Detection | Mahmudul Hasan et.al. | 2505.06528 | null |
2025-05-08
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-08 | Cross-Branch Orthogonality for Improved Generalization in Face Deepfake Detection | Tharindu Fernando et.al. | 2505.04888 | null |
2025-05-07
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-07 | Perpetuating Misogyny with Generative AI: How Model Personalization Normalizes Gendered Harm | Laura Wagner et.al. | 2505.04600 | null |
2025-05-07 | Learning Real Facial Concepts for Independent Deepfake Detection | Ming-Hui Liu et.al. | 2505.04460 | null |
2025-05-07 | DATA: Multi-Disentanglement based Contrastive Learning for Open-World Semi-Supervised Deepfake Attribution | Ming-Hui Liu et.al. | 2505.04384 | null |
2025-05-07 | From Incidents to Insights: Patterns of Responsibility following AI Harms | Isabel Richards et.al. | 2505.04291 | null |
2025-05-06
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-06 | Deepfakes on Demand: the rise of accessible non-consensual deepfake image generators | Will Hawkins et.al. | 2505.03859 | **[link](https://github.com/WillHawkins3/deepfakesondemand)** |
2025-05-04
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-04 | Robust AI-Generated Face Detection with Imbalanced Data | Yamini Sri Krubha et.al. | 2505.02182 | **[link](https://github.com/purdue-m2/sp_cup)** |
2025-05-04 | MLLM-Enhanced Face Forgery Detection: A Vision-Language Fusion Solution | Siran Peng et.al. | 2505.02013 | null |
2025-05-03
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-03 | Detecting Musical Deepfakes | Nick Sunday et.al. | 2505.09633 | **[link](https://github.com/nicksunday/deepfake-music-detector)** |
2025-05-01
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-01 | AWARE-NET: Adaptive Weighted Averaging for Robust Ensemble Network in Deepfake Detection | Muhammad Salman et.al. | 2505.00312 | **[link](https://github.com/recluzegeek/AWARE-NET)** |
2025-04-30
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-30 | Generative AI in Financial Institution: A Global Survey of Opportunities, Threats, and Regulation | Bikash Saha et.al. | 2504.21574 | null |
2025-04-30 | End-to-end Audio Deepfake Detection from RAW Waveforms: a RawNet-Based Approach with Cross-Dataset Evaluation | Andrea Di Pierno et.al. | 2504.20923 | null |
2025-04-29
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-29 | A Brief Review for Compression and Transfer Learning Techniques in DeepFake Detection | Andreas Karathanasis et.al. | 2504.21066 | null |
2025-04-29 | TriniMark: A Robust Generative Speech Watermarking Method for Trinity-Level Attribution | Yue Li et.al. | 2504.20532 | null |
2025-04-27
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-27 | Balancing Creativity and Automation: The Influence of AI on Modern Film Production and Dissemination | Yiren Xu et.al. | 2504.19275 | null |
2025-04-27 | CapsFake: A Multimodal Capsule Network for Detecting Instruction-Guided Deepfakes | Tuan Nguyen et.al. | 2504.19212 | null |
2025-04-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-24 | Towards Generalizable Deepfake Detection with Spatial-Frequency Collaborative Learning and Hierarchical Cross-Modal Fusion | Mengyu Qiao et.al. | 2504.17223 | null |
2025-04-19
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-19 | BMRL: Bi-Modal Guided Multi-Perspective Representation Learning for Zero-Shot Deepfake Attribution | Yaning Zhang et.al. | 2504.14129 | null |
2025-04-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-18 | MLEP: Multi-granularity Local Entropy Patterns for Universal AI-generated Image Detection | Lin Yuan et.al. | 2504.13726 | null |
2025-04-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-16 | Benchmarking Audio Deepfake Detection Robustness in Real-world Communication Scenarios | Haohan Shi et.al. | 2504.12423 | null |
2025-04-15
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-15 | Big Brother is Watching: Proactive Deepfake Detection via Learnable Hidden Face | Hongbo Li et.al. | 2504.11309 | null |
2025-04-15 | Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy | Botao Zhao et.al. | 2504.10819 | null |
2025-04-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-14 | SafeSpeech: Robust and Universal Voice Protection Against Malicious Speech Synthesis | Zhisheng Zhang et.al. | 2504.09839 | **[link](https://github.com/wxzyd123/safespeech)** |
2025-04-13
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-13 | FractalForensics: Proactive Deepfake Detection and Localization via Fractal Watermarks | Tianyi Wang et.al. | 2504.09451 | null |
2025-04-13 | Detecting Localized Deepfake Manipulations Using Action Unit-Guided Video Representations | Tharun Anand et.al. | 2503.22121 | null |
2025-04-10
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-10 | LoRAX: LoRA eXpandable Networks for Continual Synthetic Image Attribution | Danielle Sullivan-Pao et.al. | 2504.08149 | **[link](https://github.com/mit-ll/lorax_cil)** |
2025-04-09
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-09 | Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning | Ashutosh Chaubey et.al. | 2504.07198 | null |
2025-04-09 | Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception | Yuankun Xie et.al. | 2504.06753 | **[link](https://github.com/xieyuankun/all-type-add)** |
2025-04-08
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-08 | Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing | Tianchi Liu et.al. | 2504.05657 | **[link](https://github.com/liu-tianchi/nes2net)** |
2025-04-07
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-07 | From Specificity to Generality: Revisiting Generalizable Artifacts in Detecting Face Deepfakes | Long Ma et.al. | 2504.04827 | null |
2025-04-07 | SUEDE: Shared Unified Experts for Physical-Digital Face Attack Detection Enhancement | Zuying Xie et.al. | 2504.04818 | null |
2025-04-04
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-04 | Three Forensic Cues for JPEG AI Images | Sandra Bergmann et.al. | 2504.03191 | null |
2025-04-03
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-03 | Comparative Analysis of Deepfake Detection Models: New Approaches and Perspectives | Matheus Martins Batista et.al. | 2504.02900 | null |
2025-04-02
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-02 | Robust AI-Synthesized Image Detection via Multi-feature Frequency-aware Learning | Hongfei Cai et.al. | 2504.02879 | null |
2025-04-02 | Detecting Lip-Syncing Deepfakes: Vision Temporal Transformer for Analyzing Mouth Inconsistencies | Soumyya Kanti Datta et.al. | 2504.01470 | **[link](https://github.com/skrantidatta/lipinc-v2)** |
2025-04-01
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-01 | FA^{3}-CLIP: Frequency-Aware Cues Fusion and Attack-Agnostic Prompt Learning for Unified Face Attack Detection | Yongze Li et.al. | 2504.00454 | null |
2025-03-29
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-29 | Synthetic Art Generation and DeepFake Detection A Study on Jamini Roy Inspired Dataset | Kushal Agrawal et.al. | 2503.23226 | null |
2025-03-29 | Can Multi-modal (reasoning) LLMs work as deepfake detectors? | Simiao Ren et.al. | 2503.20084 | null |
2025-03-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-26 | MVFNet: Multipurpose Video Forensics Network using Multiple Forms of Forensic Evidence | Tai D. Nguyen et.al. | 2503.20991 | null |
2025-03-26 | Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector | Xiao Guo et.al. | 2503.20188 | **[link](https://github.com/chelsea234/m2f2_det)** |
2025-03-26 | Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection | Andrii Yermakov et.al. | 2503.19683 | **[link](https://github.com/yermandy/deepfake-detection)** |
2025-03-26 | InnerSelf: Designing Self-Deepfaked Voice for Emotional Well-being | Guang Dai et.al. | 2503.14257 | null |
2025-03-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-24 | SCVI: Bridging Social and Cyber Dimensions for Comprehensive Vulnerability Assessment | Shutonu Mitra et.al. | 2503.20806 | null |
2025-03-24 | NullSwap: Proactive Identity Cloaking Against Deepfake Face Swapping | Tianyi Wang et.al. | 2503.18678 | null |
2025-03-24 | Deepfake-Eval-2024: A Multi-Modal In-the-Wild Benchmark of Deepfakes Circulated in 2024 | Nuria Alina Chandra et.al. | 2503.02857 | **[link](https://github.com/nuriachandra/deepfake-eval-2024)** |
2025-03-23
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-23 | Anomaly Detection and Localization for Speech Deepfakes via Feature Pyramid Matching | Emma Coletta et.al. | 2503.18032 | null |
2025-03-21
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-21 | Measuring the Robustness of Audio Deepfake Detectors | Xiang Li et.al. | 2503.17577 | **[link](https://github.com/Jessegator/Audio_robustness_evaluation)** |
2025-03-21 | D2Fusion: Dual-domain Fusion with Feature Superposition for Deepfake Detection | Xueqi Qiu et.al. | 2503.17184 | null |
2025-03-20
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-20 | TruthLens: Explainable DeepFake Detection for Face Manipulated and Fully Synthetic Data | Rohit Kundu et.al. | 2503.15867 | null |
2025-03-19
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-19 | Cyber Threats in Financial Transactions -- Addressing the Dual Challenge of AI and Quantum Computing | Ahmed M. Elmisery et.al. | 2503.15678 | null |
2025-03-19 | TruthLens: A Training-Free Paradigm for DeepFake Detection | Ritabrata Chakraborty et.al. | 2503.15342 | null |
2025-03-19 | Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation | Siwei Wen et.al. | 2503.14905 | null |
2025-03-19 | Unlocking the Capabilities of Vision-Language Models for Generalizable and Explainable Deepfake Detection | Peipeng Yu et.al. | 2503.14853 | null |
2025-03-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-18 | ExDDV: A New Dataset for Explainable Deepfake Detection in Video | Vlad Hondru et.al. | 2503.14421 | **[link](https://github.com/vladhondru25/exddv)** |
2025-03-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-16 | Deepfake Detection with Optimized Hybrid Model: EAR Biometric Descriptor via Improved RCNN | Ruchika Sharma et.al. | 2503.12381 | null |
2025-03-14
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-14 | Deepfake Detection of Face Images based on a Convolutional Neural Network | Lukas Kroiß et.al. | 2503.11389 | null |
2025-03-11
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-11 | Unmasking the Unknown: Facial Deepfake Detection in the Open-Set Paradigm | Nadarasar Bahavan et.al. | 2503.08055 | null |
2025-03-10
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-10 | VoD: Learning Volume of Differences for Video-Based Deepfake Detection | Ying Xu et.al. | 2503.07607 | **[link](https://github.com/xuyingzhongguo/vod)** |
2025-03-09
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-09 | Chameleon: On the Scene Diversity and Domain Variety of AI-Generated Videos Detection | Meiyu Zeng et.al. | 2503.06624 | null |
2025-03-08
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-08 | VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models | Xinan He et.al. | 2503.06142 | null |
2025-03-06
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-06 | Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems | Jooyoung Lee et.al. | 2503.04945 | null |
2025-03-05
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-05 | Reduced Spatial Dependency for More General Video-level Deepfake Detection | Beilin Chu et.al. | 2503.03270 | null |
2025-03-04
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-04 | Deepfake Detection via Knowledge Injection | Tonghui Li et.al. | 2503.02503 | null |
2025-02-28
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-28 | Two-Stream Spatial-Temporal Transformer Framework for Person Identification via Natural Conversational Keypoints | Masoumeh Chapariniya et.al. | 2502.20803 | null |
2025-02-27
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-27 | DIN-CTS: Low-Complexity Depthwise-Inception Neural Network with Contrastive Training Strategy for Deepfake Speech Detection | Lam Pham et.al. | 2502.20225 | null |
AIGC
2025-06-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-26 | Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test | Ziyue Li et.al. | 2506.21551 | null |
2025-06-26 | mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale | Xiaona Zhou et.al. | 2506.21550 | null |
2025-06-26 | Exploring the Design Space of 3D MLLMs for CT Report Generation | Mohammed Baharoon et.al. | 2506.21535 | null |
2025-06-26 | "What's Up, Doc?": Analyzing How Users Seek Health Information in Large-Scale Conversational AI Datasets | Akshay Paruchuri et.al. | 2506.21532 | null |
2025-06-26 | Potemkin Understanding in Large Language Models | Marina Mancoridis et.al. | 2506.21521 | null |
2025-06-26 | Enhancing User Engagement in Socially-Driven Dialogue through Interactive LLM Alignments | Jiashuo Wang et.al. | 2506.21497 | null |
2025-06-26 | Bridging Offline and Online Reinforcement Learning for LLMs | Jack Lanchantin et.al. | 2506.21495 | null |
2025-06-26 | Controllable 3D Placement of Objects with Scene-Aware Diffusion Models | Mohamed Omran et.al. | 2506.21446 | null |
2025-06-26 | Text2Cypher Across Languages: Evaluating Foundational Models Beyond English | Makbule Gulcin Ozsoy et.al. | 2506.21445 | null |
2025-06-26 | Domain Knowledge-Enhanced LLMs for Fraud and Concept Drift Detection | Ali Şenol et.al. | 2506.21443 | null |
2025-06-26 | Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning | Prajwal Koirala et.al. | 2506.21427 | null |
2025-06-26 | XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation | Bowen Chen et.al. | 2506.21416 | null |
2025-06-26 | Scalable Bayesian Low-Rank Adaptation of Large Language Models via Stochastic Variational Subspace Inference | Colin Samplawski et.al. | 2506.21408 | null |
2025-06-26 | Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation | Guanting Dong et.al. | 2506.21384 | null |
2025-06-26 | Structuralist Approach to AI Literary Criticism: Leveraging Greimas Semiotic Square for Large Language Models | Fangzhou Dong et.al. | 2506.21360 | null |
2025-06-26 | CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations | Julian Lorenz et.al. | 2506.21357 | null |
2025-06-26 | DynamicBench: Evaluating Real-Time Report Generation in Large Language Models | Jingyao Li et.al. | 2506.21343 | null |
2025-06-26 | Latent Prototype Routing: Achieving Near-Perfect Load Balancing in Mixture-of-Experts | Jiajie Yang et.al. | 2506.21328 | null |
2025-06-26 | Multimodal LLMs for Visualization Reconstruction and Understanding | Can Liu et.al. | 2506.21319 | null |
2025-06-26 | Exploring Adapter Design Tradeoffs for Low Resource Music Generation | Atharva Mehta et.al. | 2506.21298 | null |
2025-06-25
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-25 | Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs | Sonia K. Murthy et.al. | 2506.20666 | null |
2025-06-25 | The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind | Andrei Lupu et.al. | 2506.20664 | null |
2025-06-25 | Memento: Note-Taking for Your Future Self | Chao Wan et.al. | 2506.20642 | null |
2025-06-25 | Telegrapher's Generative Model via Kac Flows | Richard Duong et.al. | 2506.20641 | null |
2025-06-25 | AI Assistants to Enhance and Exploit the PETSc Knowledge Base | Barry Smith et.al. | 2506.20608 | null |
2025-06-25 | Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm | Baixiang Huang et.al. | 2506.20606 | null |
2025-06-25 | Video Perception Models for 3D Scene Synthesis | Rui Huang et.al. | 2506.20601 | null |
2025-06-25 | SFNet: Fusion of Spatial and Frequency-Domain Features for Remote Sensing Image Forgery Detection | Ji Qi et.al. | 2506.20599 | null |
2025-06-25 | Fine-Tuning and Prompt Engineering of LLMs, for the Creation of Multi-Agent AI for Addressing Sustainable Protein Production Challenges | Alexander D. Kalian et.al. | 2506.20598 | null |
2025-06-25 | AI in the Writing Process: How Purposeful AI Support Fosters Student Writing | Momin N. Siddiqui et.al. | 2506.20595 | null |
2025-06-25 | CCISolver: End-to-End Detection and Repair of Method-Level Code-Comment Inconsistency | Renyi Zhong et.al. | 2506.20558 | null |
2025-06-25 | Large Language Model-Driven Code Compliance Checking in Building Information Modeling | Soumya Madireddy et.al. | 2506.20551 | null |
2025-06-25 | When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs | Ammar Khairi et.al. | 2506.20544 | null |
2025-06-25 | WattsOnAI: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI Workloads | Hongzhen Huang et.al. | 2506.20535 | null |
2025-06-25 | Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios | Wenbin Gan et.al. | 2506.20531 | null |
2025-06-25 | Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards | Charles Arnal et.al. | 2506.20520 | null |
2025-06-25 | BotHash: Efficient and Training-Free Bot Detection Through Approximate Nearest Neighbor | Edoardo Di Paolo et.al. | 2506.20503 | null |
2025-06-25 | ReCode: Updating Code API Knowledge with Reinforcement Learning | Haoze Wu et.al. | 2506.20495 | null |
2025-06-25 | Generative AI for Vulnerability Detection in 6G Wireless Networks: Advances, Case Study, and Future Directions | Shuo Yang et.al. | 2506.20488 | null |
2025-06-25 | GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching | Guinan Su et.al. | 2506.20480 | null |
2025-06-24
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-24 | JoyAgents-R1: Joint Evolution Dynamics for Versatile Multi-LLM Agents with Reinforcement Learning | Ai Han et.al. | 2506.19846 | null |
2025-06-24 | MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration | Yucheng Zhou et.al. | 2506.19835 | null |
2025-06-24 | A standard transformer and attention with linear biases for molecular conformer generation | Viatcheslav Gurev et.al. | 2506.19834 | null |
2025-06-24 | ProxelGen: Generating Proteins as 3D Densities | Felix Faltings et.al. | 2506.19820 | null |
2025-06-24 | KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality | Baochang Ren et.al. | 2506.19807 | null |
2025-06-24 | LLM-Based Social Simulations Require a Boundary | Zengqing Wu et.al. | 2506.19806 | null |
2025-06-24 | KnowML: Improving Generalization of ML-NIDS with Attack Knowledge Graphs | Xin Fan Guo et.al. | 2506.19802 | null |
2025-06-24 | Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study | Yuqi Zhu et.al. | 2506.19794 | null |
2025-06-24 | Line ratio identification of external photoevaporation | Tyger Peake et.al. | 2506.19788 | null |
2025-06-24 | SAGE: Strategy-Adaptive Generation Engine for Query Rewriting | Teng Wang et.al. | 2506.19783 | null |
2025-06-24 | Alleviating User-Sensitive bias with Fair Generative Sequential Recommendation Model | Yang Liu et.al. | 2506.19777 | null |
2025-06-24 | Canary in the Mine: An LLM Augmented Survey of Disciplinary Complaints to the Ordre des ingénieurs du Québec (OIQ) | Tammy Mackenzie et.al. | 2506.19775 | null |
2025-06-24 | Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation | Jun Wang et.al. | 2506.19774 | null |
2025-06-24 | Automatic Prompt Optimization for Knowledge Graph Construction: Insights from an Empirical Study | Nandana Mihindukulasooriya et.al. | 2506.19773 | null |
2025-06-24 | A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects | Shulan Ruan et.al. | 2506.19769 | null |
2025-06-24 | SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning | Yuqian Fu et.al. | 2506.19767 | null |
2025-06-24 | Arabic Dialect Classification using RNNs, Transformers, and Large Language Models: A Comparative Analysis | Omar A. Essameldin et.al. | 2506.19753 | null |
2025-06-24 | Noise Consistency Training: A Native Approach for One-Step Generator in Learning Additional Controls | Yihong Luo et.al. | 2506.19741 | null |
2025-06-24 | Breaking Barriers: Do Reinforcement Post Training Gains Transfer To Unseen Domains? | Chuxuan Hu et.al. | 2506.19733 | null |
2025-06-24 | Who Does What in Deep Learning? Multidimensional Game-Theoretic Attribution of Function of Neural Units | Shrey Dixit et.al. | 2506.19732 | null |
2025-06-23
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-23 | FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation | Kaiyi Huang et.al. | 2506.18899 | null |
2025-06-23 | Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations | Jiaming Han et.al. | 2506.18898 | null |
2025-06-23 | MinD: Unified Visual Imagination and Control via Hierarchical World Models | Xiaowei Chi et.al. | 2506.18897 | null |
2025-06-23 | ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs | Jiaru Zou et.al. | 2506.18896 | null |
2025-06-23 | Steering Conceptual Bias via Transformer Latent-Subspace Activation | Vansh Sharma et.al. | 2506.18887 | null |
2025-06-23 | Let Your Video Listen to Your Music! | Xinyu Zhang et.al. | 2506.18881 | null |
2025-06-23 | OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization | Yiyou Sun et.al. | 2506.18880 | null |
2025-06-23 | CommVQ: Commutative Vector Quantization for KV Cache Compression | Junyan Li et.al. | 2506.18879 | null |
2025-06-23 | OmniGen2: Exploration to Advanced Multimodal Generation | Chenyuan Wu et.al. | 2506.18871 | null |
2025-06-23 | OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation | Qijun Gan et.al. | 2506.18866 | null |
2025-06-23 | LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning | Yuhao Wu et.al. | 2506.18841 | null |
2025-06-23 | Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories | Islem Bouzenia et.al. | 2506.18824 | null |
2025-06-23 | RWESummary: A Framework and Test for Choosing Large Language Models to Summarize Real-World Evidence (RWE) Studies | Arjun Mukerji et.al. | 2506.18819 | null |
2025-06-23 | FORGE: An LLM-driven Framework for Large-Scale Smart Contract Vulnerability Dataset Construction | Jiachi Chen et.al. | 2506.18795 | null |
2025-06-23 | 3D Arena: An Open Platform for Generative 3D Evaluation | Dylan Ebert et.al. | 2506.18787 | null |
2025-06-23 | TRIZ Agents: A Multi-Agent LLM Approach for TRIZ-Based Innovation | Kamil Szczepanik et.al. | 2506.18783 | null |
2025-06-23 | Existing LLMs Are Not Self-Consistent For Simple Tasks | Zhenru Lin et.al. | 2506.18781 | null |
2025-06-23 | DefFusionNet: Learning Multimodal Goal Shapes for Deformable Object Manipulation via a Diffusion-based Probabilistic Model | Bao Thach et.al. | 2506.18779 | null |
2025-06-23 | Programming by Backprop: LLMs Acquire Reusable Algorithmic Abstractions During Code Training | Jonathan Cook et.al. | 2506.18777 | null |
2025-06-23 | ContinualFlow: Learning and Unlearning with Neural Flow Matching | Lorenzo Simone et.al. | 2506.18747 | null |
2025-06-20
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219 | null |
2025-06-20 | Fine-Tuning Lowers Safety and Disrupts Evaluation Consistency | Kathleen C. Fraser et.al. | 2506.17209 | null |
2025-06-20 | Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems | Matias Martinez et.al. | 2506.17208 | null |
2025-06-20 | Confidence Scoring for LLM-Generated SQL in Supply Chain Data Extraction | Jiekai Ma et.al. | 2506.17203 | null |
2025-06-20 | Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation | Jianglong Ye et.al. | 2506.17198 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197 | null |
2025-06-20 | Detecting LLM-Generated Short Answers and Effects on Learner Performance | Shambhavi Bhushan et.al. | 2506.17196 | null |
2025-06-20 | Towards AI Search Paradigm | Yuchen Li et.al. | 2506.17188 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171 | null |
2025-06-20 | The MedPerturb Dataset: What Non-Content Perturbations Reveal About Human and Clinical LLM Decision Making | Abinitha Gourabathina et.al. | 2506.17163 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144 | null |
2025-06-20 | MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification | David Jacob Drexlin et.al. | 2506.17140 | null |
2025-06-20 | Large Language Model Unlearning for Source Code | Xue Jiang et.al. | 2506.17125 | null |
2025-06-20 | When Can Model-Free Reinforcement Learning be Enough for Thinking? | Josiah P. Hanna et.al. | 2506.17124 | null |
2025-06-20 | Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving | Chuxue Cao et.al. | 2506.17104 | null |
2025-06-20 | Chain-of-Thought Prompting Obscures Hallucination Cues in Large Language Models: An Empirical Evaluation | Jiahao Cheng et.al. | 2506.17088 | null |
2025-06-20 | Tower+: Bridging Generality and Translation Specialization in Multilingual LLMs | Ricardo Rei et.al. | 2506.17080 | null |
2025-06-20 | Simultaneous Translation with Offline Speech and LLM Models in CUNI Submission to IWSLT 2025 | Dominik Macháček et.al. | 2506.17077 | null |
2025-06-20 | LLM-Based Bot Broadens the Range of Arguments in Online Discussions, Even When Transparently Disclosed as AI | Valeria Vuk et.al. | 2506.17073 | null |
2025-06-20 | Empowering Near-Field Communications in Low-Altitude Economy with LLM: Fundamentals, Potentials, Solutions, and Future Directions | Zhuo Xu et.al. | 2506.17067 | null |
2025-06-18
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-18 | Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards | Qingming Liu et.al. | 2506.15684 | null |
2025-06-18 | PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning | Yuhui Shi et.al. | 2506.15683 | null |
2025-06-18 | Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model | Anirud Aggarwal et.al. | 2506.15682 | **[link](https://github.com/aniaggarwal/ecad)** |
2025-06-18 | GenRecal: Generation after Recalibration from Large to Small Vision-Language Models | Byung-Kwan Lee et.al. | 2506.15681 | null |
2025-06-18 | CC-LEARN: Cohort-based Consistency Learning | Xiao Ye et.al. | 2506.15662 | null |
2025-06-18 | PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection | Wenhao Li et.al. | 2506.15656 | null |
2025-06-18 | deepSURF: Detecting Memory Safety Vulnerabilities in Rust Through Fuzzing LLM-Augmented Harnesses | Georgios Androutsopoulos et.al. | 2506.15648 | null |
2025-06-18 | Revisiting Compositional Generalization Capability of Large Language Models Considering Instruction Following Ability | Yusuke Sakai et.al. | 2506.15629 | null |
2025-06-18 | The Effect of State Representation on LLM Agent Behavior in Dynamic Routing Games | Lyle Goodyear et.al. | 2506.15624 | null |
2025-06-18 | LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning | Gabrel J. Perin et.al. | 2506.15606 | **[link](https://github.com/vita-group/lox)** |
2025-06-18 | From Model to Classroom: Evaluating Generated MCQs for Portuguese with Narrative and Difficulty Concerns | Bernardo Leite et.al. | 2506.15598 | null |
2025-06-18 | LiteGD: Lightweight and dynamic GPU Dispatching for Large-scale Heterogeneous Clusters | Kunming Zhang et.al. | 2506.15595 | null |
2025-06-18 | One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution | Yujing Sun et.al. | 2506.15591 | **[link](https://github.com/yjsunnn/dloral)** |
2025-06-18 | Gender Inclusivity Fairness Index (GIFI): A Multilevel Framework for Evaluating Gender Diversity in Large Language Models | Zhengyang Shan et.al. | 2506.15568 | **[link](https://github.com/zhengyangshan/gifi)** |
2025-06-18 | Managing Complex Failure Analysis Workflows with LLM-based Reasoning and Acting Agents | Aline Dobrovsky et.al. | 2506.15567 | null |
2025-06-18 | PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction | Shufan Li et.al. | 2506.15556 | null |
2025-06-18 | Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models | Teysir Baoueb et.al. | 2506.15530 | null |
2025-06-18 | Lessons from Training Grounded LLMs with Verifiable Rewards | Shang Hong Sim et.al. | 2506.15522 | null |
2025-06-18 | RePCS: Diagnosing Data Memorization in LLM-Powered Retrieval-Augmented Generation | Le Vu Anh et.al. | 2506.15513 | null |
2025-06-18 | Optimizing Web-Based AI Query Retrieval with GPT Integration in LangChain A CoT-Enhanced Prompt Engineering Approach | Wenqi Guan et.al. | 2506.15512 | null |
2025-06-17
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-17 | Large Language Models -- the Future of Fundamental Physics? | Caroline Heneka et.al. | 2506.14757 | null |
2025-06-17 | Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs | Ring Team et.al. | 2506.14731 | null |
2025-06-17 | AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes | Jiahao Qiu et.al. | 2506.14728 | null |
2025-06-17 | Adaptive Accompaniment with ReaLchords | Yusong Wu et.al. | 2506.14723 | null |
2025-06-17 | Unified Software Engineering agent as AI Software Engineer | Leonhard Applis et.al. | 2506.14683 | null |
2025-06-17 | AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models | Ads Dawson et.al. | 2506.14682 | null |
2025-06-17 | Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality | Yuto Harada et.al. | 2506.14681 | null |
2025-06-17 | Issue Retrieval and Verification Enhanced Supplementary Code Comment Generation | Yanzhen Zou et.al. | 2506.14649 | null |
2025-06-17 | Passing the Turing Test in Political Discourse: Fine-Tuning LLMs to Mimic Polarized Social Media Comments | . Pazzaglia et.al. | 2506.14645 | null |
2025-06-17 | Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot | Xiang Cheng et.al. | 2506.14641 | null |
2025-06-17 | AIn't Nothing But a Survey? Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation | Leah von der Heyde et.al. | 2506.14634 | null |
2025-06-17 | Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language Models | Chenchen Yuan et.al. | 2506.14625 | null |
2025-06-17 | Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees | Ahmed Heakl et.al. | 2506.14606 | null |
2025-06-17 | Align Your Flow: Scaling Continuous-Time Flow Map Distillation | Amirmojtaba Sabour et.al. | 2506.14603 | null |
2025-06-17 | NetRoller: Interfacing General and Specialized Models for End-to-End Autonomous Driving | Ren Xin et.al. | 2506.14589 | null |
2025-06-17 | GenerationPrograms: Fine-grained Attribution with Executable Programs | David Wan et.al. | 2506.14580 | null |
2025-06-17 | AlphaDecay: Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs | Di He et.al. | 2506.14562 | null |
2025-06-17 | Doppelgänger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack | Daewon Kang et.al. | 2506.14539 | null |
2025-06-17 | Automatic Qiskit Code Refactoring Using Large Language Models | José Manuel Suárez et.al. | 2506.14535 | null |
2025-06-17 | M2BeamLLM: Multimodal Sensing-empowered mmWave Beam Prediction with Large Language Models | Can Zheng et.al. | 2506.14532 | null |
2025-06-17 | Prefix-Tuning+: Modernizing Prefix-Tuning by Decoupling the Prefix from Attention | Haonan Wang et.al. | 2506.13674 | null |
2025-06-16
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-16 | Diagnosing and Improving Diffusion Models by Estimating the Optimal Loss Value | Yixian Xu et.al. | 2506.13763 | null |
2025-06-16 | Discrete Diffusion in Large Language and Multimodal Models: A Survey | Runpeng Yu et.al. | 2506.13759 | null |
2025-06-16 | AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning | Zewei Zhou et.al. | 2506.13757 | null |
2025-06-16 | UltraZoom: Generating Gigapixel Images from Regular Photos | Jingwei Ma et.al. | 2506.13756 | null |
2025-06-16 | Steering LLM Thinking with Budget Guidance | Junyan Li et.al. | 2506.13752 | null |
2025-06-16 | Evaluating Large Language Models for Phishing Detection, Self-Consistency, Faithfulness, and Explainability | Shova Kuikel et.al. | 2506.13746 | null |
2025-06-16 | LTRR: Learning To Rank Retrievers for LLMs | To Eun Kim et.al. | 2506.13743 | null |
2025-06-16 | Instruction Following by Boosting Attention of Large Language Models | Vitoria Guardieiro et.al. | 2506.13734 | null |
2025-06-16 | Attribution-guided Pruning for Compression, Circuit Discovery, and Targeted Correction in LLMs | Sayed Mohammad Vakilzadeh Hatefi et.al. | 2506.13727 | null |
2025-06-16 | TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning | Junru Zhang et.al. | 2506.13705 | null |
2025-06-16 | Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry | Junyoung Seo et.al. | 2506.13697 | null |
2025-06-16 | OneRec Technical Report | Guorui Zhou et.al. | 2506.13695 | null |
2025-06-16 | Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems | Shang-Chi Tsai et.al. | 2506.13692 | null |
2025-06-16 | UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions | Zhucun Xue et.al. | 2506.13691 | null |
2025-06-16 | Enforcing tail calibration when training probabilistic forecast models | Jakob Benjamin Wessel et.al. | 2506.13687 | null |
2025-06-16 | An LLM's Apology: Outsourcing Awkwardness in the Age of AI | Twm Stone et.al. | 2506.13685 | null |
2025-06-16 | Turning Down the Heat: A Critical Analysis of Min-p Sampling in Language Models | Rylan Schaeffer et.al. | 2506.13681 | null |
2025-06-16 | MultiViT2: A Data-augmented Multimodal Neuroimaging Prediction Framework via Latent Diffusion Model | Bi Yuda et.al. | 2506.13667 | null |
2025-06-16 | We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems | Junfeng Fang et.al. | 2506.13666 | null |
2025-06-13
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-13 | code_transformed: The Influence of Large Language Models on Code | Yuliang Xu et.al. | 2506.12014 | null |
2025-06-13 | Tracing LLM Reasoning Processes with Strategic Games: A Framework for Planning, Revision, and Resource-Constrained Decision Making | Xiaopeng Yuan et.al. | 2506.12012 | null |
2025-06-13 | How Visual Representations Map to Language Feature Space in Multimodal LLMs | Constantin Venhoff et.al. | 2506.11976 | null |
2025-06-13 | A Robust Local Fréchet Regression Using Unbalanced Neural Optimal Transport with Applications to Dynamic Single-cell Genomics Data | Binghao Yan et.al. | 2506.11969 | null |
2025-06-13 | Improving Large Language Model Safety with Contrastive Representation Learning | Samuel Simko et.al. | 2506.11938 | null |
2025-06-13 | Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback | Dongwei Jiang et.al. | 2506.11930 | null |
2025-06-13 | LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming? | Zihan Zheng et.al. | 2506.11928 | null |
2025-06-13 | Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation | Min-Seop Kwak et.al. | 2506.11924 | null |
2025-06-13 | TreeRL: LLM Reinforcement Learning with On-Policy Tree Search | Zhenyu Hou et.al. | 2506.11902 | null |
2025-06-13 | Towards a Cascaded LLM Framework for Cost-effective Human-AI Decision-Making | Claudio Fanconi et.al. | 2506.11887 | null |
2025-06-13 | Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache | Xiaoran Liu et.al. | 2506.11886 | null |
2025-06-13 | Addressing Bias in LLMs: Strategies and Application to Fair AI-based Recruitment | Alejandro Peña et.al. | 2506.11880 | null |
2025-06-13 | A Short Survey on Formalising Software Requirements using Large Language Models | Arshad Beg et.al. | 2506.11874 | null |
2025-06-13 | LLM-based Dynamic Differential Testing for Database Connectors with Reinforcement Learning-Guided Prompt Selection | Ce Lyu et.al. | 2506.11870 | null |
2025-06-13 | Post Persona Alignment for Multi-Session Dialogue Generation | Yi-Pei Chen et.al. | 2506.11857 | null |
2025-06-13 | TrustGLM: Evaluating the Robustness of GraphLLMs Against Prompt, Text, and Structure Attacks | Qihai Zhang et.al. | 2506.11844 | null |
2025-06-13 | Your Ride, Your Rules: Psychology and Cognition Enabled Automated Driving Systems | Zhipeng Bao et.al. | 2506.11842 | null |
2025-06-13 | Revealing Political Bias in LLMs through Structured Multi-Agent Debate | Aishwarya Bandaru et.al. | 2506.11825 | null |
2025-06-13 | Rethinking Multilingual Vision-Language Translation: Dataset, Evaluation, and Adaptation | Xintong Wang et.al. | 2506.11820 | null |
2025-06-13 | On the Performance of LLMs for Real Estate Appraisal | Margot Geerts et.al. | 2506.11812 | null |
2025-06-13 | MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning | Yuxuan Luo et.al. | 2506.10963 | null |
2025-06-12
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-12 | SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis | Weiliang Chen et.al. | 2506.10981 | null |
2025-06-12 | GenWorld: Towards Detecting AI-generated Real-world Simulation Videos | Weiliang Chen et.al. | 2506.10975 | null |
2025-06-12 | AutoMind: Adaptive Knowledgeable Agent for Automated Data Science | Yixin Ou et.al. | 2506.10974 | null |
2025-06-12 | Farseer: A Refined Scaling Law in Large Language Models | Houyi Li et.al. | 2506.10972 | null |
2025-06-12 | GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation | Ning Gao et.al. | 2506.10966 | null |
2025-06-12 | ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark | Kangwei Liu et.al. | 2506.10960 | null |
2025-06-12 | SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks | Lianghong Guo et.al. | 2506.10954 | **[link](https://github.com/deepsoftwareanalytics/swe-factory)** |
2025-06-12 | Build the web for agents, not agents for the web | Xing Han Lù et.al. | 2506.10953 | null |
2025-06-12 | Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors | Chen Yueh-Han et.al. | 2506.10949 | **[link](https://github.com/yuehhanchen/monitoring-decomposition-attack)** |
2025-06-12 | Execution Guided Line-by-Line Code Generation | Boaz Lavon et.al. | 2506.10948 | null |
2025-06-12 | GUARD: Guided Unlearning and Retention via Data Attribution for Large Language Models | Evelyn Ma et.al. | 2506.10946 | null |
2025-06-12 | Self-Adapting Language Models | Adam Zweiger et.al. | 2506.10943 | null |
2025-06-12 | Dynamic Epistemic Friction in Dialogue | Timothy Obiso et.al. | 2506.10934 | null |
2025-06-12 | The Role of Generative AI in Facilitating Social Interactions: A Scoping Review | T. T. J. E. Arets et.al. | 2506.10927 | null |
2025-06-12 | Robustly Improving LLM Fairness in Realistic Settings via Interpretability | Adam Karvonen et.al. | 2506.10922 | null |
2025-06-12 | Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization | Or Shafran et.al. | 2506.10920 | null |
2025-06-12 | Magistral | Mistral-AI et.al. | 2506.10910 | null |
2025-06-12 | Beyond Gold Standards: Epistemic Ensemble of LLM Judges for Formal Mathematical Reasoning | Lan Zhang et.al. | 2506.10903 | null |
2025-06-12 | GenPlanX. Generation of Plans and Execution | Daniel Borrajo et.al. | 2506.10897 | null |
2025-06-11
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-11 | Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling | Tim Z. Xiao et.al. | 2506.09998 | null |
2025-06-11 | From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring | Yang Li et.al. | 2506.09996 | null |
2025-06-11 | Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation | Xinyu Yang et.al. | 2506.09991 | null |
2025-06-11 | Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs | Hiroshi Matsuda et.al. | 2506.09983 | null |
2025-06-11 | When Detection Fails: The Power of Fine-Tuned Models to Generate Human-Like Social Media Text | Hillary Dawkins et.al. | 2506.09975 | null |
2025-06-11 | SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance | Wentao Ge et.al. | 2506.09968 | null |
2025-06-11 | Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing | Junfei Wu et.al. | 2506.09965 | null |
2025-06-11 | LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection Challenge | Sahar Abdelnabi et.al. | 2506.09956 | null |
2025-06-11 | Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking | Wuwei Zhang et.al. | 2506.09944 | null |
2025-06-11 | VerIF: Verification Engineering for Reinforcement Learning in Instruction Following | Hao Peng et.al. | 2506.09942 | null |
2025-06-11 | PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants | Zheng Zhao et.al. | 2506.09902 | null |
2025-06-11 | The Emergence of Abstract Thought in Large Language Models Beyond Any Language | Yuxin Chen et.al. | 2506.09890 | null |
2025-06-11 | Attention Head Embeddings with Trainable Deep Kernels for Hallucination Detection in LLMs | Rodion Oblovatny et.al. | 2506.09886 | null |
2025-06-11 | Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning | Xiangning Yu et.al. | 2506.09853 | null |
2025-06-11 | Dataset of News Articles with Provenance Metadata for Media Relevance Assessment | Tomas Peterka et.al. | 2506.09847 | null |
2025-06-11 | A Deep Generative Model for the Simulation of Discrete Karst Networks | Dany Lauzon et.al. | 2506.09832 | null |
2025-06-11 | EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection | Christoph Schuhmann et.al. | 2506.09827 | null |
2025-06-11 | Metritocracy: Representative Metrics for Lite Benchmarks | Ariel Procaccia et.al. | 2506.09813 | null |
2025-06-11 | Do LLMs Give Psychometrically Plausible Responses in Educational Assessments? | Andreas Säuberli et.al. | 2506.09796 | null |
2025-06-11 | Where Journalism Silenced Voices: Exploring Discrimination in the Representation of Indigenous Communities in Bangladesh | Abhijit Paul et.al. | 2506.09771 | null |
2025-06-11 | Comparing human and LLM proofreading in L2 writing: Impact on lexical and syntactic features | Hakyung Sung et.al. | 2506.09021 | null |
2025-06-11 | Boosting Rust Unit Test Coverage through Hybrid Program Analysis and Large Language Models | Bei Chu et.al. | 2506.09002 | null |
2025-06-11 | Towards Better Code Generation: Adaptive Decoding with Uncertainty Guidance | Kaifeng He et.al. | 2506.08980 | null |
2025-06-10
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-10 | ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering | Yuki Imajuku et.al. | 2506.09050 | null |
2025-06-10 | VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning | Li Kang et.al. | 2506.09049 | null |
2025-06-10 | Understanding Task Vectors in In-Context Learning: Emergence, Functionality, and Limitations | Yuxin Dong et.al. | 2506.09048 | null |
2025-06-10 | Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation | Xiaowen Ma et.al. | 2506.09046 | null |
2025-06-10 | Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better | Dianyi Wang et.al. | 2506.09040 | null |
2025-06-10 | AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions | Polina Kirichenko et.al. | 2506.09038 | null |
2025-06-10 | FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed | Sizhe Dang et.al. | 2506.09034 | null |
2025-06-10 | Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning | Haozhen Zhang et.al. | 2506.09033 | null |
2025-06-10 | Diffuse and Disperse: Image Generation with Representation Regularization | Runqian Wang et.al. | 2506.09027 | null |
2025-06-10 | e3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs | Amrith Setlur et.al. | 2506.09026 | null |
2025-06-10 | Edit Flows: Flow Matching with Edit Operations | Marton Havasi et.al. | 2506.09018 | null |
2025-06-10 | Learning to Reason Across Parallel Samples for LLM Reasoning | Jianing Qi et.al. | 2506.09014 | null |
2025-06-10 | Branched Schrödinger Bridge Matching | Sophia Tang et.al. | 2506.09007 | null |
2025-06-10 | Do Concept Replacement Techniques Really Erase Unacceptable Concepts? | Anudeep Das et.al. | 2506.08991 | null |
2025-06-10 | SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning | Xiao Liang et.al. | 2506.08989 | null |
2025-06-10 | ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations | Amirreza Rouhi et.al. | 2506.08968 | null |
2025-06-10 | Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model | Ailin Huang et.al. | 2506.08967 | null |
2025-06-09
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-09 | Hidden in plain sight: VLMs overlook their visual representations | Stephanie Fu et.al. | 2506.08008 | null |
2025-06-09 | Dreamland: Controllable World Creation with Simulator and Generative Models | Sicheng Mo et.al. | 2506.08006 | null |
2025-06-09 | Aligning Text, Images, and 3D Structure Token-by-Token | Aadarsh Sahoo et.al. | 2506.08002 | null |
2025-06-09 | Reparameterized LLM Training via Orthogonal Equivalence Transformation | Zeju Qiu et.al. | 2506.08001 | null |
2025-06-09 | MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation | Junhao Chen et.al. | 2506.07999 | null |
2025-06-09 | Generative Modeling of Weights: Generalization or Memorization? | Boya Zeng et.al. | 2506.07998 | null |
2025-06-09 | Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System | Fan Yang et.al. | 2506.07997 | null |
2025-06-09 | HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization | Hongzheng Chen et.al. | 2506.07972 | null |
2025-06-09 | SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design | Wenxin Tang et.al. | 2506.07964 | null |
2025-06-09 | Reinforcing Multimodal Understanding and Generation with Dual Self-rewards | Jixiang Hong et.al. | 2506.07963 | null |
2025-06-09 | Correlated Errors in Large Language Models | Elliot Kim et.al. | 2506.07962 | null |
2025-06-09 | TokenBreak: Bypassing Text Classification Models Through Token Manipulation | Kasimir Schulz et.al. | 2506.07948 | null |
2025-06-09 | Statistical Hypothesis Testing for Auditing Robustness in Language Models | Paulius Rauba et.al. | 2506.07947 | null |
2025-06-09 | ProtocolLLM: RTL Benchmark for SystemVerilog Generation of Communication Protocols | Arnav Sheth et.al. | 2506.07945 | null |
2025-06-09 | Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations | Yizhen Li et.al. | 2506.07943 | null |
2025-06-09 | Adversarial Attack Classification and Robustness Testing for Large Language Models for Code | Yang Liu et.al. | 2506.07942 | null |
2025-06-09 | Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor | Rishit Dagli et.al. | 2506.07932 | null |
2025-06-09 | Solving Inequality Proofs with Large Language Models | Jiayi Sheng et.al. | 2506.07927 | null |
2025-06-09 | LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement | Dimitris Panagopoulos et.al. | 2506.07915 | null |
2025-06-09 | FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling | Sifan Wang et.al. | 2506.07902 | null |
2025-06-06
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-06 | Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias | Yuanzhe Hu et.al. | 2506.06280 | null |
2025-06-06 | CoMemo: LVLMs Need Image Context with Image Memory | Shi Liu et.al. | 2506.06279 | null |
2025-06-06 | Distillation Robustifies Unlearning | Bruce W. Lee et.al. | 2506.06278 | null |
2025-06-06 | STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis | Jiatao Gu et.al. | 2506.06276 | null |
2025-06-06 | AdvSumm: Adversarial Training for Bias Mitigation in Text Summarization | Mukur Gupta et.al. | 2506.06273 | null |
2025-06-06 | PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time | Weizhi Zhang et.al. | 2506.06254 | null |
2025-06-06 | Visual Graph Arena: Evaluating Visual Conceptualization of Vision and Multimodal Large Language Models | Zahra Babaiee et.al. | 2506.06242 | null |
2025-06-06 | Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge | Yi Sui et.al. | 2506.06240 | null |
2025-06-06 | CompilerGPT: Leveraging Large Language Models for Analyzing and Acting on Compiler Optimization Reports | Peter Pirkelbauer et.al. | 2506.06227 | null |
2025-06-06 | PROVSYN: Synthesizing Provenance Graphs for Data Augmentation in Intrusion Detection Systems | Yi Huang et.al. | 2506.06226 | null |
2025-06-06 | Can Theoretical Physics Research Benefit from Language Agents? | Sirui Lu et.al. | 2506.06214 | null |
2025-06-06 | Model-Driven Graph Contrastive Learning | Ali Azizpour et.al. | 2506.06212 | null |
2025-06-06 | Building Models of Neurological Language | Henry Watkins et.al. | 2506.06208 | null |
2025-06-06 | Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning | Sheng Chen et.al. | 2506.06205 | null |
2025-06-06 | Does It Run and Is That Enough? Revisiting Text-to-Chart Generation with a Multi-Agent Approach | James Ford et.al. | 2506.06175 | null |
2025-06-06 | Technical Report for Egocentric Mistake Detection for the HoloAssist Challenge | Constantin Patsch et.al. | 2506.06174 | null |
2025-06-06 | The Lock-in Hypothesis: Stagnation by Algorithm | Tianyi Alex Qiu et.al. | 2506.06166 | null |
2025-06-06 | Recommender systems, stigmergy, and the tyranny of popularity | Zackary Okun Dunivin et.al. | 2506.06162 | null |
2025-06-06 | ENMA: Tokenwise Autoregression for Generative Neural PDE Operators | Armand Kassaï Koupaï et.al. | 2506.06158 | null |
2025-06-06 | Masked Language Models are Good Heterogeneous Graph Generalizers | Jinyu Yang et.al. | 2506.06157 | null |
2025-06-05
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-05 | Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets | Lei Hsiung et.al. | 2506.05346 | null |
2025-06-05 | Inference-Time Hyper-Scaling with KV Cache Compression | Adrian Łańcucki et.al. | 2506.05345 | null |
2025-06-05 | SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs | Jiahui Wang et.al. | 2506.05344 | null |
2025-06-05 | ContentV: Efficient Training of Video Generation Models with Limited Compute | Wenfeng Lin et.al. | 2506.05343 | null |
2025-06-05 | Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning | Xingjian Ran et.al. | 2506.05341 | null |
2025-06-05 | VideoMolmo: Spatio-Temporal Grounding Meets Pointing | Ghazi Shazan Ahmad et.al. | 2506.05336 | null |
2025-06-05 | Search Arena: Analyzing Search-Augmented LLMs | Mihran Miroyan et.al. | 2506.05334 | null |
2025-06-05 | Unleashing Hour-Scale Video Training for Long Video-Language Understanding | Jingyang Lin et.al. | 2506.05332 | null |
2025-06-05 | MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning | Xinyan Chen et.al. | 2506.05331 | null |
2025-06-05 | LSM-2: Learning from Incomplete Wearable Sensor Data | Maxwell A. Xu et.al. | 2506.05321 | null |
2025-06-05 | Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay | Yifan Sun et.al. | 2506.05316 | null |
2025-06-05 | Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models | Taha Entesari et.al. | 2506.05314 | null |
2025-06-05 | Learning normalized image densities via dual score matching | Florentin Guth et.al. | 2506.05310 | null |
2025-06-05 | Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games | Niv Eckhaus et.al. | 2506.05309 | **[link](https://github.com/niveck/LLMafia)** |
2025-06-05 | ProRefine: Inference-time Prompt Refinement with Textual Feedback | Deepak Pandita et.al. | 2506.05305 | null |
2025-06-05 | Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos | Weifeng Lin et.al. | 2506.05302 | null |
2025-06-05 | Sample Complexity and Representation Ability of Test-time Scaling Paradigms | Baihe Huang et.al. | 2506.05295 | null |
2025-06-05 | Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning | Nan Huo et.al. | 2506.05278 | null |
2025-06-05 | How to Unlock Time Series Editing? Diffusion-Driven Approach with Multi-Grained Control | Hao Yu et.al. | 2506.05276 | null |
2025-06-05 | From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos | Animesh Gupta et.al. | 2506.05274 | null |
2025-06-04
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-04 | Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector | Boyong He et.al. | 2506.04211 | **[link](https://github.com/heboyong/Diffusion-Domain-Teacher)** |
2025-06-04 | Language-Image Alignment with Fixed Text Encoders | Jingfeng Yang et.al. | 2506.04209 | null |
2025-06-04 | EPiC: Towards Lossless Speedup for Reasoning Training through Edge-Preserving CoT Condensation | Jinghan Jia et.al. | 2506.04205 | null |
2025-06-04 | Cascadia: A Cascade Serving System for Large Language Models | Youhe Jiang et.al. | 2506.04203 | null |
2025-06-04 | TracLLM: A Generic Framework for Attributing Long Context LLMs | Yanting Wang et.al. | 2506.04202 | null |
2025-06-04 | R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning | Qingfei Zhao et.al. | 2506.04185 | null |
2025-06-04 | SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models | Yuhao Wu et.al. | 2506.04180 | null |
2025-06-04 | SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling | Anhao Zhao et.al. | 2506.04179 | null |
2025-06-04 | Does Prompt Design Impact Quality of Data Imputation by LLMs? | Shreenidhi Srinivasan et.al. | 2506.04172 | null |
2025-06-04 | Physics-Constrained Flow Matching: Sampling Generative Models with Hard Constraints | Utkarsh Utkarsh et.al. | 2506.04171 | null |
2025-06-04 | Neural and Cognitive Impacts of AI: The Influence of Task Subjectivity on Human-LLM Collaboration | Matthew Russell et.al. | 2506.04167 | null |
2025-06-04 | N$^2$: A Unified Python Package and Test Bench for Nearest Neighbor-Based Matrix Completion | Caleb Chin et.al. | 2506.04166 | null |
2025-06-04 | VISCA: Inferring Component Abstractions for Automated End-to-End Testing | Parsa Alian et.al. | 2506.04161 | null |
2025-06-04 | A Dataset for Addressing Patient's Information Needs related to Clinical Course of Hospitalization | Sarvesh Soni et.al. | 2506.04156 | null |
2025-06-04 | Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis | Kejian Zhu et.al. | 2506.04142 | null |
2025-06-04 | Are Lexicon-Based Tools Still the Gold Standard for Valence Analysis in Low-Resource Flemish? | Ratna Kandala et.al. | 2506.04139 | null |
2025-06-04 | TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems | Shaina Raza et.al. | 2506.04133 | null |
2025-06-04 | Guided Speculative Inference for Efficient Test-Time Alignment of LLMs | Jonathan Geuter et.al. | 2506.04118 | null |
2025-06-04 | AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment | Anastasiia Ivanova et.al. | 2506.04089 | null |
2025-06-04 | Multimodal Tabular Reasoning with Privileged Structured Information | Jun-Peng Jiang et.al. | 2506.04088 | null |
2025-06-04 | Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback | Xiaoying Zhang et.al. | 2506.03106 | null |
2025-06-03
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-03 | Entity-Augmented Neuroscience Knowledge Retrieval Using Ontology and Semantic Understanding Capability of LLM | Pralaypati Ta et.al. | 2506.03145 | null |
2025-06-03 | Not All Tokens Are Meant to Be Forgotten | Xiangyu Zhou et.al. | 2506.03142 | null |
2025-06-03 | SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation | Siqi Chen et.al. | 2506.03139 | null |
2025-06-03 | Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning | Yinjie Wang et.al. | 2506.03136 | null |
2025-06-03 | Native-Resolution Image Synthesis | Zidong Wang et.al. | 2506.03131 | null |
2025-06-03 | AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation | Lu Qiu et.al. | 2506.03126 | null |
2025-06-03 | AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation | Prashanth Vijayaraghavan et.al. | 2506.03122 | null |
2025-06-03 | Retrieval-Augmented Generation as Noisy In-Context Learning: A Unified Theory and Risk Bounds | Yang Guo et.al. | 2506.03100 | null |
2025-06-03 | TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models | Chetwin Low et.al. | 2506.03099 | null |
2025-06-03 | DPO Learning with LLMs-Judge Signal for Computer Use Agents | Man Luo et.al. | 2506.03095 | null |
2025-06-03 | Literary Evidence Retrieval via Long-Context Language Models | Katherine Thai et.al. | 2506.03090 | null |
2025-06-03 | SG2VID: Scene Graphs Enable Fine-Grained Control for Video Synthesis | Ssharvien Kumar Sivakumar et.al. | 2506.03082 | null |
2025-06-03 | ORV: 4D Occupancy-centric Robot Video Generation | Xiuyu Yang et.al. | 2506.03079 | null |
2025-06-03 | StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs | Qijun Luo et.al. | 2506.03077 | null |
2025-06-03 | EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models | Mingzhe Li et.al. | 2506.03067 | null |
2025-06-03 | MAEBE: Multi-Agent Emergent Behavior Framework | Sinem Erisken et.al. | 2506.03053 | null |
2025-06-03 | Facts Do Care About Your Language: Assessing Answer Quality of Multilingual LLMs | Yuval Kansal et.al. | 2506.03051 | null |
2025-06-03 | Sample complexity of Schrödinger potential estimation | Nikita Puchkin et.al. | 2506.03043 | null |
2025-06-03 | Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective | Jintian Shao et.al. | 2506.03038 | null |
2025-06-02
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-06-02 | Guiding Generative Storytelling with Knowledge Graphs | Zhijun Pan et.al. | 2505.24803 | null |
2025-05-30
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-30 | Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents | Yaxin Luo et.al. | 2505.24878 | null |
2025-05-30 | ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL | Yu Zhang et.al. | 2505.24875 | null |
2025-05-30 | MiniMax-Remover: Taming Bad Noise Helps Video Object Removal | Bojia Zi et.al. | 2505.24873 | null |
2025-05-30 | MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning | Yiqing Liang et.al. | 2505.24871 | null |
2025-05-30 | GenSpace: Benchmarking Spatially-Aware Image Generation | Zehan Wang et.al. | 2505.24870 | null |
2025-05-30 | SiLVR: A Simple Language-based Video Reasoning Framework | Ce Zhang et.al. | 2505.24869 | null |
2025-05-30 | TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection | Xinqi Xiong et.al. | 2505.24866 | null |
2025-05-30 | ViStoryBench: Comprehensive Benchmark Suite for Story Visualization | Cailin Zhuang et.al. | 2505.24862 | null |
2025-05-30 | MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs | Gabrielle Kaili-May Liu et.al. | 2505.24858 | null |
2025-05-30 | Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning | Shuyao Xu et.al. | 2505.24850 | null |
2025-05-30 | MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning | Jingyan Shen et.al. | 2505.24846 | null |
2025-05-30 | Cascading Adversarial Bias from Injection to Distillation in Language Models | Harsh Chaudhari et.al. | 2505.24842 | null |
2025-05-30 | Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck | Yuwen Tan et.al. | 2505.24840 | null |
2025-05-30 | VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software | Brandon Man et.al. | 2505.24838 | null |
2025-05-30 | Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs | Juraj Vladika et.al. | 2505.24830 | null |
2025-05-30 | LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text | Li Yunhan et.al. | 2505.24826 | null |
2025-05-30 | PhySense: Principle-Based Physics Reasoning Benchmarking for Large Language Models | Yinggan Xu et.al. | 2505.24823 | null |
2025-05-30 | Inference Acceleration of Autoregressive Normalizing Flows by Selective Jacobi Decoding | Jiaru Zhang et.al. | 2505.24791 | null |
2025-05-30 | Drop Dropout on Single-Epoch Language Model Pretraining | Houjun Liu et.al. | 2505.24788 | null |
2025-05-30 | OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation | Size Wu et.al. | 2505.23661 | **[link](https://github.com/wusize/openuni)** |
2025-05-29
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-29 | From Chat Logs to Collective Insights: Aggregative Question Answering | Wentao Zhang et.al. | 2505.23765 | null |
2025-05-29 | DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning | Ziyin Zhang et.al. | 2505.23754 | **[link](https://github.com/jiahao004/deeptheorem)** |
2025-05-29 | ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks | Akashah Shabbir et.al. | 2505.23752 | **[link](https://github.com/mbzuai-oryx/thinkgeo)** |
2025-05-29 | MAGREF: Masked Guidance for Any-Reference Video Generation | Yufan Deng et.al. | 2505.23742 | **[link](https://github.com/magref-video/magref)** |
2025-05-29 | Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time | Mohamad Chehade et.al. | 2505.23729 | null |
2025-05-29 | MuLoCo: Muon is a practical inner optimizer for DiLoCo | Benjamin Thérien et.al. | 2505.23725 | null |
2025-05-29 | SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA | Minrui Luo et.al. | 2505.23724 | null |
2025-05-29 | ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering | Zexi Liu et.al. | 2505.23723 | **[link](https://github.com/zeroxleo/ml-agent)** |
2025-05-29 | Label-Guided In-Context Learning for Named Entity Recognition | Fan Bai et.al. | 2505.23722 | **[link](https://github.com/bflashcp3f/deer)** |
2025-05-29 | Don't Take the Premise for Granted: Evaluating the Premise Critique Ability of Large Language Models | Jinzhe Li et.al. | 2505.23715 | **[link](https://github.com/mlgroupjlu/premise_critique)** |
2025-05-29 | SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models | Zixiang Xu et.al. | 2505.23713 | **[link](https://github.com/xzx34/socialmaze)** |
2025-05-29 | Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's Math Capability | Ruida Wang et.al. | 2505.23703 | null |
2025-05-29 | Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation | Ziling Cheng et.al. | 2505.23701 | null |
2025-05-29 | Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics | Ran Zhang et.al. | 2505.23695 | **[link](https://github.com/77luvc/d2d_data2dashboard)** |
2025-05-29 | VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos | Tingyu Song et.al. | 2505.23693 | **[link](https://github.com/sighingsnow/vf-eval)** |
2025-05-29 | ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions | Beong-woo Kwak et.al. | 2505.23662 | null |
2025-05-29 | Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation | Hongxiang Zhang et.al. | 2505.23657 | null |
2025-05-29 | ARC: Argument Representation and Coverage Analysis for Zero-Shot Long Document Summarization with Instruction Following LLMs | Mohamed Elaraby et.al. | 2505.23654 | null |
2025-05-29 | How does Transformer Learn Implicit Reasoning? | Jiaran Ye et.al. | 2505.23653 | **[link](https://github.com/jiaran-ye/implicitreasoning)** |
2025-05-29 | Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems | Hoang Pham et.al. | 2505.22571 | null |
2025-05-28
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-28 | Zero-Shot Vision Encoder Grafting via LLM Surrogates | Kaiyu Yue et.al. | 2505.22664 | **[link](https://github.com/facebookresearch/zero)** |
2025-05-28 | AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models | Feng Luo et.al. | 2505.22662 | null |
2025-05-28 | GuessArena: Guess Who I Am? A Self-Adaptive Framework for Evaluating LLMs in Domain-Specific Knowledge and Reasoning | Qingchen Yu et.al. | 2505.22661 | null |
2025-05-28 | 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model | Wenbo Hu et.al. | 2505.22657 | null |
2025-05-28 | Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents | Michael Kirchhof et.al. | 2505.22655 | null |
2025-05-28 | The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason | Ang Lv et.al. | 2505.22653 | null |
2025-05-28 | On Learning Verifiers for Chain-of-Thought Reasoning | Maria-Florina Balcan et.al. | 2505.22650 | null |
2025-05-28 | Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese | Hanjia Lyu et.al. | 2505.22645 | null |
2025-05-28 | Learning Composable Chains-of-Thought | Fangcong Yin et.al. | 2505.22635 | null |
2025-05-28 | Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs | Ziling Cheng et.al. | 2505.22630 | null |
2025-05-28 | Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding | Chengyue Wu et.al. | 2505.22618 | null |
2025-05-28 | The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models | Ganqu Cui et.al. | 2505.22617 | null |
2025-05-28 | Self-Error-Instruct: Generalizing from Errors for LLMs Mathematical Reasoning | Erxin Yu et.al. | 2505.22591 | null |
2025-05-28 | Precise In-Parameter Concept Erasure in Large Language Models | Yoav Gur-Arieh et.al. | 2505.22586 | null |
2025-05-28 | DocReRank: Single-Page Hard Negative Query Generation for Training Multi-Modal RAG Rerankers | Navve Wasserman et.al. | 2505.22584 | null |
2025-05-28 | Less, but Better: Efficient Multilingual Expansion for LLMs via Layer-wise Mixture-of-Experts | Xue Zhang et.al. | 2505.22582 | null |
2025-05-28 | Fusion Steering: Prompt-Specific Activation Control | Waldemar Chang et.al. | 2505.22572 | null |
2025-05-28 | Universal Visuo-Tactile Video Understanding for Embodied Interaction | Yifan Xie et.al. | 2505.22566 | null |
2025-05-28 | Do Large Language Models Think Like the Brain? Sentence-Level Evidence from fMRI and Hierarchical Embeddings | Yu Lei et.al. | 2505.22563 | null |
2025-05-28 | Diagnosing and Resolving Cloud Platform Instability with Multi-modal RAG LLMs | Yifan Wang et.al. | 2505.21419 | null |
2025-05-27
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-27 | How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective | Shimao Zhang et.al. | 2505.21505 | null |
2025-05-27 | Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making | Yihan Wang et.al. | 2505.21503 | null |
2025-05-27 | Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers | Wei Pang et.al. | 2505.21497 | null |
2025-05-27 | Reinforcing General Reasoning without Verifiers | Xiangxin Zhou et.al. | 2505.21493 | null |
2025-05-27 | Hardware-Efficient Attention for Fast Decoding | Ted Zadouri et.al. | 2505.21487 | null |
2025-05-27 | Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming | Yang Yang et.al. | 2505.21486 | null |
2025-05-27 | Are Language Models Consequentialist or Deontological Moral Reasoners? | Keenan Samway et.al. | 2505.21479 | null |
2025-05-27 | Policy Optimized Text-to-Image Pipeline Design | Uri Gadot et.al. | 2505.21478 | null |
2025-05-27 | Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration | Zijun Liu et.al. | 2505.21471 | **[link](https://github.com/thunlp-mt/extagents)** |
2025-05-27 | PropMolFlow: Property-guided Molecule Generation with Geometry-Complete Flow Matching | Cheng Zeng et.al. | 2505.21469 | null |
2025-05-27 | Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance | Shintaro Ozaki et.al. | 2505.21458 | null |
2025-05-27 | Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling | Xiangxin Zhou et.al. | 2505.21452 | null |
2025-05-27 | Words Like Knives: Backstory-Personalized Modeling and Detection of Violent Communication | Jocelyn Shen et.al. | 2505.21451 | null |
2025-05-27 | OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers | Ziqiao Peng et.al. | 2505.21448 | null |
2025-05-27 | Can Large Reasoning Models Self-Train? | Sheikh Shafayat et.al. | 2505.21444 | null |
2025-05-27 | Hume: Introducing System-2 Thinking in Visual-Language-Action Model | Haoming Song et.al. | 2505.21432 | null |
2025-05-27 | Policy Induction: Predicting Startup Success via Explainable Memory-Augmented In-Context Learning | Xianling Mu et.al. | 2505.21427 | null |
2025-05-27 | GUARD: Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation | Naizhu Jin et.al. | 2505.21425 | null |
2025-05-27 | Autonomous Multi-Modal LLM Agents for Treatment Planning in Focused Ultrasound Ablation Surgery | Lina Zhao et.al. | 2505.21418 | null |
2025-05-27 | TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent | Dominik Meier et.al. | 2505.20118 | null |
2025-05-27 | Adaptive Deep Reasoning: Triggering Deep Thinking When Needed | Yunhao Wang et.al. | 2505.20101 | null |
2025-05-26
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-26 | Pangu Light: Weight Re-Initialization for Pruning and Accelerating LLMs | Hanting Chen et.al. | 2505.20155 | null |
2025-05-26 | UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models | Xueyan Zhang et.al. | 2505.20154 | null |
2025-05-26 | FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities | Jin Wang et.al. | 2505.20147 | null |
2025-05-26 | StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs | Jialin Yang et.al. | 2505.20139 | null |
2025-05-26 | Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers | Zhengliang Shi et.al. | 2505.20128 | null |
2025-05-26 | Agentic AI Process Observability: Discovering Behavioral Variability | Fabiana Fournier et.al. | 2505.20127 | null |
2025-05-26 | Understanding Generalization in Diffusion Models via Probability Flow Distance | Huijie Zhang et.al. | 2505.20123 | null |
2025-05-26 | Named Entity Recognition in Historical Italian: The Case of Giacomo Leopardi's Zibaldone | Cristian Santini et.al. | 2505.20113 | null |
2025-05-26 | ResSVD: Residual Compensated SVD for Large Language Model Compression | Haolei Bai et.al. | 2505.20112 | null |
2025-05-26 | Proxy-Free GFlowNet | Ruishuo Chen et.al. | 2505.20110 | null |
2025-05-26 | Language-Agnostic Suicidal Risk Detection Using Large Language Models | June-Woo Kim et.al. | 2505.20109 | null |
2025-05-26 | From Data to Modeling: Fully Open-vocabulary Scene Graph Generation | Zuyao Chen et.al. | 2505.20106 | null |
2025-05-26 | AdaTP: Attention-Debiased Token Pruning for Video Large Language Models | Fengyuan Sun et.al. | 2505.20100 | null |
2025-05-26 | Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities | Chuangtao Ma et.al. | 2505.20099 | null |
2025-05-26 | S2LPP: Small-to-Large Prompt Prediction across LLMs | Liang Cheng et.al. | 2505.20097 | null |
2025-05-26 | Multi-Domain Explainability of Preferences | Nitay Calderon et.al. | 2505.20088 | null |
2025-05-26 | Safety Through Reasoning: An Empirical Study of Reasoning Guardrail Models | Makesh Narsimhan Sreedhar et.al. | 2505.20087 | null |
2025-05-26 | Incentivizing Reasoning from Weak Supervision | Yige Yuan et.al. | 2505.20072 | null |
2025-05-23
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-23 | The Staircase of Ethics: Probing LLM Value Priorities through Multi-Step Induction to Complex Moral Dilemmas | Ya Wu et.al. | 2505.18154 | null |
2025-05-23 | Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs | Wafa Alghallabi et.al. | 2505.18152 | null |
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150 | null |
2025-05-23 | Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find | Owen Bianchi et.al. | 2505.18148 | null |
2025-05-23 | Gaming Tool Preferences in Agentic LLMs | Kazem Faghih et.al. | 2505.18135 | null |
2025-05-23 | Frankentext: Stitching random text fragments into long-form narratives | Chau Minh Pham et.al. | 2505.18128 | null |
2025-05-23 | UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification | Poojah Ganesan et.al. | 2505.18122 | null |
2025-05-23 | ProgRM: Build Better GUI Agents with Progress Rewards | Danyang Zhang et.al. | 2505.18121 | null |
2025-05-23 | Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models | Jiongran Wu et.al. | 2505.18120 | null |
2025-05-23 | Bridging Supervised Learning and Reinforcement Learning in Math Reasoning | Huayu Chen et.al. | 2505.18116 | null |
2025-05-23 | Instructify: Demystifying Metadata to Visual Instruction Tuning Data Conversion | Jacob Hansen et.al. | 2505.18115 | null |
2025-05-23 | Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM | Zinuo Li et.al. | 2505.18110 | null |
2025-05-23 | ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework | Lisheng Huang et.al. | 2505.18105 | null |
2025-05-23 | How Can I Publish My LLM Benchmark Without Giving the True Answers Away? | Takashi Ishida et.al. | 2505.18102 | null |
2025-05-23 | Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL | Joey Hong et.al. | 2505.18098 | null |
2025-05-23 | DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations | Ziqiao Peng et.al. | 2505.18096 | null |
2025-05-23 | QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization | Weizhou Shen et.al. | 2505.18092 | null |
2025-05-23 | Data Mixing Can Induce Phase Transitions in Knowledge Acquisition | Xinran Gu et.al. | 2505.18091 | null |
2025-05-23 | Stable Reinforcement Learning for Efficient Reasoning | Muzhi Dai et.al. | 2505.18086 | null |
2025-05-23 | Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding | Xiaoyi Zhang et.al. | 2505.18079 | null |
2025-05-23 | VeriFastScore: Speeding up long-form factuality evaluation | Rishanth Rajendhran et.al. | 2505.16973 | **[link](https://github.com/rishanthrajendhran/verifastscore)** |
2025-05-22
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-22 | GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning | Chengqi Duan et.al. | 2505.17022 | **[link](https://github.com/gogoduan/got-r1)** |
2025-05-22 | CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms | Shilin Yan et.al. | 2505.17020 | **[link](https://github.com/shilinyan99/crosslmm)** |
2025-05-22 | Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO | Chengzhuo Tong et.al. | 2505.17017 | **[link](https://github.com/ziyuguo99/image-generation-cot)** |
2025-05-22 | R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning | Huatong Song et.al. | 2505.17005 | **[link](https://github.com/rucaibox/r1-searcher-plus)** |
2025-05-22 | Do Large Language Models Excel in Complex Logical Reasoning with Formal Language? | Jin Jiang et.al. | 2505.16998 | **[link](https://github.com/jiangjin1999/formaleval)** |
2025-05-22 | X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs | Rui Ye et.al. | 2505.16997 | null |
2025-05-22 | DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization | Chao Zhang et.al. | 2505.16995 | null |
2025-05-22 | $\text{R}^2\text{ec}$: Towards Large Recommender Models with Reasoning | Runyang You et.al. | 2505.16994 | **[link](https://github.com/yryangang/rrec)** |
2025-05-22 | MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems | Rui Ye et.al. | 2505.16988 | **[link](https://github.com/masworks/maslab)** |
2025-05-22 | T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning | Amartya Chakraborty et.al. | 2505.16986 | null |
2025-05-22 | UFT: Unifying Supervised and Reinforcement Fine-Tuning | Mingyang Liu et.al. | 2505.16984 | **[link](https://github.com/liumy2010/uft)** |
2025-05-22 | LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding | Junlong Tong et.al. | 2505.16983 | **[link](https://github.com/eit-nlp/streamingllm)** |
2025-05-22 | Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine | Adib Bazgir et.al. | 2505.16982 | null |
2025-05-22 | Know the Ropes: A Heuristic Strategy for LLM-based Multi-Agent System Design | Zhenkun Li et.al. | 2505.16979 | null |
2025-05-22 | HyGenar: An LLM-Driven Hybrid Genetic Algorithm for Few-Shot Grammar Generation | Weizhi Tang et.al. | 2505.16978 | **[link](https://github.com/rutatang/hygenar)** |
2025-05-22 | Creatively Upscaling Images with Global-Regional Priors | Yurui Qian et.al. | 2505.16976 | null |
2025-05-22 | SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development | Yaxin Du et.al. | 2505.16975 | **[link](https://github.com/justlittlewhite/swe-dev)** |
2025-05-22 | CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark | Ahmed Heakl et.al. | 2505.16968 | **[link](https://github.com/gustavostahl/cass)** |
2025-05-22 | Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval | Nandan Thakur et.al. | 2505.16967 | null |
2025-05-22 | HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving | Zhiwen Chen et.al. | 2505.15793 | null |
2025-05-22 | Multi-modal Integration Analysis of Alzheimer's Disease Using Large Language Models and Knowledge Graphs | Kanan Kiguchi et.al. | 2505.15747 | null |
2025-05-21
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-21 | Learning to Reason via Mixture-of-Thought for Logical Reasoning | Tong Zheng et.al. | 2505.15817 | **[link](https://github.com/zhengkid/Truth_Table_Logical_Reasoning)** |
2025-05-21 | On the creation of narrow AI: hierarchy and nonlocality of neural network skills | Eric J. Michaud et.al. | 2505.15811 | null |
2025-05-21 | Neural Conditional Transport Maps | Carlos Rodriguez-Pardo et.al. | 2505.15808 | null |
2025-05-21 | Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering | Hwan Chang et.al. | 2505.15805 | null |
2025-05-21 | STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs | Zongzhao Li et.al. | 2505.15804 | null |
2025-05-21 | Interspatial Attention for Efficient 4D Human Video Generation | Ruizhi Shao et.al. | 2505.15800 | null |
2025-05-21 | Reverse Engineering Human Preferences with Reinforcement Learning | Lisa Alazraki et.al. | 2505.15795 | null |
2025-05-21 | Long-Form Information Alignment Evaluation Beyond Atomic Facts | Danna Zheng et.al. | 2505.15792 | null |
2025-05-21 | Large Language Models as Computable Approximations to Solomonoff Induction | Jun Wan et.al. | 2505.15784 | null |
2025-05-21 | IA-T2I: Internet-Augmented Text-to-Image Generation | Chuanhao Li et.al. | 2505.15779 | null |
2025-05-21 | Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space | Zhen Zhang et.al. | 2505.15778 | **[link](https://github.com/eric-ai-lab/soft-thinking)** |
2025-05-21 | Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention | Huanxuan Liao et.al. | 2505.15774 | null |
2025-05-21 | Constructing a 3D Town from a Single Image | Kaizhi Zheng et.al. | 2505.15765 | null |
2025-05-21 | Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval | Taiye Chen et.al. | 2505.15753 | null |
2025-05-21 | Evolutionary Computation and Large Language Models: A Survey of Methods, Synergies, and Applications | Dikshit Chauhan et.al. | 2505.15741 | null |
2025-05-21 | HybridProver: Augmenting Theorem Proving with LLM-Driven Proof Synthesis and Refinement | Jilin Hu et.al. | 2505.15740 | null |
2025-05-21 | Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses | Xiaoxue Yang et.al. | 2505.15738 | null |
2025-05-21 | DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning | Gaurav Srivastava et.al. | 2505.15734 | null |
2025-05-20
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-20 | Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning | Haolei Xu et.al. | 2505.14684 | null |
2025-05-20 | NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search | Sunhao Dai et.al. | 2505.14680 | null |
2025-05-20 | UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models | Xiaojie Gu et.al. | 2505.14679 | null |
2025-05-20 | Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning | Jiaer Xia et.al. | 2505.14677 | null |
2025-05-20 | Training-Free Watermarking for Autoregressive Image Generation | Yu Tong et.al. | 2505.14673 | null |
2025-05-20 | Quartet: Native FP4 Training Can Be Optimal for Large Language Models | Roberto L. Castro et.al. | 2505.14669 | null |
2025-05-20 | ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions | Bufang Yang et.al. | 2505.14668 | null |
2025-05-20 | SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment | Wonje Jeung et.al. | 2505.14667 | null |
2025-05-20 | Abacus: A Cost-Based Optimizer for Semantic Operator Systems | Matthew Russo et.al. | 2505.14661 | null |
2025-05-20 | Cost-Augmented Monte Carlo Tree Search for LLM-Assisted Planning | Zihao Zhang et.al. | 2505.14656 | null |
2025-05-20 | Beyond Words: Multimodal LLM Knows When to Speak | Zikai Liao et.al. | 2505.14654 | null |
2025-05-20 | General-Reasoner: Advancing LLM Reasoning Across All Domains | Xueguang Ma et.al. | 2505.14652 | null |
2025-05-20 | Think Only When You Need with Large Hybrid-Reasoning Models | Lingjie Jiang et.al. | 2505.14631 | null |
2025-05-20 | KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models | Fnu Mohbat et.al. | 2505.14629 | **[link](https://github.com/mohbattharani/kerl)** |
2025-05-20 | Debating for Better Reasoning: An Unsupervised Multimodal Approach | Ashutosh Adhikari et.al. | 2505.14627 | null |
2025-05-20 | TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning | Zhangchen Xu et.al. | 2505.14625 | null |
2025-05-20 | Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs | Morgan Lindsay Heisler et.al. | 2505.14620 | null |
2025-05-20 | Linear Control of Test Awareness Reveals Differential Compliance in Reasoning Models | Sahar Abdelnabi et.al. | 2505.14617 | null |
2025-05-20 | SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas | Anjiang Wei et.al. | 2505.14615 | null |
2025-05-20 | sudoLLM: On Multi-role Alignment of Language Models | Soumadeep Saha et.al. | 2505.14607 | null |
2025-05-20 | Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Reasoning | Adam Štorek et.al. | 2505.13353 | null |
2025-05-19
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-19 | Mean Flows for One-step Generative Modeling | Zhengyang Geng et.al. | 2505.13447 | null |
2025-05-19 | Unlocking Non-Invasive Brain-to-Text | Dulhan Jayalath et.al. | 2505.13446 | null |
2025-05-19 | Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards | Xiaoyuan Liu et.al. | 2505.13445 | null |
2025-05-19 | Optimizing Anytime Reasoning via Budget Relative Policy Optimization | Penghui Qi et.al. | 2505.13438 | **[link](https://github.com/sail-sg/anytimereasoner)** |
2025-05-19 | Synthetic-Powered Predictive Inference | Meshi Bashari et.al. | 2505.13432 | null |
2025-05-19 | Fine-tuning Quantized Neural Networks with Zeroth-order Optimization | Sifeng Shang et.al. | 2505.13430 | null |
2025-05-19 | Learnware of Language Models: Specialized Small Language Models Can Do Big | Zhi-Hao Tan et.al. | 2505.13425 | null |
2025-05-19 | Make Still Further Progress: Chain of Thoughts for Tabular Data Leaderboard | Si-Yang Liu et.al. | 2505.13421 | null |
2025-05-19 | Dementia Through Different Eyes: Explainable Modeling of Human and LLM Perceptions for Early Awareness | Lotem Peled-Cohen et.al. | 2505.13418 | null |
2025-05-19 | Gluon: Making Muon & Scion Great Again! (Bridging Theory and Practice of LMO-based Optimizers for LLMs) | Artem Riabinin et.al. | 2505.13416 | null |
2025-05-19 | AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database | Rong Bian et.al. | 2505.13406 | null |
2025-05-19 | MR. Judge: Multimodal Reasoner as a Judge | Renjie Pi et.al. | 2505.13403 | null |
2025-05-19 | Thinkless: LLM Learns When to Think | Gongfan Fang et.al. | 2505.13379 | **[link](https://github.com/vainf/thinkless)** |
2025-05-19 | Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation | Yasi Zhang et.al. | 2505.13377 | null |
2025-05-19 | Seeing, Saying, Solving: An LLM-to-TL Framework for Cooperative Robots | Dan BW Choe et.al. | 2505.13376 | null |
2025-05-19 | Minimum-Excess-Work Guidance | Christopher Kolloff et.al. | 2505.13375 | null |
2025-05-19 | What Prompts Don't Say: Understanding and Managing Underspecification in LLM Prompts | Chenyang Yang et.al. | 2505.13360 | null |
2025-05-19 | One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling | Nimrod Berman et.al. | 2505.13358 | **[link](https://github.com/azencot-group/kdm)** |
2025-05-19 | Multi-Armed Bandits Meet Large Language Models | Djallel Bouneffouf et.al. | 2505.13355 | null |
2025-05-16
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-16 | QVGen: Pushing the Limit of Quantized Video Generative Models | Yushi Huang et.al. | 2505.11497 | null |
2025-05-16 | SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning | Yige Xu et.al. | 2505.11484 | null |
2025-05-16 | Improving Assembly Code Performance with Large Language Models via Reinforcement Learning | Anjiang Wei et.al. | 2505.11480 | null |
2025-05-16 | HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages | Zhilin Wang et.al. | 2505.11475 | null |
2025-05-16 | Disentangling Reasoning and Knowledge in Medical Large Language Models | Rahul Thapa et.al. | 2505.11462 | null |
2025-05-16 | ProxyPrompt: Securing System Prompts against Prompt Extraction Attacks | Zhixiong Zhuang et.al. | 2505.11459 | null |
2025-05-16 | LLMs unlock new paths to monetizing exploits | Nicholas Carlini et.al. | 2505.11449 | null |
2025-05-16 | Is Compression Really Linear with Code Intelligence? | Xianzhen Luo et.al. | 2505.11441 | null |
2025-05-16 | MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production | Chao Jin et.al. | 2505.11432 | null |
2025-05-16 | When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs | Xiaomin Li et.al. | 2505.11423 | null |
2025-05-16 | EdgeWisePersona: A Dataset for On-Device User Profiling from Natural Language Interactions | Patryk Bartkowiak et.al. | 2505.11417 | null |
2025-05-16 | MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems | Yinsicheng Jiang et.al. | 2505.11415 | null |
2025-05-16 | CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs | Sijia Chen et.al. | 2505.11413 | null |
2025-05-16 | Visual Planning: Let's Think Only with Images | Yi Xu et.al. | 2505.11409 | null |
2025-05-16 | EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large Language Models | Bohao Xing et.al. | 2505.11405 | null |
2025-05-16 | Can AI automatically analyze public opinion? A LLM agents-based agentic pipeline for timely public opinion analysis | Jing Liu et.al. | 2505.11401 | null |
2025-05-16 | GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents | Lingxiao Diao et.al. | 2505.11368 | null |
2025-05-16 | Phare: A Safety Probe for Large Language Models | Pierre Le Jeune et.al. | 2505.11365 | null |
2025-05-16 | LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors | Rao Ma et.al. | 2505.11352 | null |
2025-05-16 | Benchmarking Critical Questions Generation: A Challenging Reasoning Task for Large Language Models | Blanca Calvo Figueras et.al. | 2505.11341 | null |
2025-05-15
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-15 | T2A-Feedback: Improving Basic Capabilities of Text-to-Audio Generation via Fine-grained AI Feedback | Zehan Wang et.al. | 2505.10561 | null |
2025-05-15 | Neural Thermodynamic Laws for Large Language Model Training | Ziming Liu et.al. | 2505.10559 | null |
2025-05-15 | Flowing Through Hilbert Space: Quantum-Enhanced Generative Models for Lattice Field Theory | Jehu Martinez et.al. | 2505.10553 | null |
2025-05-15 | CheXGenBench: A Unified Benchmark For Fidelity, Privacy and Utility of Synthetic Chest Radiographs | Raman Dutt et.al. | 2505.10496 | **[link](https://github.com/Raman1121/CheXGenBench)** |
2025-05-15 | RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs | Vibha Belavadi et.al. | 2505.10495 | null |
2025-05-15 | Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective | Yutao Mou et.al. | 2505.10494 | **[link](https://github.com/murraytom/cov-eval)** |
2025-05-15 | CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning | Shaohan Wang et.al. | 2505.10493 | null |
2025-05-15 | Campus AI vs Commercial AI: A Late-Breaking Study on How LLM As-A-Service Customizations Shape Trust and Usage Patterns | Leon Hannig et.al. | 2505.10490 | null |
2025-05-15 | UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation | Yi Li et.al. | 2505.10483 | null |
2025-05-15 | Large Language Models for Cancer Communication: Evaluating Linguistic Quality, Safety, and Accessibility in Generative AI | Agnik Saha et.al. | 2505.10472 | null |
2025-05-15 | AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges | Ranjan Sapkota et.al. | 2505.10468 | null |
2025-05-15 | Superposition Yields Robust Neural Scaling | Yizhou Liu et.al. | 2505.10465 | **[link](https://github.com/liuyz0/superpositionscaling)** |
2025-05-15 | Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations? | Pedro Orvalho et.al. | 2505.10443 | null |
2025-05-15 | Learning to Think: Information-Theoretic Reinforcement Fine-Tuning for LLMs | Jingyao Wang et.al. | 2505.10425 | null |
2025-05-15 | Are LLM-generated plain language summaries truly understandable? A large-scale crowdsourced evaluation | Yue Guo et.al. | 2505.10409 | null |
2025-05-15 | Two-Stage Generative Model for Intracranial Aneurysm Meshes with Morphological Marker Conditioning | Wenhao Ding et.al. | 2505.10407 | **[link](https://github.com/anonymousaneug/aneug)** |
2025-05-15 | Visual Fidelity Index for Generative Semantic Communications with Critical Information Embedding | Jianhao Huang et.al. | 2505.10405 | null |
2025-05-15 | Rethinking Repetition Problems of LLMs in Code Generation | Yihong Dong et.al. | 2505.10402 | **[link](https://github.com/lyc127/rpg)** |
2025-05-15 | Multi-domain Multilingual Sentiment Analysis in Industry: Predicting Aspect-based Opinion Quadruples | Benjamin White et.al. | 2505.10389 | null |
2025-05-15 | Are Sparse Autoencoders Useful for Java Function Bug Detection? | Rui Melo et.al. | 2505.10375 | null |
2025-05-15 | Beyond Likes: How Normative Feedback Complements Engagement Signals on Social Media | Yuchen Wu et.al. | 2505.09583 | null |
2025-05-15 | SafePath: Conformal Prediction for Safe LLM-Based Autonomous Navigation | Achref Doula et.al. | 2505.09427 | null |
2025-05-14
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-14 | Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors | Nicolas Dupuis et.al. | 2505.09610 | null |
2025-05-14 | Adversarial Suffix Filtering: a Defense Pipeline for LLMs | David Khachaturov et.al. | 2505.09602 | null |
2025-05-14 | How Hungry is AI? Benchmarking Energy, Water, and Carbon Footprint of LLM Inference | Nidhal Jegham et.al. | 2505.09598 | null |
2025-05-14 | WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models | Abdullah Mushtaq et.al. | 2505.09595 | null |
2025-05-14 | Ethics and Persuasion in Reinforcement Learning from Human Feedback: A Procedural Rhetorical Approach | Shannon Lodoen et.al. | 2505.09576 | null |
2025-05-14 | MIGRATION-BENCH: Repository-Level Code Migration Benchmark from Java 8 | Linbo Liu et.al. | 2505.09569 | null |
2025-05-14 | Layered Unlearning for Adversarial Relearning | Timothy Qian et.al. | 2505.09500 | **[link](https://github.com/12tqian/layered-unlearning)** |
2025-05-14 | Card Sorting Simulator: Augmenting Design of Logical Information Architectures with Large Language Models | Eduard Kuric et.al. | 2505.09478 | null |
2025-05-14 | Deploying Foundation Model-Enabled Air and Ground Robots in the Field: Challenges and Opportunities | Zachary Ravichandran et.al. | 2505.09477 | null |
2025-05-14 | Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM? | Andrew Rouditchenko et.al. | 2505.09439 | null |
2025-05-14 | Evaluating GPT- and Reasoning-based Large Language Models on Physics Olympiad Problems: Surpassing Human Performance and Implications for Educational Assessment | Paul Tschisgale et.al. | 2505.09438 | null |
2025-05-14 | CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios | Raghav Garg et.al. | 2505.09436 | **[link](https://github.com/kapilsprinklr/cxmarena)** |
2025-05-14 | The Influence of Human-inspired Agentic Sophistication in LLM-driven Strategic Reasoners | Vince Trencsenyi et.al. | 2505.09396 | null |
2025-05-14 | Quantum-Enhanced Parameter-Efficient Learning for Typhoon Trajectory Forecasting | Chen-Yu Liu et.al. | 2505.09395 | null |
2025-05-14 | Qwen3 Technical Report | An Yang et.al. | 2505.09388 | null |
2025-05-14 | Efficient Modelling of Lyman-α opacity fluctuations during late EoR | Barun Maity et.al. | 2505.09369 | null |
2025-05-14 | Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis | Bingxin Ke et.al. | 2505.09358 | null |
2025-05-14 | Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures | Chenggang Zhao et.al. | 2505.09343 | null |
2025-05-14 | Granite-speech: open-source speech-aware LLMs with strong English ASR capabilities | George Saon et.al. | 2505.08699 | null |
2025-05-13
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-13 | PCS-UQ: Uncertainty Quantification via the Predictability-Computability-Stability Framework | Abhineet Agarwal et.al. | 2505.08784 | null |
2025-05-13 | CodePDE: An Inference Framework for LLM-driven PDE Solver Generation | Shanda Li et.al. | 2505.08783 | null |
2025-05-13 | Generative Molecular Design with Steerable and Granular Synthesizability Control | Jeff Guo et.al. | 2505.08774 | null |
2025-05-13 | SPAT: Sensitivity-based Multihead-attention Pruning on Time Series Forecasting Models | Suhan Guo et.al. | 2505.08768 | null |
2025-05-13 | AC-Reason: Towards Theory-Guided Actual Causality Reasoning with Large Language Models | Yanxi Zhang et.al. | 2505.08750 | null |
2025-05-13 | DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models | Xiaoyang Chen et.al. | 2505.08744 | **[link](https://github.com/deepmathllm/deepmath)** |
2025-05-13 | Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies | Xiaoliang Luo et.al. | 2505.08739 | null |
2025-05-13 | NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context | Ben Yao et.al. | 2505.08734 | null |
2025-05-13 | Securing RAG: A Risk Assessment and Mitigation Framework | Lukas Ammann et.al. | 2505.08728 | null |
2025-05-13 | Memorization-Compression Cycles Improve Generalization | Fangyuan Yu et.al. | 2505.08727 | null |
2025-05-13 | PWC-MoE: Privacy-Aware Wireless Collaborative Mixture of Experts | Yang Su et.al. | 2505.08719 | null |
2025-05-13 | LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs | K M Sajjadul Islam et.al. | 2505.08704 | null |
2025-05-13 | A Survey of Deep Learning for Complex Speech Spectrograms | Yuying Xie et.al. | 2505.08694 | null |
2025-05-13 | Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation | Sheng Liang et.al. | 2505.08690 | null |
2025-05-13 | Revealing economic facts: LLMs know more than they say | Marcus Buckmann et.al. | 2505.08662 | null |
2025-05-13 | Enhancing Software Development with Context-Aware Conversational Agents: A User Study on Developer Interactions with Chatbots | Glaucia Melo et.al. | 2505.08648 | null |
2025-05-13 | WixQA: A Multi-Dataset Benchmark for Enterprise Retrieval-Augmented Generation | Dvir Cohen et.al. | 2505.08643 | null |
2025-05-13 | TRAIL: Trace Reasoning and Agentic Issue Localization | Darshan Deshpande et.al. | 2505.08638 | null |
2025-05-13 | Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models | Donghoon Kim et.al. | 2505.08622 | null |
2025-05-13 | Codifying Character Logic in Role-Playing | Letian Peng et.al. | 2505.07705 | **[link](https://github.com/KomeijiForce/Codified_Profile_Koishiday_2025)** |
2025-05-13 | OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit | Arun S. Maiya et.al. | 2505.07672 | **[link](https://github.com/amaiya/onprem)** |
2025-05-12
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-12 | H$^{\mathbf{3}}$DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning | Yiyang Lu et.al. | 2505.07819 | null |
2025-05-12 | DanceGRPO: Unleashing GRPO on Visual Generation | Zeyue Xue et.al. | 2505.07818 | null |
2025-05-12 | Continuous Visual Autoregressive Generation via Score Maximization | Chenze Shao et.al. | 2505.07812 | **[link](https://github.com/shaochenze/ear)** |
2025-05-12 | Improving Trajectory Stitching with Flow Models | Reece O'Mahoney et.al. | 2505.07802 | null |
2025-05-12 | Overflow Prevention Enhances Long-Context Recurrent LLMs | Assaf Ben-Kish et.al. | 2505.07793 | null |
2025-05-12 | Domain Regeneration: How well do LLMs match syntactic properties of text domains? | Da Ju et.al. | 2505.07784 | null |
2025-05-12 | Relative Overfitting and Accept-Reject Framework | Yanxin Liu et.al. | 2505.07783 | null |
2025-05-12 | MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering | Rushi Qiang et.al. | 2505.07782 | null |
2025-05-12 | Synthesizing Diverse Network Flow Datasets with Scalable Dynamic Multigraph Generation | Arya Grayeli et.al. | 2505.07777 | null |
2025-05-12 | Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving | Xinji Mai et.al. | 2505.07773 | **[link](https://github.com/anonymize-author/agentrl)** |
2025-05-12 | Enhancing Code Generation via Bidirectional Comment-Level Mutual Grounding | Yifeng Di et.al. | 2505.07768 | null |
2025-05-12 | LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention | Jiangling Zhang et.al. | 2505.07734 | null |
2025-05-12 | Spoken Language Understanding on Unseen Tasks With In-Context Learning | Neeraj Agrawal et.al. | 2505.07731 | null |
2025-05-12 | Circuit Partitioning Using Large Language Models for Quantum Compilation and Simulations | Pranav Sinha et.al. | 2505.07711 | null |
2025-05-12 | PatchTrack: A Comprehensive Analysis of ChatGPT's Influence on Pull Request Outcomes | Daniel Ogenrwot et.al. | 2505.07700 | null |
2025-05-12 | SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models | Hang Wu et.al. | 2505.07680 | null |
2025-05-12 | Benchmarking Retrieval-Augmented Generation for Chemistry | Xianrui Zhong et.al. | 2505.07671 | null |
2025-05-12 | A Case Study Investigating the Role of Generative AI in Quality Evaluations of Epics in Agile Software Development | Werner Geyer et.al. | 2505.07664 | null |
2025-05-09
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-09 | From Millions of Tweets to Actionable Insights: Leveraging LLMs for User Profiling | Vahid Rahimzadeh et.al. | 2505.06184 | null |
2025-05-09 | A Large Language Model-Enhanced Q-learning for Capacitated Vehicle Routing Problem with Time Windows | Linjiang Cao et.al. | 2505.06178 | null |
2025-05-09 | MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills | Niladri Shekhar Dutt et.al. | 2505.06176 | null |
2025-05-09 | Turbo-ICL: In-Context Learning-Based Turbo Equalization | Zihang Song et.al. | 2505.06175 | null |
2025-05-09 | A Scaling Law for Token Efficiency in LLM Fine-Tuning Under Fixed Compute Budgets | Ryan Lagasse et.al. | 2505.06150 | null |
2025-05-09 | Can Prompting LLMs Unlock Hate Speech Detection across Languages? A Zero-shot and Few-shot Study | Faeze Ghorbanpour et.al. | 2505.06149 | null |
2025-05-09 | ELA-ZSON: Efficient Layout-Aware Zero-Shot Object Navigation Agent with Hierarchical Planning | Jiawei Hou et.al. | 2505.06131 | null |
2025-05-09 | Constraints to Lorentz violation and ultrahigh-energy electrons in D-foamy space-times | Chengyi Li et.al. | 2505.06121 | null |
2025-05-09 | LLMs Get Lost In Multi-Turn Conversation | Philippe Laban et.al. | 2505.06120 | **[link](https://github.com/microsoft/lost_in_conversation)** |
2025-05-09 | Photovoltaic Defect Image Generator with Boundary Alignment Smoothing Constraint for Domain Shift Mitigation | Dongying Li et.al. | 2505.06117 | null |
2025-05-09 | LLMs Outperform Experts on Challenging Biology Benchmarks | Lennart Justen et.al. | 2505.06108 | null |
2025-05-09 | Free and Fair Hardware: A Pathway to Copyright Infringement-Free Verilog Generation using LLMs | Sam Bush et.al. | 2505.06096 | null |
2025-05-09 | Assessing Tenstorrent's RISC-V MatMul Acceleration Capabilities | Hiari Pizzini Cavagna et.al. | 2505.06085 | null |
2025-05-09 | Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information | Joshua Harris et.al. | 2505.06046 | null |
2025-05-09 | Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification | Leon Eshuijs et.al. | 2505.06032 | **[link](https://github.com/watermeleon/shortcut_mechanisms)** |
2025-05-09 | Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation | Stefan Vasilev et.al. | 2505.06027 | null |
2025-05-09 | Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models | Dawid Wisniewski et.al. | 2505.06004 | null |
2025-05-09 | Task-Adapter++: Task-specific Adaptation with Order-aware Alignment for Few-shot Action Recognition | Congqi Cao et.al. | 2505.06002 | null |
2025-05-09 | Offline Multi-agent Reinforcement Learning via Score Decomposition | Dan Qiao et.al. | 2505.05968 | null |
2025-05-09 | GEORCE: A Fast New Control Algorithm for Computing Geodesics | Frederik Möbius Rygaard et.al. | 2505.05961 | null |
2025-05-09 | EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation | Biao Yi et.al. | 2505.05440 | null |
2025-05-09 | LiTransProQA: an LLM-based Literary Translation evaluation metric with Professional Question Answering | Ran Zhang et.al. | 2505.05423 | null |
2025-05-08
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-08 | 3D Scene Generation: A Survey | Beichen Wen et.al. | 2505.05474 | **[link](https://github.com/hzxie/awesome-3d-scene-generation)** |
2025-05-08 | StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant | Haibo Wang et.al. | 2505.05467 | null |
2025-05-08 | ComPO: Preference Alignment via Comparison Oracles | Peter Chen et.al. | 2505.05465 | null |
2025-05-08 | Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging | Shiqi Chen et.al. | 2505.05464 | **[link](https://github.com/shiqichen17/vlm_merging)** |
2025-05-08 | Conversational Process Model Redesign | Nataliia Klievtsova et.al. | 2505.05453 | null |
2025-05-08 | clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations | Chalamalasetti Kranti et.al. | 2505.05445 | null |
2025-05-08 | GesPrompt: Leveraging Co-Speech Gestures to Augment LLM-Based Interaction in Virtual Reality | Xiyun Hu et.al. | 2505.05441 | null |
2025-05-08 | Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data | Yudong Wang et.al. | 2505.05427 | null |
2025-05-08 | Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans? | Valeria Pastorino et.al. | 2505.05406 | null |
2025-05-08 | Modelling and Verifying Neuronal Archetypes in Coq | Abdorrahim Bahrami et.al. | 2505.05362 | null |
2025-05-08 | DSDrive: Distilling Large Language Model for Lightweight End-to-End Autonomous Driving with Unified Reasoning and Planning | Wenru Liu et.al. | 2505.05360 | null |
2025-05-08 | Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization | Sooyoung Park et.al. | 2505.05343 | **[link](https://github.com/swimmiing/ACL-SSL)** |
2025-05-08 | FLAM: Frame-Wise Language-Audio Modeling | Yusong Wu et.al. | 2505.05335 | null |
2025-05-08 | ICon: In-Context Contribution for Automatic Data Selection | Yixin Yang et.al. | 2505.05327 | null |
2025-05-08 | Toward Reasonable Parrots: Why Large Language Models Should Argue with Us by Design | Elena Musi et.al. | 2505.05298 | null |
2025-05-08 | PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes | Ahmed Abdelreheem et.al. | 2505.05288 | null |
2025-05-08 | HEXGEN-TEXT2SQL: Optimizing LLM Inference Request Scheduling for Agentic Text-to-SQL Workflow | You Peng et.al. | 2505.05286 | null |
2025-05-08 | Latte: Transfering LLMs' Latent-level Knowledge for Few-shot Tabular Learning | Ruxue Shi et.al. | 2505.05237 | null |
2025-05-08 | MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection | Zhihao Zhang et.al. | 2505.04594 | null |
2025-05-08 | Defining and Quantifying Creative Behavior in Popular Image Generators | Aditi Ramaswamy et.al. | 2505.04497 | null |
2025-05-07
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-07 | EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning | Zhenghao Xing et.al. | 2505.04623 | null |
2025-05-07 | On Path to Multimodal Generalist: General-Level and General-Bench | Hao Fei et.al. | 2505.04620 | null |
2025-05-07 | OmniGIRL: A Multilingual and Multimodal Benchmark for GitHub Issue Resolution | Lianghong Guo et.al. | 2505.04606 | null |
2025-05-07 | ZeroSearch: Incentivize the Search Capability of LLMs without Searching | Hao Sun et.al. | 2505.04588 | null |
2025-05-07 | SlideItRight: Using AI to Find Relevant Slides and Provide Feedback for Open-Ended Questions | Chloe Qianhui Zhao et.al. | 2505.04584 | null |
2025-05-07 | Comparative Analysis of Carbon Footprint in Manual vs. LLM-Assisted Code Development | Kuen Sum Cheung et.al. | 2505.04521 | null |
2025-05-07 | Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs | Yehui Tang et.al. | 2505.04519 | null |
2025-05-07 | Detecting Spelling and Grammatical Anomalies in Russian Poetry Texts | Ilya Koziev et.al. | 2505.04507 | null |
2025-05-07 | A Design Space for the Critical Validation of LLM-Generated Tabular Data | Madhav Sachdeva et.al. | 2505.04487 | null |
2025-05-07 | Efficient Flow Matching using Latent Variables | Anirban Samaddar et.al. | 2505.04486 | null |
2025-05-07 | CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation | Jiahao Li et.al. | 2505.04481 | null |
2025-05-07 | TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution | Zhikai Zhao et.al. | 2505.04480 | null |
2025-05-07 | Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration | Shigeki Karita et.al. | 2505.04457 | null |
2025-05-07 | M2Rec: Multi-scale Mamba for Efficient Sequential Recommendation | Qianru Zhang et.al. | 2505.04445 | null |
2025-05-07 | Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs | Mirazul Haque et.al. | 2505.04441 | null |
2025-05-07 | OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models | Xiaoyu Xu et.al. | 2505.04416 | null |
2025-05-07 | YABLoCo: Yet Another Benchmark for Long Context Code Generation | Aidar Valeev et.al. | 2505.04406 | null |
2025-05-07 | Large Means Left: Political Bias in Large Language Models Increases with Their Number of Parameters | David Exler et.al. | 2505.04393 | null |
2025-05-06
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-06 | WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch | Zimu Lu et.al. | 2505.03733 | null |
2025-05-06 | Graph Drawing for LLMs: An Empirical Evaluation | Walter Didimo et.al. | 2505.03678 | null |
2025-05-06 | PhysLLM: Harnessing Large Language Models for Cross-Modal Remote Physiological Sensing | Yiping Xie et.al. | 2505.03621 | null |
2025-05-06 | From Pixels to Polygons: A Survey of Deep Learning Approaches for Medical Image-to-Mesh Reconstruction | Fengming Lin et.al. | 2505.03599 | null |
2025-05-06 | LlamaFirewall: An open source guardrail system for building secure AI agents | Sahana Chennabasappa et.al. | 2505.03574 | null |
2025-05-06 | Say It Another Way: A Framework for User-Grounded Paraphrasing | Cléa Chataigner et.al. | 2505.03563 | null |
2025-05-06 | Real-Time Person Image Synthesis Using a Flow Matching Model | Jiwoo Jeong et.al. | 2505.03562 | null |
2025-05-06 | A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges | Feibo Jiang et.al. | 2505.03556 | null |
2025-05-06 | A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning | Kolawole E. Ogunsina et.al. | 2505.03553 | null |
2025-05-06 | STORY2GAME: Generating (Almost) Everything in an Interactive Fiction Game | Eric Zhou et.al. | 2505.03547 | null |
2025-05-06 | Faster MoE LLM Inference for Extremely Large Models | Haoqi Yang et.al. | 2505.03531 | null |
2025-05-06 | Causal Intervention Framework for Variational Auto Encoder Mechanistic Interpretability | Dip Roy et.al. | 2505.03530 | null |
2025-05-06 | Ruled by the Representation Space: On the University's Embrace of Large Language Models | Katia Schwerzmann et.al. | 2505.03513 | null |
2025-05-06 | Modality-Guided Dynamic Graph Fusion and Temporal Diffusion for Self-Supervised RGB-T Tracking | Shenglan Li et.al. | 2505.03507 | **[link](https://github.com/lishenglana/gdstrack)** |
2025-05-06 | BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models | Zihan Wang et.al. | 2505.03501 | null |
2025-05-06 | Augmenting Human Cognition through Everyday AR | Xiaoan Liu et.al. | 2505.03492 | null |
2025-05-06 | A new membership inference attack that spots memorization in generative and predictive models: Loss-Based with Reference Model algorithm (LBRM) | Faiz Taleb et.al. | 2505.03490 | null |
2025-05-06 | am-ELO: A Stable Framework for Arena-based LLM Evaluation | Zirui Liu et.al. | 2505.03475 | null |
2025-05-06 | Evaluation of LLMs on Long-tail Entity Linking in Historical Documents | Marta Boscariol et.al. | 2505.03473 | null |
2025-05-06 | Uncertainty-Aware Large Language Models for Explainable Disease Diagnosis | Shuang Zhou et.al. | 2505.03467 | null |
2025-05-06 | Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation | Gerard Pons et.al. | 2505.02737 | null |
2025-05-05
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-05 | Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation | Lu Ling et.al. | 2505.02836 | null |
2025-05-05 | AutoLibra: Agent Metric Induction from Open-Ended Feedback | Hao Zhu et.al. | 2505.02820 | null |
2025-05-05 | ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations | Dmitriy Shopkhoev et.al. | 2505.02819 | null |
2025-05-05 | Knowing You Don't Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing | Diji Yang et.al. | 2505.02811 | null |
2025-05-05 | Towards Quantifying the Hessian Structure of Neural Networks | Zhaorui Dong et.al. | 2505.02809 | null |
2025-05-05 | Generating HomeAssistant Automations Using an LLM-based Chatbot | Mathyas Giudici et.al. | 2505.02802 | null |
2025-05-05 | HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models | Zheng Lin et.al. | 2505.02795 | null |
2025-05-05 | Giving Simulated Cells a Voice: Evolving Prompt-to-Intervention Models for Cellular Control | Nam H. Le et.al. | 2505.02766 | null |
2025-05-05 | Bye-bye, Bluebook? Automating Legal Procedure with Large Language Models | Matthew Dahl et.al. | 2505.02763 | null |
2025-05-05 | FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models | Zhouliang Yu et.al. | 2505.02735 | null |
2025-05-05 | Enhancing LLMs' Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry | Junu Kim et.al. | 2505.02722 | null |
2025-05-05 | Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play | Yemin Shi et.al. | 2505.02707 | null |
2025-05-05 | Structure Causal Models and LLMs Integration in Medical Visual Question Answering | Zibo Xu et.al. | 2505.02703 | null |
2025-05-05 | Exploring LLM-Powered Role and Action-Switching Pedagogical Agents for History Education in Virtual Reality | Zihao Zhu et.al. | 2505.02699 | null |
2025-05-05 | AI Standardized Patient Improves Human Conversations in Advanced Cancer Care | Kurtis Haut et.al. | 2505.02694 | null |
2025-05-05 | Predicting Movie Hits Before They Happen with LLMs | Shaghayegh Agah et.al. | 2505.02693 | null |
2025-05-05 | Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models | Xiaobao Wu et.al. | 2505.02686 | null |
2025-05-05 | A Survey on Progress in LLM Alignment from the Perspective of Reward Design | Miaomiao Ji et.al. | 2505.02666 | null |
2025-05-05 | A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law | Qianjun Pan et.al. | 2505.02665 | null |
2025-05-02
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-02 | GENMO: A GENeralist Model for Human MOtion | Jiefeng Li et.al. | 2505.01425 | null |
2025-05-02 | Provable Efficiency of Guidance in Diffusion Models for General Data Distribution | Gen Li et.al. | 2505.01382 | null |
2025-05-02 | TRAVELER: A Benchmark for Evaluating Temporal Reasoning across Vague, Implicit and Explicit References | Svenja Kenneweg et.al. | 2505.01325 | null |
2025-05-02 | Helping Big Language Models Protect Themselves: An Enhanced Filtering and Summarization System | Sheikh Samit Muhaimin et.al. | 2505.01315 | null |
2025-05-02 | Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments | Regan Bolton et.al. | 2505.01307 | null |
2025-05-02 | ViSA-Flow: Accelerating Robot Skill Learning via Large-Scale Video Semantic Action Flow | Changhe Chen et.al. | 2505.01288 | null |
2025-05-02 | Scoring-Assisted Generative Exploration for Proteins (SAGE-Prot): A Framework for Multi-Objective Protein Optimization via Iterative Sequence Generation and Evaluation | Hocheol Lim et.al. | 2505.01277 | null |
2025-05-02 | FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing | Gaoxiang Cong et.al. | 2505.01263 | null |
2025-05-02 | Enhancing Obsolescence Forecasting with Deep Generative Data Augmentation: A Semi-Supervised Framework for Low-Data Industrial Applications | Elie Saad et.al. | 2505.01261 | null |
2025-05-02 | Digital Pathway Curation (DPC): a comparative pipeline to assess the reproducibility, consensus and accuracy across Gemini, PubMed, and scientific reviewers in biomedical research | Flavio Lichtenstein et.al. | 2505.01259 | null |
2025-05-02 | EvalxNLP: A Framework for Benchmarking Post-Hoc Explainability Methods on NLP Models | Mahdi Dhaini et.al. | 2505.01238 | null |
2025-05-02 | A Combinatorial Proof of Universal Optimality for Computing a Planar Convex Hull | Ivor van der Hoog et.al. | 2505.01194 | null |
2025-05-02 | LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures | Francisco Aguilera-Martínez et.al. | 2505.01177 | null |
2025-05-02 | On the Limitations of Steering in Language Model Alignment | Chebrolu Niranjan et.al. | 2505.01162 | null |
2025-05-02 | Methodological Foundations for AI-Driven Survey Question Generation | Ted K. Mburu et.al. | 2505.01150 | null |
2025-05-02 | Retrieval-Augmented Generation in Biomedicine: A Survey of Technologies, Datasets, and Clinical Applications | Jiawei He et.al. | 2505.01146 | null |
2025-05-02 | Evaluating the Impact of Data Cleaning on the Quality of Generated Pull Request Descriptions | Kutay Tire et.al. | 2505.01120 | null |
2025-05-02 | Incorporating Inductive Biases to Energy-based Generative Models | Yukun Li et.al. | 2505.01111 | null |
2025-05-02 | MateICL: Mitigating Attention Dispersion in Large-Scale In-Context Learning | Murtadha Ahmed et.al. | 2505.01110 | null |
2025-05-02 | Any-to-Any Vision-Language Model for Multimodal X-ray Imaging and Radiological Report Generation | Daniele Molino et.al. | 2505.01091 | null |
2025-05-01
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-05-01 | T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT | Dongzhi Jiang et.al. | 2505.00703 | null |
2025-05-01 | GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution | Aditya Arora et.al. | 2505.00687 | null |
2025-05-01 | MINERVA: Evaluating Complex Video Reasoning | Arsha Nagrani et.al. | 2505.00681 | null |
2025-05-01 | Steering Large Language Models with Register Analysis for Arbitrary Style Transfer | Xinchen Yang et.al. | 2505.00679 | null |
2025-05-01 | Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions | Yiming Du et.al. | 2505.00675 | null |
2025-05-01 | DeepCritic: Deliberate Critique with Large Language Models | Wenkai Yang et.al. | 2505.00662 | null |
2025-05-01 | Large Language Models Understanding: an Inherent Ambiguity Barrier | Daniel N. Nissani et.al. | 2505.00654 | null |
2025-05-01 | Open-Source LLM-Driven Federated Transformer for Predictive IoV Management | Yazan Otoum et.al. | 2505.00651 | null |
2025-05-01 | Investigating Task Arithmetic for Zero-Shot Information Retrieval | Marco Braga et.al. | 2505.00649 | null |
2025-05-01 | The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them) | Zihao Wang et.al. | 2505.00626 | null |
2025-05-01 | FineScope: Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation | Chaitali Bhattacharyya et.al. | 2505.00624 | null |
2025-05-01 | Combining LLMs with Logic-Based Framework to Explain MCTS | Ziyan An et.al. | 2505.00610 | null |
2025-05-01 | Can LLMs Help Improve Analogical Reasoning For Strategic Decisions? Experimental Evidence from Humans and GPT-4 | Phanish Puranam et.al. | 2505.00603 | null |
2025-05-01 | Block Circulant Adapter for Large Language Models | Xinyu Ding et.al. | 2505.00582 | null |
2025-05-01 | FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension | Jushi Kai et.al. | 2505.00570 | null |
2025-05-01 | Triggering Hallucinations in LLMs: A Quantitative Study of Prompt-Induced Hallucination in Large Language Models | Makoto Sato et.al. | 2505.00557 | null |
2025-05-01 | Leveraging Partial SMILES Validation Scheme for Enhanced Drug Design in Reinforcement Learning Frameworks | Xinyu Wang et.al. | 2505.00530 | null |
2025-05-01 | HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection | Deanna Emery et.al. | 2505.00506 | null |
2025-05-01 | UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces | Alaa Saleh et.al. | 2505.00472 | null |
2025-05-01 | A General Model for Linearly Polarized Optical Vector Beams | Jonathan Nichols et.al. | 2505.00471 | null |
2025-04-30
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-30 | ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction | Qihao Liu et.al. | 2504.21855 | null |
2025-04-30 | TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments | Sichang Tu et.al. | 2504.21851 | null |
2025-04-30 | 3D Stylization via Large Reconstruction Model | Ipek Oztas et.al. | 2504.21836 | null |
2025-04-30 | From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems | Huan Zhang et.al. | 2504.21815 | null |
2025-04-30 | Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields | Yixin Gao et.al. | 2504.21814 | null |
2025-04-30 | An Empirical Study on the Effectiveness of Large Language Models for Binary Code Understanding | Xiuwei Shang et.al. | 2504.21803 | null |
2025-04-30 | MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness | Junsheng Huang et.al. | 2504.21773 | null |
2025-04-30 | Anatomical Similarity as a New Metric to Evaluate Brain Generative Models | Bahram Jafrasteh et.al. | 2504.21771 | null |
2025-04-30 | LASHED: LLMs And Static Hardware Analysis for Early Detection of RTL Bugs | Baleegh Ahmad et.al. | 2504.21770 | null |
2025-04-30 | LLM-based Interactive Imitation Learning for Robotic Manipulation | Jonas Werner et.al. | 2504.21769 | null |
2025-04-30 | CodeFlowBench: A Multi-turn, Iterative Benchmark for Complex Code Generation | Sizhe Wang et.al. | 2504.21751 | null |
2025-04-30 | TheraQuest: A Gamified, LLM-Powered Simulation for Massage Therapy Training | Shengqian Wang et.al. | 2504.21735 | null |
2025-04-30 | LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Marc Glocker et.al. | 2504.21716 | null |
2025-04-30 | XBreaking: Explainable Artificial Intelligence for Jailbreaking LLMs | Marco Arazzi et.al. | 2504.21700 | null |
2025-04-30 | Hoist with His Own Petard: Inducing Guardrails to Facilitate Denial-of-Service Attacks on Retrieval-Augmented Generation of LLMs | Pan Suo et.al. | 2504.21680 | null |
2025-04-30 | Traceback of Poisoning Attacks to Retrieval-Augmented Generation | Baolei Zhang et.al. | 2504.21668 | null |
2025-04-30 | Meeseeks: An Iterative Benchmark Evaluating LLMs Multi-Turn Instruction-Following Ability | Jiaming Wang et.al. | 2504.21625 | null |
2025-04-30 | RDF-Based Structured Quality Assessment Representation of Multilingual LLM Evaluations | Jonas Gwozdz et.al. | 2504.21605 | null |
2025-04-30 | Leveraging Pre-trained Large Language Models with Refined Prompting for Online Task and Motion Planning | Huihui Guo et.al. | 2504.21596 | null |
2025-04-30 | DNB-AI-Project at SemEval-2025 Task 5: An LLM-Ensemble Approach for Automated Subject Indexing | Lisa Kluge et.al. | 2504.21589 | null |
2025-04-30 | End-to-end Audio Deepfake Detection from RAW Waveforms: a RawNet-Based Approach with Cross-Dataset Evaluation | Andrea Di Pierno et.al. | 2504.20923 | **[link](https://github.com/adipiz99/RawNetLite)** |
2025-04-29
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-29 | YoChameleon: Personalized Vision and Language Generation | Thao Nguyen et.al. | 2504.20998 | null |
2025-04-29 | Toward Efficient Exploration by Large Language Model Agents | Dilip Arumugam et.al. | 2504.20997 | null |
2025-04-29 | X-Fusion: Introducing New Modality to Frozen Large Language Models | Sicheng Mo et.al. | 2504.20996 | null |
2025-04-29 | TesserAct: Learning 4D Embodied World Models | Haoyu Zhen et.al. | 2504.20995 | null |
2025-04-29 | ACE: A Security Architecture for LLM-Integrated App Systems | Evan Li et.al. | 2504.20984 | null |
2025-04-29 | Jekyll-and-Hyde Tipping Point in an AI's Behavior | Neil F. Johnson et.al. | 2504.20980 | null |
2025-04-29 | Real-Time Wayfinding Assistant for Blind and Low-Vision Users | Dabbrata Das et.al. | 2504.20976 | null |
2025-04-29 | SetKE: Knowledge Editing for Knowledge Elements Overlap | Yifan Wei et.al. | 2504.20972 | null |
2025-04-29 | AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security | Zikui Cai et.al. | 2504.20965 | null |
2025-04-29 | OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification | Shangyu Li et.al. | 2504.20964 | null |
2025-04-29 | Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models | Maryna Vyshnyvetska et.al. | 2504.20951 | null |
2025-04-29 | Trace-of-Thought: Enhanced Arithmetic Problem Solving via Reasoning Distillation From Large to Small Language Models | Tyler McDonald et.al. | 2504.20946 | null |
2025-04-29 | ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification | Ziqing Fan et.al. | 2504.20930 | null |
2025-04-29 | DYNAMAX: Dynamic computing for Transformers and Mamba based architectures | Miguel Nogales et.al. | 2504.20922 | null |
2025-04-29 | An Empirical Study on the Capability of LLMs in Decomposing Bug Reports | Zhiyuan Chen et.al. | 2504.20911 | null |
2025-04-29 | Evaluating Generative Models for Tabular Data: Novel Metrics and Benchmarking | Dayananda Herurkar et.al. | 2504.20900 | null |
2025-04-29 | LELANTE: LEveraging LLM for Automated ANdroid TEsting | Shamit Fatin et.al. | 2504.20896 | null |
2025-04-29 | The Leaderboard Illusion | Shivalika Singh et.al. | 2504.20879 | null |
2025-04-29 | AI-GenBench: A New Ongoing Benchmark for AI-Generated Image Detection | Lorenzo Pellegrini et.al. | 2504.20865 | null |
2025-04-29 | LLM-Generated Fake News Induces Truth Decay in News Ecosystem: A Case Study on Neural News Recommendation | Beizhe Hu et.al. | 2504.20013 | null |
2025-04-29 | From Concept to Practice: an Automated LLM-aided UVM Machine for RTL Verification | Junhao Ye et.al. | 2504.19959 | null |
2025-04-29 | The Automation Advantage in AI Red Teaming | Rob Mulla et.al. | 2504.19855 | null |
2025-04-28
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-28 | AutoJudge: Judge Decoding Without Manual Annotation | Roman Garipov et.al. | 2504.20039 | null |
2025-04-28 | Better To Ask in English? Evaluating Factual Accuracy of Multilingual LLMs in English and Low-Resource Languages | Pritika Rohera et.al. | 2504.20022 | null |
2025-04-28 | Modular Machine Learning: An Indispensable Path towards New-Generation Large Language Models | Xin Wang et.al. | 2504.20020 | null |
2025-04-28 | Applying LLM-Powered Virtual Humans to Child Interviews in Child-Centered Design | Linshi Li et.al. | 2504.20016 | null |
2025-04-28 | Towards Automated Scoping of AI for Social Good Projects | Jacob Emmerson et.al. | 2504.20010 | null |
2025-04-28 | Chatbot Arena Meets Nuggets: Towards Explanations and Diagnostics in the Evaluation of LLM Responses | Sahel Sharifymoghaddam et.al. | 2504.20006 | null |
2025-04-28 | Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom | Rishika Sen et.al. | 2504.20000 | null |
2025-04-28 | TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons | Emre Can Acikgoz et.al. | 2504.19982 | null |
2025-04-28 | Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets | Adam Younsi et.al. | 2504.19981 | null |
2025-04-28 | Securing Agentic AI: A Comprehensive Threat Model and Mitigation Framework for Generative AI Agents | Vineeth Sai Narajala et.al. | 2504.19956 | null |
2025-04-28 | Enhancing Surgical Documentation through Multimodal Visual-Temporal Transformers and Generative AI | Hugo Georgenthum et.al. | 2504.19918 | null |
2025-04-28 | Can AI Agents Design and Implement Drug Discovery Pipelines? | Khachik Smbatyan et.al. | 2504.19912 | null |
2025-04-28 | GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets | Mingqian He et.al. | 2504.19898 | null |
2025-04-28 | CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition | Quynh Phung et.al. | 2504.19894 | null |
2025-04-28 | DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images | Mamadou Keita et.al. | 2504.19876 | null |
2025-04-28 | semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage | Ke Hong et.al. | 2504.19867 | null |
2025-04-28 | CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback | Chenhan Jiang et.al. | 2504.19860 | null |
2025-04-25
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-25 | TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation | Gwen Yidou Weng et.al. | 2504.18535 | null |
2025-04-25 | Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation | Shivam Duggal et.al. | 2504.18509 | null |
2025-04-25 | Action-Minimization Meets Generative Modeling: Efficient Transition Path Sampling with the Onsager-Machlup Functional | Sanjeev Raja et.al. | 2504.18506 | null |
2025-04-25 | Facets, Taxonomies, and Syntheses: Navigating Structured Representations in LLM-Assisted Literature Review | Raymond Fok et.al. | 2504.18496 | null |
2025-04-25 | Investigating Co-Constructive Behavior of Large Language Models in Explanation Dialogues | Leandra Fichtel et.al. | 2504.18483 | null |
2025-04-25 | Generative Induction of Dialogue Task Schemas with Streaming Refinement and Simulated Interactions | James D. Finch et.al. | 2504.18474 | null |
2025-04-25 | PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts | Yiming Wang et.al. | 2504.18428 | null |
2025-04-25 | Kimi-Audio Technical Report | KimiTeam et.al. | 2504.18425 | null |
2025-04-25 | LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning | Rui Li et.al. | 2504.18424 | null |
2025-04-25 | LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection | Rajesh Yarra et.al. | 2504.18423 | null |
2025-04-25 | BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs | Hongyu Wang et.al. | 2504.18415 | null |
2025-04-25 | Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers | Jared Moore et.al. | 2504.18412 | **[link](https://github.com/jlcmoore/llms-as-therapists)** |
2025-04-25 | Can Code Outlove Blood? A LLM-based VR Experience to Prompt Reflection on Parental Verbal Abuse | Jiaying Fu et.al. | 2504.18410 | null |
2025-04-25 | HepatoGEN: Generating Hepatobiliary Phase MRI with Perceptual and Adversarial Models | Jens Hooge et.al. | 2504.18405 | null |
2025-04-25 | Bridge the Domains: Large Language Models Enhanced Cross-domain Sequential Recommendation | Qidong Liu et.al. | 2504.18383 | null |
2025-04-25 | Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant | Lei Shen et.al. | 2504.18373 | null |
2025-04-25 | ThreMoLIA: Threat Modeling of Large Language Model-Integrated Applications | Felix Viktor Jedrzejewski et.al. | 2504.18369 | null |
2025-04-25 | Enhanced Sampling, Public Dataset and Generative Model for Drug-Protein Dissociation Dynamics | Maodong Li et.al. | 2504.18367 | null |
2025-04-25 | Revisiting Data Auditing in Large Vision-Language Models | Hongyu Zhu et.al. | 2504.18349 | null |
2025-04-25 | Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review | Toghrul Abbasli et.al. | 2504.18346 | null |
2025-04-25 | DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training | Xiaoyu Tian et.al. | 2504.17565 | null |
2025-04-24
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-24 | Replay to Remember: Retaining Domain Knowledge in Streaming Language Models | Sneh Pillai et.al. | 2504.17780 | null |
2025-04-24 | The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs | Piotr Nawrot et.al. | 2504.17768 | null |
2025-04-24 | Step1X-Edit: A Practical Framework for General Image Editing | Shiyu Liu et.al. | 2504.17761 | null |
2025-04-24 | Towards Robust LLMs: an Adversarial Robustness Measurement Framework | Natan Levy et.al. | 2504.17723 | null |
2025-04-24 | Multilingual Performance Biases of Large Language Models in Education | Vansh Gupta et.al. | 2504.17720 | null |
2025-04-24 | Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks | Haru-Tada Sato et.al. | 2504.17685 | null |
2025-04-24 | INSIGHT: Bridging the Student-Teacher Gap in Times of Large Language Models | Jarne Thys et.al. | 2504.17677 | null |
2025-04-24 | Energy Considerations of Large Language Model Inference and Efficiency Optimizations | Jared Fernandez et.al. | 2504.17674 | null |
2025-04-24 | Cross-region Model Training with Communication-Computation Overlapping and Delay Compensation | Ying Zhu et.al. | 2504.17672 | null |
2025-04-24 | DiMeR: Disentangled Mesh Reconstruction Model | Lutao Jiang et.al. | 2504.17670 | null |
2025-04-24 | Towards a HIPAA Compliant Agentic AI System in Healthcare | Subash Neupane et.al. | 2504.17669 | null |
2025-04-24 | Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics | Zena Al-Khalili et.al. | 2504.17665 | null |
2025-04-24 | Likelihood-Free Variational Autoencoders | Chen Xu et.al. | 2504.17622 | null |
2025-04-24 | L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference | Qingyuan Liu et.al. | 2504.17584 | null |
2025-04-24 | HalluLens: LLM Hallucination Benchmark | Yejin Bang et.al. | 2504.17550 | null |
2025-04-24 | A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task | Jiaqi Deng et.al. | 2504.17547 | null |
2025-04-24 | Auditing the Ethical Logic of Generative AI Models | W. Russell Neuman et.al. | 2504.17544 | null |
2025-04-24 | Large Language Model-Driven Concolic Execution for Highly Structured Test Input Generation | Haoxin Tu et.al. | 2504.17542 | null |
2025-04-24 | Towards Machine-Generated Code for the Resolution of User Intentions | Justus Flerlage et.al. | 2504.17531 | null |
2025-04-23
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-23 | Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light | Ali Hassani et.al. | 2504.16922 | null |
2025-04-23 | IberBench: LLM Evaluation on Iberian Languages | José Ángel González et.al. | 2504.16921 | null |
2025-04-23 | OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents | Raghav Thind et.al. | 2504.16918 | null |
2025-04-23 | DreamO: A Unified Framework for Image Customization | Chong Mou et.al. | 2504.16915 | null |
2025-04-23 | Tracing Thought: Using Chain-of-Thought Reasoning to Identify the LLM Behind AI-Generated Text | Shifali Agrahari et.al. | 2504.16913 | null |
2025-04-23 | BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation | Ruotong Wang et.al. | 2504.16907 | null |
2025-04-23 | Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials | Peichen Zhong et.al. | 2504.16893 | null |
2025-04-23 | Do Large Language Models know who did what to whom? | Joseph M. Denning et.al. | 2504.16884 | null |
2025-04-23 | Enhancing Critical Thinking with AI: A Tailored Warning System for RAG Models | Xuyang Zhu et.al. | 2504.16883 | null |
2025-04-23 | Context-Enhanced Vulnerability Detection Based on Large Language Model | Yixin Yang et.al. | 2504.16877 | null |
2025-04-23 | Exploring How LLMs Capture and Represent Domain-Specific Knowledge | Mirian Hipolito Garcia et.al. | 2504.16871 | null |
2025-04-23 | Planning with Diffusion Models for Target-Oriented Dialogue Systems | Hanwen Du et.al. | 2504.16858 | null |
2025-04-23 | Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification | Alexander Shvets et.al. | 2504.16856 | null |
2025-04-23 | Monte Carlo Planning with Large Language Model for Text-Based Game Agents | Zijing Shi et.al. | 2504.16855 | null |
2025-04-23 | Improving Significant Wave Height Prediction Using Chronos Models | Yilin Zhai et.al. | 2504.16834 | null |
2025-04-23 | LRASGen: LLM-based RESTful API Specification Generation | Sida Deng et.al. | 2504.16833 | null |
2025-04-23 | GreenMind: A Next-Generation Vietnamese Large Language Model for Structured and Logical Reasoning | Luu Quy Tung et.al. | 2504.16832 | null |
2025-04-23 | Process Reward Models That Think | Muhammad Khalifa et.al. | 2504.16828 | null |
2025-04-23 | LLM-assisted Graph-RAG Information Extraction from IFC Data | Sima Iranmanesh et.al. | 2504.16813 | null |
2025-04-23 | Decoupled Global-Local Alignment for Improving Compositional Understanding | Xiaoxing Hu et.al. | 2504.16801 | null |
2025-04-23 | CAPO: Cost-Aware Prompt Optimization | Tom Zehle et.al. | 2504.16005 | **[link](https://github.com/finitearth/capo)** |
2025-04-23 | From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs | Yaxiong Wu et.al. | 2504.15965 | null |
2025-04-23 | Language Models to Support Multi-Label Classification of Industrial Data | Waleed Abdeen et.al. | 2504.15922 | null |
2025-04-22
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-22 | TTRL: Test-Time Reinforcement Learning | Yuxin Zuo et.al. | 2504.16084 | null |
2025-04-22 | MR. Video: "MapReduce" is the Principle for Long Video Understanding | Ziqi Pang et.al. | 2504.16082 | null |
2025-04-22 | LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities | Thomas Schmied et.al. | 2504.16078 | null |
2025-04-22 | PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models | Shi Qiu et.al. | 2504.16074 | null |
2025-04-22 | Boosting Generative Image Modeling via Joint Image-Feature Synthesis | Theodoros Kouzelis et.al. | 2504.16064 | null |
2025-04-22 | Automated Static Vulnerability Detection via a Holistic Neuro-symbolic Approach | Penghui Li et.al. | 2504.16057 | null |
2025-04-22 | Honey, I Shrunk the Language Model: Impact of Knowledge Distillation Methods on Performance and Explainability | Daniel Hendriks et.al. | 2504.16056 | null |
2025-04-22 | Certified Mitigation of Worst-Case LLM Copyright Infringement | Jingyu Zhang et.al. | 2504.16046 | null |
2025-04-22 | LLMs meet Federated Learning for Scalable and Secure IoT Management | Yazan Otoum et.al. | 2504.16032 | null |
2025-04-22 | LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale | Joya Chen et.al. | 2504.16030 | null |
2025-04-22 | Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3 | Ahmed R. Sadik et.al. | 2504.16027 | null |
2025-04-22 | Token-Aware Coding Flow: A Study with Nano Surge in Reasoning Model | Junwei Hu et.al. | 2504.15989 | null |
2025-04-22 | Deep learning of point processes for modeling high-frequency data | Yoshihiro Gyotoku et.al. | 2504.15944 | null |
2025-04-22 | FairTranslate: An English-French Dataset for Gender Bias Evaluation in Machine Translation by Overcoming Gender Binarity | Fanny Jourdan et.al. | 2504.15941 | **[link](https://github.com/fanny-jourdan/FairTranslate)** |
2025-04-22 | Low-Rank Adaptation of Neural Fields | Anh Truong et.al. | 2504.15933 | null |
2025-04-22 | StreamRL: Scalable, Heterogeneous, and Elastic RL for LLMs with Disaggregated Stream Generation | Yinmin Zhong et.al. | 2504.15930 | null |
2025-04-22 | ViSMaP: Unsupervised Hour-long Video Summarisation by Meta-Prompting | Jian Hu et.al. | 2504.15921 | null |
2025-04-22 | Synergistic Weak-Strong Collaboration by Aligning Preferences | Yizhu Jiao et.al. | 2504.15188 | null |
2025-04-21
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-21 | Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning | Jie Cheng et.al. | 2504.15275 | **[link](https://github.com/cjreinforce/pure)** |
2025-04-21 | Interpretable Locomotion Prediction in Construction Using a Memory-Driven LLM Agent With Chain-of-Thought Reasoning | Ehsan Ahmadi et.al. | 2504.15263 | null |
2025-04-21 | CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation | Anirudh Khatry et.al. | 2504.15254 | **[link](https://github.com/anirudhkhatry/crust-bench)** |
2025-04-21 | Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators | Yilun Zhou et.al. | 2504.15253 | **[link](https://github.com/salesforceairesearch/jetts-benchmark)** |
2025-04-21 | MR. Guard: Multilingual Reasoning Guardrail using Curriculum Learning | Yahan Yang et.al. | 2504.15241 | null |
2025-04-21 | A Self-Improving Coding Agent | Maxime Robeyns et.al. | 2504.15228 | null |
2025-04-21 | EvalAgent: Discovering Implicit Evaluation Criteria from the Web | Manya Wadhwa et.al. | 2504.15219 | null |
2025-04-21 | DRAGON: Distributional Rewards Optimize Diffusion Generative Models | Yatong Bai et.al. | 2504.15217 | null |
2025-04-21 | Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs | Marina Sakharova et.al. | 2504.15210 | null |
2025-04-21 | Compute-Optimal LLMs Provably Generalize Better With Scale | Marc Finzi et.al. | 2504.15208 | null |
2025-04-21 | Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges | Nandan Thakur et.al. | 2504.15205 | null |
2025-04-21 | Zero-Shot, But at What Cost? Unveiling the Hidden Overhead of MILS's LLM-CLIP Framework for Image Captioning | Yassir Benhammou et.al. | 2504.15199 | null |
2025-04-21 | Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform | Xianpan Zhou et.al. | 2504.15182 | null |
2025-04-21 | The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks | Joan C. Timoneda et.al. | 2504.15160 | null |
2025-04-21 | EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models | Ziwen Xu et.al. | 2504.15133 | **[link](https://github.com/zjunlp/easyedit)** |
2025-04-21 | Kuwain 1.5B: An Arabic SLM via Language Injection | Khalil Hennara et.al. | 2504.15120 | null |
2025-04-21 | Rethinking the Potential of Multimodality in Collaborative Problem Solving Diagnosis with Large Language Models | K. Wong et.al. | 2504.15093 | null |
2025-04-21 | Empowering AI to Generate Better AI Code: Guided Generation of Deep Learning Projects with LLMs | Chen Xie et.al. | 2504.15080 | null |
2025-04-21 | Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL | Simone Papicchio et.al. | 2504.15077 | null |
2025-04-18
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-18 | Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? | Yang Yue et.al. | 2504.13837 | null |
2025-04-18 | Science Hierarchography: Hierarchical Organization of Science Literature | Muhan Gao et.al. | 2504.13834 | null |
2025-04-18 | Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning | Yixuan Even Xu et.al. | 2504.13818 | null |
2025-04-18 | Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations | Chenghao Xiao et.al. | 2504.13816 | null |
2025-04-18 | BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models | Zhengxian Wu et.al. | 2504.13775 | null |
2025-04-18 | DP2Unlearning: An Efficient and Guaranteed Unlearning Framework for LLMs | Tamim Al Mahmud et.al. | 2504.13774 | null |
2025-04-18 | Detecting Malicious Source Code in PyPI Packages with LLMs: Does RAG Come in Handy? | Motunrayo Ibiyo et.al. | 2504.13769 | null |
2025-04-18 | ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis | Andrea Rigo et.al. | 2504.13745 | null |
2025-04-18 | Controlled Territory and Conflict Tracking (CONTACT): (Geo-)Mapping Occupied Territory from Open Source Intelligence | Paul K. Mandal et.al. | 2504.13730 | null |
2025-04-18 | MLEP: Multi-granularity Local Entropy Patterns for Universal AI-generated Image Detection | Lin Yuan et.al. | 2504.13726 | null |
2025-04-18 | OpenDeception: Benchmarking and Investigating AI Deceptive Behaviors via Open-ended Interaction Simulation | Yichen Wu et.al. | 2504.13707 | null |
2025-04-18 | Exploring Multimodal Prompt for Visualization Authoring with Large Language Models | Zhen Wen et.al. | 2504.13700 | null |
2025-04-18 | Intelligent Interaction Strategies for Context-Aware Cognitive Augmentation | Xiangrong et.al. | 2504.13684 | null |
2025-04-18 | Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results | Andrea Santilli et.al. | 2504.13677 | null |
2025-04-18 | Large Language Models Will Change The Way Children Think About Technology And Impact Every Interaction Paradigm | Russell Beale et.al. | 2504.13667 | null |
2025-04-18 | Do Prompt Patterns Affect Code Quality? A First Empirical Assessment of ChatGPT-Generated Code | Antonio Della Porta et.al. | 2504.13656 | null |
2025-04-18 | Exploring the Potential for Large Language Models to Demonstrate Rational Probabilistic Beliefs | Gabriel Freedman et.al. | 2504.13644 | null |
2025-04-18 | Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling | Shaomu Tan et.al. | 2504.13630 | null |
2025-04-18 | Divergent LLM Adoption and Heterogeneous Convergence Paths in Research Writing | Cong William Lin et.al. | 2504.13629 | null |
2025-04-18 | Visual Intention Grounding for Egocentric Assistants | Pengzhan Sun et.al. | 2504.13621 | null |
2025-04-18 | SkyReels-V2: Infinite-length Film Generative Model | Guibin Chen et.al. | 2504.13074 | null |
2025-04-17
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-17 | Aligning Constraint Generation with Design Intent in Parametric CAD | Evan Casey et.al. | 2504.13178 | null |
2025-04-17 | SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs | Haoxuan Li et.al. | 2504.13172 | null |
2025-04-17 | Sleep-time Compute: Beyond Inference Scaling at Test-time | Kevin Lin et.al. | 2504.13171 | **[link](https://github.com/letta-ai/sleep-time-compute)** |
2025-04-17 | Exploring Expert Failures Improves LLM Agent Tuning | Li-Cheng Lan et.al. | 2504.13145 | null |
2025-04-17 | Energy-Based Reward Models for Robust Language Model Alignment | Anamika Lochab et.al. | 2504.13134 | null |
2025-04-17 | Science-T2I: Addressing Scientific Illusions in Image Synthesis | Jialuo Li et.al. | 2504.13129 | null |
2025-04-17 | LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard | Varun Rao et.al. | 2504.13125 | null |
2025-04-17 | VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models | Haojian Huang et.al. | 2504.13122 | **[link](https://github.com/haroldchen19/vistadpo)** |
2025-04-17 | UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models | Guanlong Jiao et.al. | 2504.13109 | null |
2025-04-17 | EventVAD: Training-Free Event-Aware Video Anomaly Detection | Yihua Shao et.al. | 2504.13092 | null |
2025-04-17 | Retrieval-Augmented Generation with Conflicting Evidence | Han Wang et.al. | 2504.13079 | **[link](https://github.com/hannight/ramdocs)** |
2025-04-17 | An All-Atom Generative Model for Designing Protein Complexes | Ruizhe Chen et.al. | 2504.13075 | null |
2025-04-17 | Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models | Sudesh Ramesh Bhagat et.al. | 2504.13068 | null |
2025-04-17 | ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation Models | Linkang Du et.al. | 2504.13061 | **[link](https://github.com/jozenn/artistauditor)** |
2025-04-17 | GraphAttack: Exploiting Representational Blindspots in LLM Safety Mechanisms | Sinan He et.al. | 2504.13052 | null |
2025-04-17 | Design Topological Materials by Reinforcement Fine-Tuned Generative Model | Haosheng Xu et.al. | 2504.13048 | null |
2025-04-17 | How Large Language Models Are Changing MOOC Essay Answers: A Comparison of Pre- and Post-LLM Responses | Leo Leppänen et.al. | 2504.13038 | null |
2025-04-17 | InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning | Zheng Wang et.al. | 2504.13032 | null |
2025-04-17 | ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images | Sangwook Kim et.al. | 2504.13023 | null |
2025-04-16
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-16 | BitNet b1.58 2B4T Technical Report | Shuming Ma et.al. | 2504.12285 | null |
2025-04-16 | HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks | Stefan Abi-Karam et.al. | 2504.12268 | null |
2025-04-16 | VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate | Zhihang Yuan et.al. | 2504.12259 | null |
2025-04-16 | FLIP Reasoning Challenge | Andreas Plesner et.al. | 2504.12256 | null |
2025-04-16 | AnomalyGen: An Automated Semantic Log Sequence Generation Framework with LLM for Anomaly Detection | Xinyu Li et.al. | 2504.12250 | null |
2025-04-16 | MOS: Towards Effective Smart Contract Vulnerability Detection through Mixture-of-Experts Tuning of Large Language Models | Hang Yuan et.al. | 2504.12234 | null |
2025-04-16 | Watermarking Needs Input Repetition Masking | David Khachaturov et.al. | 2504.12229 | null |
2025-04-16 | Coding-Prior Guided Diffusion Network for Video Deblurring | Yike Liu et.al. | 2504.12222 | null |
2025-04-16 | d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning | Siyan Zhao et.al. | 2504.12216 | null |
2025-04-16 | From Requirements to Architecture: Semi-Automatically Generating Software Architectures | Tobias Eisenreich et.al. | 2504.12192 | null |
2025-04-16 | What Do Large Language Models Know? Tacit Knowledge as a Potential Causal-Explanatory Structure | Céline Budding et.al. | 2504.12187 | null |
2025-04-16 | SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data | Suyoung Bae et.al. | 2504.12185 | null |
2025-04-16 | Deep Generative Models for Bayesian Inference on High-Rate Sensor Data: Applications in Automotive Radar and Medical Imaging | Tristan S. W. Stevens et.al. | 2504.12154 | null |
2025-04-16 | ARCeR: an Agentic RAG for the Automated Definition of Cyber Ranges | Matteo Lupinacci et.al. | 2504.12143 | null |
2025-04-16 | Multilingual Contextualization of Large Language Models for Document-Level Machine Translation | Miguel Moura Ramos et.al. | 2504.12140 | null |
2025-04-16 | Clarifying Ambiguities: on the Role of Ambiguity Types in Prompting Methods for Clarification Generation | Anfu Tang et.al. | 2504.12113 | null |
2025-04-16 | Towards LLM Agents for Earth Observation | Chia Hsiang Kao et.al. | 2504.12110 | null |
2025-04-16 | Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation | Shizhan Cai et.al. | 2504.12108 | null |
2025-04-16 | Gauging Overprecision in LLMs: An Empirical Study | Adil Bahaj et.al. | 2504.12098 | null |
2025-04-16 | Reasoning-Based AI for Startup Evaluation (R.A.I.S.E.): A Memory-Augmented, Multi-Step Decision Framework | Jack Preuveneers et.al. | 2504.12090 | null |
2025-04-16 | Elucidating the Design Space of Multimodal Protein Language Models | Cheng-Yen Hsieh et.al. | 2504.11454 | null |
2025-04-15
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-15 | Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception | Ziqi Pang et.al. | 2504.11457 | null |
2025-04-15 | DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning | Zhiwei He et.al. | 2504.11456 | null |
2025-04-15 | TextArena | Leon Guertler et.al. | 2504.11442 | null |
2025-04-15 | Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models | Maria Teleki et.al. | 2504.11431 | null |
2025-04-15 | A Dual-Space Framework for General Knowledge Distillation of Large Language Models | Xue Zhang et.al. | 2504.11426 | null |
2025-04-15 | Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts | Quanyu Long et.al. | 2504.11420 | null |
2025-04-15 | Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning | Ali Taghibakhshi et.al. | 2504.11409 | null |
2025-04-15 | RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models | Juan Diego Rodriguez et.al. | 2504.11381 | null |
2025-04-15 | Ring Artifacts Correction Based on Global-Local Features Interaction Guidance in the Projection Domain | Yunze Liu et.al. | 2504.11375 | null |
2025-04-15 | Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions | Wang Bill Zhu et.al. | 2504.11373 | null |
2025-04-15 | DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks | Yupei Liu et.al. | 2504.11358 | null |
2025-04-15 | A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce | Wei Xiong et.al. | 2504.11343 | null |
2025-04-15 | Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints | Ruicheng Ao et.al. | 2504.11320 | null |
2025-04-15 | Learning to Be A Doctor: Searching for Effective Medical Agent Architectures | Yangyang Zhuang et.al. | 2504.11301 | null |
2025-04-15 | The Obvious Invisible Threat: LLM-Powered GUI Agents' Vulnerability to Fine-Print Injections | Chaoran Chen et.al. | 2504.11281 | null |
2025-04-15 | From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs | Guocong Li et.al. | 2504.11277 | null |
2025-04-15 | Towards Automated Safety Requirements Derivation Using Agent-based RAG | Balahari Vignesh Balu et.al. | 2504.11243 | null |
2025-04-15 | Nondeterministic Polynomial-time Problem Challenge: An Ever-Scaling Reasoning Benchmark for LLMs | Chang Yang et.al. | 2504.11239 | null |
2025-04-15 | AutoRAN: Automated and Zero-Touch Open RAN Systems | Stefano Maxenti et.al. | 2504.11233 | null |
2025-04-14
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-14 | xVerify: Efficient Answer Verifier for Reasoning Model Evaluations | Ding Chen et.al. | 2504.10481 | null |
2025-04-14 | InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models | Jinguo Zhu et.al. | 2504.10479 | null |
2025-04-14 | Art3D: Training-Free 3D Generation from Flat-Colored Illustration | Xiaoyan Cong et.al. | 2504.10466 | null |
2025-04-14 | M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models | Junxiong Wang et.al. | 2504.10449 | null |
2025-04-14 | Multimodal Long Video Modeling Based on Temporal Dynamic Context | Haoran Hao et.al. | 2504.10443 | null |
2025-04-14 | Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing | Taihang Hu et.al. | 2504.10434 | null |
2025-04-14 | LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models | Minqian Liu et.al. | 2504.10430 | null |
2025-04-14 | Can We Edit LLMs for Long-Tail Biomedical Knowledge? | Xinhao Yi et.al. | 2504.10421 | null |
2025-04-14 | CliniChat: A Multi-Source Knowledge-Driven Framework for Clinical Interview Dialogue Reconstruction and Evaluation | Jing Chen et.al. | 2504.10418 | null |
2025-04-14 | LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models | Parshin Shojaee et.al. | 2504.10415 | **[link](https://github.com/deep-symbolic-mathematics/llm-srbench)** |
2025-04-14 | HUMOTO: A 4D Dataset of Mocap Human Object Interactions | Jiaxin Lu et.al. | 2504.10414 | null |
2025-04-14 | Performance of Large Language Models in Supporting Medical Diagnosis and Treatment | Diogo Sousa et.al. | 2504.10405 | null |
2025-04-14 | Can LLMs Assist Expert Elicitation for Probabilistic Causal Modeling? | Olha Shaposhnyk et.al. | 2504.10397 | null |
2025-04-14 | LLM-driven Constrained Copy Generation through Iterative Refinement | Varun Vasudevan et.al. | 2504.10391 | null |
2025-04-14 | SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning | Yiting Wang et.al. | 2504.10369 | null |
2025-04-14 | S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models | Wenyuan Zhang et.al. | 2504.10368 | null |
2025-04-14 | FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos | Rui Chen et.al. | 2504.10358 | null |
2025-04-14 | MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages | Dieuwke Hupkes et.al. | 2504.10356 | null |
2025-04-14 | Forecasting from Clinical Textual Time Series: Adaptations of the Encoder and Decoder Language Model Families | Shahriar Noroozizadeh et.al. | 2504.10340 | null |
2025-04-14 | Heimdall: test-time scaling on the generative verification | Wenlei Shi et.al. | 2504.10337 | null |
2025-04-11
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-04-11 | Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images | Boyang Deng et.al. | 2504.08727 | null |
2025-04-11 | DocAgent: A Multi-Agent System for Automated Code Documentation Generation | Dayu Yang et.al. | 2504.08725 | null |
2025-04-11 | Generating Fine Details of Entity Interactions | Xinyi Gu et.al. | 2504.08714 | null |
2025-04-11 | Large Language Models as Span Annotators | Zdeněk Kasner et.al. | 2504.08697 | null |
2025-04-11 | SeaView: Software Engineering Agent Visual Interface for Enhanced Workflow | Timothy Bula et.al. | 2504.08696 | null |
2025-04-11 | TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning | Hang Ni et.al. | 2504.08694 | null |
2025-04-11 | Fast-Slow-Thinking: Complex Task Solving with Large Language Models | Yiliu Sun et.al. | 2504.08690 | null |
2025-04-11 | Voice Interaction With Conversational AI Could Facilitate Thoughtful Reflection and Substantive Revision in Writing | Jiho Kim et.al. | 2504.08687 | null |
2025-04-11 | Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model | Team Seawead et.al. | 2504.08685 | null |
2025-04-11 | Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning | Fangzhi Xu et.al. | 2504.08672 | null |
2025-04-11 | Variability-Driven User-Story Generation using LLM and Triadic Concept Analysis | Alexandre Bazin et.al. | 2504.08666 | null |
2025-04-11 | Safe Flow Matching: Robot Motion Planning with Control Barrier Functions | Xiaobing Dai et.al. | 2504.08661 | null |
2025-04-11 | Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents | Alessio Buscemi et.al. | 2504.08640 | null |
2025-04-11 | MooseAgent: A LLM Based Multi-agent Framework for Automating Moose Simulation | Tao Zhang et.al. | 2504.08621 | null |
2025-04-11 | Analyzing 16,193 LLM Papers for Fun and Profits | Zhiqiu Xia et.al. | 2504.08619 | null |
2025-04-11 | ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration | Yongsheng Yu et.al. | 2504.08591 | null |
2025-04-11 | COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails | Miguel Espinosa et.al. | 2504.08548 | null |
2025-04-11 | Slicing the Gaussian Mixture Wasserstein Distance | Moritz Piening et.al. | 2504.08544 | null |
2025-04-11 | Task Memory Engine (TME): Enhancing State Awareness for Multi-Step LLM Agent Tasks | Ye Ye et.al. | 2504.08525 | null |
2025-04-11 | Adopting Large Language Models to Automated System Integration | Robin D. Pesl et.al. | 2504.08490 | null |
2025-04-11 | Scaling Laws for Native Multimodal Models | Mustafa Shukor et.al. | 2504.07951 | null |
2025-04-11 | An LLM-Driven Multi-Agent Debate System for Mendelian Diseases | Xinyang Zhou et.al. | 2504.07881 | null |
2025-04-11 | Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs | Yichun Yin et.al. | 2504.07866 | null |
2025-04-10
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-10 | C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing | Zhongyang Li et.al. | 2504.07964 | **[link](https://github.com/tianyi-lab/c3po)** |
2025-04-10 | PixelFlow: Pixel-Space Generative Models with Flow | Shoufa Chen et.al. | 2504.07963 | **[link](https://github.com/shoufachen/pixelflow)** |
2025-04-10 | VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning | Yukun Qi et.al. | 2504.07956 | null |
2025-04-10 | Porting an LLM based Application from ChatGPT to an On-Premise Environment | Teemu Paloniemi et.al. | 2504.07907 | null |
2025-04-10 | Redefining Machine Translation on Social Network Services with Large Language Models | Hongcheng Guo et.al. | 2504.07901 | null |
2025-04-10 | How do Large Language Models Understand Relevance? A Mechanistic Interpretability Perspective | Qi Liu et.al. | 2504.07898 | null |
2025-04-10 | DiverseFlow: Sample-Efficient Diverse Mode Coverage in Flows | Mashrur M. Morshed et.al. | 2504.07894 | null |
2025-04-10 | Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge | Riccardo Cantini et.al. | 2504.07887 | **[link](https://github.com/SCAlabUnical/CLEAR-Bias_LLM_benchmark)** |
2025-04-10 | Token Level Routing Inference System for Edge Devices | Jianshu She et.al. | 2504.07878 | null |
2025-04-10 | Robust Hallucination Detection in LLMs via Adaptive Token Selection | Mengjia Niu et.al. | 2504.07863 | null |
2025-04-10 | Understanding Learner-LLM Chatbot Interactions and the Impact of Prompting Guidelines | Cansu Koyuturk et.al. | 2504.07840 | null |
2025-04-10 | Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems | Simon Lermen et.al. | 2504.07831 | null |
2025-04-10 | MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations | Genglin Liu et.al. | 2504.07830 | null |
2025-04-10 | Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models | Hongcheng Guo et.al. | 2504.07807 | **[link](https://github.com/fighoture/moe_unsupervised_pruning)** |
2025-04-10 | A System for Comprehensive Assessment of RAG Frameworks | Mattia Rengo et.al. | 2504.07803 | **[link](https://github.com/eustema-s-p-a/scarf)** |
2025-04-10 | FairEval: Evaluating Fairness in LLM-Based Recommendations with Personality Awareness | Chandan Kumar Sah et.al. | 2504.07801 | null |
2025-04-10 | Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation | Alireza Salemi et.al. | 2504.07794 | **[link](https://github.com/alirezasalemi7/pr-rag)** |
2025-04-09
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-09 | Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning | Nikhil Shivakumar Nayak et.al. | 2504.07097 | null |
2025-04-09 | OmniCaptioner: One Captioner to Rule Them All | Yiting Lu et.al. | 2504.07089 | null |
2025-04-09 | KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs | Elan Markowitz et.al. | 2504.07087 | null |
2025-04-09 | Identifying Unknown Stochastic Dynamics via Finite expression methods | Senwei Liang et.al. | 2504.07085 | null |
2025-04-09 | DeduCE: Deductive Consistency as a Framework to Evaluate LLM Reasoning | Atharva Pandey et.al. | 2504.07080 | null |
2025-04-09 | A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models | Zhouhang Xie et.al. | 2504.07070 | null |
2025-04-09 | HalluciNot: Hallucination Detection Through Context and Common Knowledge Verification | Bibek Paudel et.al. | 2504.07069 | null |
2025-04-09 | TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling | Liang-Hsuan Tseng et.al. | 2504.07053 | null |
2025-04-09 | To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning | Tian Qin et.al. | 2504.07052 | null |
2025-04-09 | Distilling Textual Priors from LLM to Efficient Image Fusion | Ran Zhang et.al. | 2504.07029 | null |
2025-04-09 | Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety | Chad Melton et.al. | 2504.07022 | null |
2025-04-09 | LLM-IFT: LLM-Powered Information Flow Tracking for Secure Hardware | Nowfel Mashnoor et.al. | 2504.07015 | null |
2025-04-09 | Latent Diffusion U-Net Representations Contain Positional Embeddings and Anomalies | Jonas Loos et.al. | 2504.07008 | **[link](https://github.com/JonasLoos/sd-representation-anomalies)** |
2025-04-09 | Towards LLMs Robustness to Changes in Prompt Format Styles | Lilian Ngweta et.al. | 2504.06969 | null |
2025-04-09 | Review of Case-Based Reasoning for LLM Agents: Theoretical Foundations, Architectural Components, and Cognitive Integration | Kostas Hatalis et.al. | 2504.06943 | null |
2025-04-09 | FeedbackEval: A Benchmark for Evaluating Large Language Models in Feedback-Driven Code Repair Tasks | Dekun Dai et.al. | 2504.06939 | null |
2025-04-09 | The Importance of Being Discrete: Measuring the Impact of Discretization in End-to-End Differentially Private Synthetic Data | Georgi Ganev et.al. | 2504.06923 | null |
2025-04-09 | Identifying Aspects in Peer Reviews | Sheng Lu et.al. | 2504.06910 | null |
2025-04-09 | EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation | Diljeet Jagpal et.al. | 2504.06861 | null |
2025-04-09 | LVC: A Lightweight Compression Framework for Enhancing VLMs in Long Video Understanding | Ziyi Wang et.al. | 2504.06835 | null |
2025-04-09 | Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups | Rijul Magu et.al. | 2504.06160 | null |
2025-04-09 | Leanabell-Prover: Posttraining Scaling in Formal Reasoning | Jingyuan Zhang et.al. | 2504.06122 | null |
2025-04-09 | CAI: An Open, Bug Bounty-Ready Cybersecurity AI | Víctor Mayoral-Vilches et.al. | 2504.06017 | null |
2025-04-08
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-08 | GOLLuM: Gaussian Process Optimized LLMs -- Reframing LLM Finetuning through Bayesian Optimization | Bojana Ranković et.al. | 2504.06265 | null |
2025-04-08 | OmniSVG: A Unified Scalable Vector Graphics Generation Model | Yiying Yang et.al. | 2504.06263 | null |
2025-04-08 | Hogwild! Inference: Parallel LLM Generation via Concurrent Attention | Gleb Rodionov et.al. | 2504.06261 | null |
2025-04-08 | FEABench: Evaluating Language Models on Multiphysics Reasoning Ability | Nayantara Mudur et.al. | 2504.06260 | null |
2025-04-08 | Transfer between Modalities with MetaQueries | Xichen Pan et.al. | 2504.06256 | null |
2025-04-08 | Electronic Structure Guided Inverse Design Using Generative Models | Shuyi Jia et.al. | 2504.06249 | null |
2025-04-08 | LExT: Towards Evaluating Trustworthiness of Natural Language Explanations | Krithi Shailya et.al. | 2504.06227 | null |
2025-04-08 | Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation | Biao Zhang et.al. | 2504.06225 | null |
2025-04-08 | Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs | Dongyang Fan et.al. | 2504.06219 | null |
2025-04-08 | From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models | Chejian Xu et.al. | 2504.06214 | null |
2025-04-08 | TxGemma: Efficient and Agentic LLMs for Therapeutics | Eric Wang et.al. | 2504.06196 | null |
2025-04-08 | ARLO: A Tailorable Approach for Transforming Natural Language Software Requirements into Architecture using LLMs | Tooraj Helmi et.al. | 2504.06143 | null |
2025-04-08 | QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform | Movina Moses et.al. | 2504.06136 | null |
2025-04-08 | FaceCloak: Learning to Protect Face Templates | Sudipta Banerjee et.al. | 2504.06131 | null |
2025-04-08 | Nonuniform-Tensor-Parallelism: Mitigating GPU failure impact for Scaled-up LLM Training | Daiyaan Arfeen et.al. | 2504.06095 | null |
2025-04-08 | Multi-Sense Embeddings for Language Models and Knowledge Distillation | Qitong Wang et.al. | 2504.06036 | null |
2025-04-08 | Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi | Monojit Choudhury et.al. | 2504.06011 | null |
2025-04-08 | Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG | Hengran Zhang et.al. | 2504.05220 | null |
2025-04-07
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-07 | Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations | Pedro Ferreira et.al. | 2504.05294 | null |
2025-04-07 | The challenge of uncertainty quantification of large language models in medicine | Zahra Atf et.al. | 2504.05278 | null |
2025-04-07 | Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation | Yucheng Chu et.al. | 2504.05276 | null |
2025-04-07 | Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models | Yang Yan et.al. | 2504.05262 | null |
2025-04-07 | How to evaluate control measures for LLM agents? A trajectory from today to superintelligence | Tomek Korbak et.al. | 2504.05259 | null |
2025-04-07 | Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models | Adrián Bazaga et.al. | 2504.05258 | null |
2025-04-07 | LLM-based Automated Grading with Human-in-the-Loop | Hang Li et.al. | 2504.05239 | null |
2025-04-07 | Mapping biodiversity at very-high resolution in Europe | César Leblanc et.al. | 2504.05231 | null |
2025-04-07 | LLM-Alignment Live-Streaming Recommendation | Yueyang Liu et.al. | 2504.05217 | null |
2025-04-07 | Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling | Hengran Zhang et.al. | 2504.05216 | null |
2025-04-07 | Post-Training Language Models for Continual Relation Extraction | Sefika Efeoglu et.al. | 2504.05214 | null |
2025-04-07 | Quantum Program Linting with LLMs: Emerging Results from a Comparative Study | Seung Yeob Shin et.al. | 2504.05204 | null |
2025-04-07 | P2Mark: Plug-and-play Parameter-intrinsic Watermarking for Neural Speech Generation | Yong Ren et.al. | 2504.05197 | null |
2025-04-07 | Concise Reasoning via Reinforcement Learning | Mehdi Fatemi et.al. | 2504.05185 | null |
2025-04-07 | BRIDGES: Bridging Graph Modality and Large Language Models within EDA Tasks | Wei Li et.al. | 2504.05180 | null |
2025-04-07 | Learning symmetries in datasets | Veronica Sanz et.al. | 2504.05174 | null |
2025-04-07 | Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness | Dongzhuoran Zhou et.al. | 2504.05163 | null |
2025-04-07 | DDPM Score Matching and Distribution Learning | Sinho Chewi et.al. | 2504.05161 | null |
2025-04-07 | Pr$εε$mpt: Sanitizing Sensitive Prompts for LLMs | Amrita Roy Chowdhury et.al. | 2504.05147 | null |
2025-04-04
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-04 | MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models | Wulin Xie et.al. | 2504.03641 | null |
2025-04-04 | Do Larger Language Models Imply Better Reasoning? A Pretraining Scaling Law for Reasoning | Xinyi Wang et.al. | 2504.03635 | null |
2025-04-04 | Enhancing Causal Effect Estimation with Diffusion-Generated Data | Li Chen et.al. | 2504.03630 | null |
2025-04-04 | Align to Structure: Aligning Large Language Models with Structural Information | Zae Myung Kim et.al. | 2504.03622 | null |
2025-04-04 | VISTA-OCR: Towards generative and interactive end to end OCR models | Laziz Hamdi et.al. | 2504.03621 | null |
2025-04-04 | Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task | Leonardo Ranaldi et.al. | 2504.03616 | null |
2025-04-04 | Autonomous and Self-Adapting System for Synthetic Media Detection and Attribution | Aref Azizpour et.al. | 2504.03615 | null |
2025-04-04 | AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset | Bingxiang He et.al. | 2504.03612 | null |
2025-04-04 | APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay | Akshara Prabhakar et.al. | 2504.03601 | null |
2025-04-04 | EnrichIndex: Using LLMs to Enrich Retrieval Indices Offline | Peter Baile Chen et.al. | 2504.03598 | null |
2025-04-04 | Hallucination Detection on a Budget: Efficient Bayesian Estimation of Semantic Entropy | Kamil Ciosek et.al. | 2504.03579 | null |
2025-04-04 | SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement | Runnan Fang et.al. | 2504.03561 | null |
2025-04-04 | Agentic Knowledgeable Self-awareness | Shuofei Qiao et.al. | 2504.03553 | null |
2025-04-04 | Diverse In-Context Example Selection After Decomposing Programs and Aligned Utterances Improves Semantic Parsing | Mayank Kothyari et.al. | 2504.03541 | null |
2025-04-04 | HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration | Boyuan Wang et.al. | 2504.03536 | null |
2025-04-04 | Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles | Chen Wei Kuo et.al. | 2504.03520 | null |
2025-04-04 | Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej | Shubham Kumar Nigam et.al. | 2504.03486 | null |
2025-04-04 | D-Garment: Physics-Conditioned Latent Diffusion for Dynamic Garment Deformations | Antoine Dumoulin et.al. | 2504.03468 | null |
2025-04-04 | Generating ensembles of spatially-coherent in-situ forecasts using flow matching | David Landry et.al. | 2504.03463 | null |
2025-04-04 | Conditioning Diffusions Using Malliavin Calculus | Jakiw Pidstrigach et.al. | 2504.03461 | null |
2025-04-04 | A Survey of Large Language Models in Mental Health Disorder Detection on Social Media | Zhuohan Ge et.al. | 2504.02800 | null |
2025-04-04 | RBT4DNN: Requirements-based Testing of Neural Networks | Nusrat Jahan Mozumder et.al. | 2504.02737 | null |
2025-04-04 | Why do LLMs attend to the first token? | Federico Barbero et.al. | 2504.02732 | null |
2025-04-03
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-03 | Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models | Mateusz Pach et.al. | 2504.02821 | null |
2025-04-03 | Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization | Kangle Deng et.al. | 2504.02817 | null |
2025-04-03 | Generative Evaluation of Complex Reasoning in Large Language Models | Haowei Lin et.al. | 2504.02810 | null |
2025-04-03 | MegaMath: Pushing the Limits of Open Math Corpora | Fan Zhou et.al. | 2504.02807 | null |
2025-04-03 | A Framework for Robust Cognitive Evaluation of LLMs | Karin de Langis et.al. | 2504.02789 | null |
2025-04-03 | From Consumption to Collaboration: Measuring Interaction Patterns to Augment Human Cognition in Open-Ended Tasks | Joshua Holstein et.al. | 2504.02780 | null |
2025-04-03 | BT-ACTION: A Test-Driven Approach for Modular Understanding of User Instruction Leveraging Behaviour Trees and LLMs | Alexander Leszczynski et.al. | 2504.02779 | null |
2025-04-03 | MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs | Jaap Jumelet et.al. | 2504.02768 | null |
2025-04-03 | How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices? | Andres Algaba et.al. | 2504.02767 | null |
2025-04-03 | Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model | Shengjun Zhang et.al. | 2504.02764 | null |
2025-04-03 | Echoes of the hidden: Uncovering coordination beyond network structure | Shahar Somin et.al. | 2504.02757 | null |
2025-04-03 | Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study | Aryan Agrawal et.al. | 2504.02733 | null |
2025-04-03 | ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization | Kehua Feng et.al. | 2504.02725 | null |
2025-04-03 | TeleMoM: Consensus-Driven Telecom Intelligence via Mixture of Models | Xinquan Wang et.al. | 2504.02712 | null |
2025-04-03 | The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context | Nikhil Verma et.al. | 2504.02708 | null |
2025-04-03 | LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems | Zishuo Liu et.al. | 2504.02671 | null |
2025-04-03 | Affordable AI Assistants with Knowledge Graph of Thoughts | Maciej Besta et.al. | 2504.02670 | null |
2025-04-03 | VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step | Hanyang Wang et.al. | 2504.01956 | null |
2025-04-03 | Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation | Baban Gain et.al. | 2504.01919 | null |
2025-04-02
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-04-02 | The LLM Wears Prada: Analysing Gender Bias and Stereotypes through Online Shopping Data | Massimiliano Luca et.al. | 2504.01951 | null |
2025-04-02 | OpenCodeReasoning: Advancing Data Distillation for Competitive Coding | Wasi Uddin Ahmad et.al. | 2504.01943 | null |
2025-04-02 | A Unified Approach to Analysis and Design of Denoising Markov Models | Yinuo Ren et.al. | 2504.01938 | null |
2025-04-02 | Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length? | Celine Lee et.al. | 2504.01935 | null |
2025-04-02 | Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection | Souradip Chakraborty et.al. | 2504.01931 | null |
2025-04-02 | A thorough benchmark of automatic text classification: From traditional approaches to large language models | Washington Cunha et.al. | 2504.01930 | null |
2025-04-02 | Is the Reversal Curse a Binding Problem? Uncovering Limitations of Transformers from a Basic Generalization Failure | Boshi Wang et.al. | 2504.01928 | null |
2025-04-02 | Gen-C: Populating Virtual Worlds with Generative Crowds | Andreas Panayiotou et.al. | 2504.01924 | null |
2025-04-02 | Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning | Yinggan Xu et.al. | 2504.01911 | null |
2025-04-02 | Build Code Needs Maintenance Too: A Study on Refactoring and Technical Debt in Build Systems | Anwar Ghammam et.al. | 2504.01907 | null |
2025-04-02 | STAR-1: Safer Alignment of Reasoning LLMs with 1K Data | Zijun Wang et.al. | 2504.01903 | null |
2025-04-02 | Multi-fidelity Parameter Estimation Using Conditional Diffusion Models | Caroline Tatsuoka et.al. | 2504.01894 | null |
2025-04-02 | TransientTables: Evaluating LLMs' Reasoning on Temporally Evolving Semi-structured Tables | Abhilash Shankarampeta et.al. | 2504.01879 | null |
2025-04-02 | Interpreting Emergent Planning in Model-Free Reinforcement Learning | Thomas Bush et.al. | 2504.01871 | null |
2025-04-02 | From Code Generation to Software Testing: AI Copilot with Context-Based RAG | Yuchen Wang et.al. | 2504.01866 | null |
2025-04-02 | Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models | Zhiwei Yu et.al. | 2504.01857 | null |
2025-04-02 | Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks | Ali Al-Kaswan et.al. | 2504.01850 | null |
2025-04-02 | PaperBench: Evaluating AI's Ability to Replicate AI Research | Giulio Starace et.al. | 2504.01848 | null |
2025-03-31
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-31 | Consistent Subject Generation via Contrastive Instantiated Concepts | Lee Hsin-Ying et.al. | 2503.24387 | null |
2025-03-31 | Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation | Shengqiong Wu et.al. | 2503.24379 | null |
2025-03-31 | Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models | Rui Wang et.al. | 2503.24377 | **[link](https://github.com/devoallen/awesome-reasoning-economy-papers)** |
2025-03-31 | Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 | Yi Chen et.al. | 2503.24376 | **[link](https://github.com/tencentarc/seed-bench-r1)** |
2025-03-31 | Effectively Controlling Reasoning Models through Thinking Intervention | Tong Wu et.al. | 2503.24370 | null |
2025-03-31 | SQuat: Subspace-orthogonal KV Cache Quantization | Hao Wang et.al. | 2503.24358 | null |
2025-03-31 | ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion | Rana Muhammad Shahroz Khan et.al. | 2503.24354 | null |
2025-03-31 | BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models | Alok Abhishek et.al. | 2503.24310 | null |
2025-03-31 | A Systematic Evaluation of LLM Strategies for Mental Health Text Analysis: Fine-tuning vs. Prompt Engineering vs. RAG | Arshia Kermani et.al. | 2503.24307 | null |
2025-03-31 | Is analogy enough to draw novel adjective-noun inferences? | Hayley Ross et.al. | 2503.24293 | null |
2025-03-31 | Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning | Jiacheng Lin et.al. | 2503.24289 | **[link](https://github.com/linjc16/Rec-R1)** |
2025-03-31 | Evaluating and Designing Sparse Autoencoders by Approximating Quasi-Orthogonality | Sewoong Lee et.al. | 2503.24277 | **[link](https://github.com/sewoonglee/top-afa-sae)** |
2025-03-31 | Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation | Dun Yuan et.al. | 2503.24245 | null |
2025-03-31 | What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models | Qiyuan Zhang et.al. | 2503.24235 | null |
2025-03-31 | Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes | Daichi Otsuka et.al. | 2503.24229 | null |
2025-03-31 | PAARS: Persona Aligned Agentic Retail Shoppers | Saab Mansour et.al. | 2503.24228 | null |
2025-03-31 | Synthetic News Generation for Fake News Classification | Abdul Sittar et.al. | 2503.24206 | null |
2025-03-31 | TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers' Guidance | Jingxian Xu et.al. | 2503.24198 | null |
2025-03-31 | Text2Tracks: Prompt-based Music Recommendation via Generative Retrieval | Enrico Palumbo et.al. | 2503.24193 | null |
2025-03-31 | Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms | Shuoming Zhang et.al. | 2503.24191 | null |
2025-03-28
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-28 | Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions | Mohammad Almansoori et.al. | 2503.22678 | null |
2025-03-28 | DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness | Ruining Li et.al. | 2503.22677 | null |
2025-03-28 | QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks? | Belinda Z. Li et.al. | 2503.22674 | null |
2025-03-28 | Unicorn: Text-Only Data Synthesis for Vision Language Model Training | Xiaomin Yu et.al. | 2503.22655 | **[link](https://github.com/yu-xm/unicorn)** |
2025-03-28 | Using AI to Summarize US Presidential Campaign TV Advertisement Videos, 1952-2012 | Adam Breuer et.al. | 2503.22589 | **[link](https://github.com/adambreuer/ai-summarizevid)** |
2025-03-28 | LLM-enabled Instance Model Generation | Fengjunjie Pan et.al. | 2503.22587 | null |
2025-03-28 | Historical Ink: Exploring Large Language Models for Irony Detection in 19th-Century Spanish | Kevin Cohen et.al. | 2503.22585 | **[link](https://github.com/historicalink/ironydetection)** |
2025-03-28 | Drop the Golden Apples: Identifying Third-Party Reuse by DB-Less Software Composition Analysis | Lyuye Zhang et.al. | 2503.22576 | null |
2025-03-28 | RELD: Regularization by Latent Diffusion Models for Image Restoration | Pasquale Cascarano et.al. | 2503.22563 | null |
2025-03-28 | Niyama: Breaking the Silos of LLM Inference Serving | Kanishk Goel et.al. | 2503.22562 | null |
2025-03-28 | Bridging the Dimensional Chasm: Uncover Layer-wise Dimensional Reduction in Transformers through Token Correlation | Zhuo-Yang Song et.al. | 2503.22547 | null |
2025-03-28 | Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities | Raman Dutt et.al. | 2503.22517 | null |
2025-03-28 | Unlocking LLM Repair Capabilities in Low-Resource Programming Languages Through Cross-Language Translation and Multi-Agent Refinement | Wenqiang Luo et.al. | 2503.22512 | null |
2025-03-28 | WorkTeam: Constructing Workflows from Natural Language with Multi-Agents | Hanchao Liu et.al. | 2503.22473 | null |
2025-03-28 | Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey | Shengyue Guan et.al. | 2503.22458 | null |
2025-03-28 | Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning | Abdullah Vanlioglu et.al. | 2503.22456 | null |
2025-03-28 | STADE: Standard Deviation as a Pruning Metric | Diego Coello de Portugal Mecke et.al. | 2503.22451 | **[link](https://github.com/coello-dev/stade)** |
2025-03-28 | NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving | Fuhao Li et.al. | 2503.22436 | null |
2025-03-28 | CoSIL: Software Issue Localization via LLM-Driven Code Repository Graph Searching | Zhonghao Jiang et.al. | 2503.22424 | **[link](https://github.com/zhonghaojiang/cosil)** |
2025-03-28 | Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis | Jiangyong Huang et.al. | 2503.22420 | null |
2025-03-27
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-27 | A Unified Image-Dense Annotation Generation Model for Underwater Scenes | Hongkai Lin et.al. | 2503.21771 | **[link](https://github.com/hongklin/tide)** |
2025-03-27 | Exploring the Evolution of Physics Cognition in Video Generation: A Survey | Minghui Lin et.al. | 2503.21765 | **[link](https://github.com/minnie-lin/awesome-physics-cognition-based-video-generation)** |
2025-03-27 | MemInsight: Autonomous Memory Augmentation for LLM Agents | Rana Salama et.al. | 2503.21760 | null |
2025-03-27 | Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck | Adrian Bulat et.al. | 2503.21757 | null |
2025-03-27 | A Unified Framework for Diffusion Bridge Problems: Flow Matching and Schrödinger Matching into One | Minyoung Kim et.al. | 2503.21756 | null |
2025-03-27 | VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness | Dian Zheng et.al. | 2503.21755 | **[link](https://github.com/vchitect/vbench)** |
2025-03-27 | 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models | Yuhan Zhang et.al. | 2503.21745 | null |
2025-03-27 | GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics | Arsham Gholamzadeh Khoee et.al. | 2503.21735 | null |
2025-03-27 | Effective Skill Unlearning through Intervention and Abstention | Yongce Li et.al. | 2503.21730 | **[link](https://github.com/trustworthy-ml-lab/effective_skill_unlearning)** |
2025-03-27 | Collab: Controlled Decoding using Mixture of Agents for LLM Alignment | Souradip Chakraborty et.al. | 2503.21720 | null |
2025-03-27 | CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers? | Jiefu Ou et.al. | 2503.21717 | null |
2025-03-27 | Enhancing Repository-Level Software Repair via Repository-Aware Knowledge Graphs | Boyang Yang et.al. | 2503.21710 | null |
2025-03-27 | Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data | Zhiyuan Ma et.al. | 2503.21694 | **[link](https://github.com/theericma/triplaneturbo)** |
2025-03-27 | LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning | Hui Wang et.al. | 2503.21683 | null |
2025-03-27 | A friendly introduction to triangular transport | Maximilian Ramgraber et.al. | 2503.21673 | null |
2025-03-27 | COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing | Rajvee Sheth et.al. | 2503.21670 | null |
2025-03-27 | UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning | Zhengxi Lu et.al. | 2503.21620 | null |
2025-03-27 | A Measure Based Generalizable Approach to Understandability | Vikas Kushwaha et.al. | 2503.21615 | null |
2025-03-27 | Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach | Javier Coronado-Blázquez et.al. | 2503.21613 | null |
2025-03-27 | Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing | Johan Wahréus et.al. | 2503.21598 | null |
2025-03-27 | Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs | Yuxuan Lu et.al. | 2503.20749 | null |
2025-03-26
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-03-26 | Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark | Sondos Mahmoud Bsharat et.al. | 2503.20786 | null |
2025-03-26 | Understanding R1-Zero-Like Training: A Critical Perspective | Zichen Liu et.al. | 2503.20783 | null |
2025-03-26 | Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields | Shijie Zhou et.al. | 2503.20776 | null |
2025-03-26 | Reliable algorithm selection for machine learning-guided design | Clara Fannjiang et.al. | 2503.20767 | null |
2025-03-26 | MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search | Yunhai Hu et.al. | 2503.20757 | null |
2025-03-26 | Continual learning via probabilistic exchangeable sequence modelling | Hanwen Xing et.al. | 2503.20725 | null |
2025-03-26 | From Annotation to Adaptation: Metrics, Synthetic Data, and Aspect Extraction for Aspect-Based Sentiment Analysis with Large Language Models | Nikita Neveditsin et.al. | 2503.20715 | null |
2025-03-26 | GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection | Xingyu Peng et.al. | 2503.20682 | null |
2025-03-26 | Vision as LoRA | Han Wang et.al. | 2503.20680 | null |
2025-03-26 | BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation | Yuyang Peng et.al. | 2503.20672 | null |
2025-03-26 | TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews | Huimin Xu et.al. | 2503.20666 | null |
2025-03-26 | ARMO: Autoregressive Rigging for Multi-Category Objects | Mingze Sun et.al. | 2503.20663 | null |
2025-03-26 | TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes | Raj Sanjay Shah et.al. | 2503.20648 | null |
2025-03-26 | Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging | Han Wu et.al. | 2503.20641 | null |
2025-03-26 | Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions | Alessandro Maisto et.al. | 2503.20623 | null |
2025-03-26 | Diffusion Counterfactuals for Image Regressors | Trung Duc Ha et.al. | 2503.20595 | null |
2025-03-26 | Supply chain network rewiring dynamics at the firm-level | Tobias Reisch et.al. | 2503.20594 | null |
2025-03-26 | What to Retrieve for Effective Retrieval-Augmented Code Generation? An Empirical Study and Beyond | Wenchao Gu et.al. | 2503.20589 | null |
2025-03-26 | Synthetic Data Augmentation for Cross-domain Implicit Discourse Relation Recognition | Frances Yung et.al. | 2503.20588 | null |
2025-03-25
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-25 | CoLLM: A Large Language Model for Composed Image Retrieval | Chuong Huynh et.al. | 2503.19910 | **[link](https://github.com/hmchuong/CoLLM)** |
2025-03-25 | Scaling Vision Pre-Training to 4K Resolution | Baifeng Shi et.al. | 2503.19903 | null |
2025-03-25 | ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models | Fernando Julio Cendra et.al. | 2503.19902 | null |
2025-03-25 | CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation | Nengbo Wang et.al. | 2503.19878 | null |
2025-03-25 | Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking | Xiaoyu Tian et.al. | 2503.19855 | null |
2025-03-25 | FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs | Carlos Plou et.al. | 2503.19850 | null |
2025-03-25 | A Comparative Analysis of Word Segmentation, Part-of-Speech Tagging, and Named Entity Recognition for Historical Chinese Sources, 1900-1950 | Zhao Fang et.al. | 2503.19844 | null |
2025-03-25 | FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model | Jun Zhou et.al. | 2503.19839 | null |
2025-03-25 | TopoGEN: topology-driven microstructure generation for in silico modeling of fiber network mechanics | Sara Cardona et.al. | 2503.19832 | null |
2025-03-25 | IgCraft: A versatile sequence generation framework for antibody discovery and engineering | Matthew Greenig et.al. | 2503.19821 | null |
2025-03-25 | PAVE: Patching and Adapting Video Large Language Models | Zhuoming Liu et.al. | 2503.19794 | null |
2025-03-25 | Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models | Kartik Thakral et.al. | 2503.19783 | null |
2025-03-25 | ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation | Haoyu Fu et.al. | 2503.19755 | null |
2025-03-25 | Inducing Personality in LLM-Based Honeypot Agents: Measuring the Effect on Human-Like Agenda Generation | Lewis Newsham et.al. | 2503.19752 | null |
2025-03-25 | Optimizing Photonic Structures with Large Language Model Driven Algorithm Discovery | Haoran Yin et.al. | 2503.19742 | null |
2025-03-25 | Writing as a testbed for open ended agents | Sian Gooding et.al. | 2503.19711 | null |
2025-03-25 | AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation | Itay Nakash et.al. | 2503.19693 | null |
2025-03-25 | AIGC-assisted Federated Learning for Vehicular Edge Intelligence: Vehicle Selection, Resource Allocation and Model Augmentation | Xianke Qiang et.al. | 2503.19676 | null |
2025-03-25 | CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation | Rupak Bose et.al. | 2503.19661 | null |
2025-03-25 | HausaNLP at SemEval-2025 Task 3: Towards a Fine-Grained Model-Aware Hallucination Detection | Maryam Bala et.al. | 2503.19650 | null |
2025-03-24
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-24 | Equivariant Image Modeling | Ruixiao Dong et.al. | 2503.18948 | **[link](https://github.com/drx-code/EquivariantModeling)** |
2025-03-24 | Aether: Geometric-Aware Unified World Modeling | Aether Team et.al. | 2503.18945 | null |
2025-03-24 | SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding | Mingze Xu et.al. | 2503.18943 | null |
2025-03-24 | Video-T1: Test-Time Scaling for Video Generation | Fangfu Liu et.al. | 2503.18942 | null |
2025-03-24 | Exploring Training and Inference Scaling Laws in Generative Retrieval | Hongru Cai et.al. | 2503.18941 | null |
2025-03-24 | CoMP: Continual Multimodal Pre-training for Vision Foundation Models | Yitong Chen et.al. | 2503.18931 | **[link](https://github.com/SliMM-X/CoMP-MM)** |
2025-03-24 | Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training | Brian R. Bartoldson et.al. | 2503.18929 | null |
2025-03-24 | Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models | Meng Cao et.al. | 2503.18923 | null |
2025-03-24 | xKV: Cross-Layer SVD for KV-Cache Compression | Chi-Chih Chang et.al. | 2503.18893 | null |
2025-03-24 | AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration | Zhexuan Wang et.al. | 2503.18891 | null |
2025-03-24 | I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders | Andrey Galichin et.al. | 2503.18878 | null |
2025-03-24 | A semantic communication-based workload-adjustable transceiver for wireless AI-generated content (AIGC) delivery | Runze Cheng et.al. | 2503.18874 | null |
2025-03-24 | Reimagining Memory Access for LLM Inference: Compression-Aware Memory Controller Design | Rui Xie et.al. | 2503.18869 | null |
2025-03-24 | Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations | Junlan Chen et.al. | 2503.18865 | null |
2025-03-24 | 3DSwapping: Texture Swapping For 3D Object From Single Reference Image | Xiao Cao et.al. | 2503.18853 | null |
2025-03-24 | EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments | Sara Fish et.al. | 2503.18825 | null |
2025-03-24 | Defeating Prompt Injections by Design | Edoardo Debenedetti et.al. | 2503.18813 | null |
2025-03-24 | Classical Planning with LLM-Generated Heuristics: Challenging the State of the Art with Python Code | Augusto B. Corrêa et.al. | 2503.18809 | null |
2025-03-24 | REALM: A Dataset of Real-World LLM Use Cases | Jingwen Cheng et.al. | 2503.18792 | null |
2025-03-24 | BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache | Dayou Du et.al. | 2503.18773 | null |
2025-03-21
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-03-21 | Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique | Yansi Li et.al. | 2503.17363 | null |
2025-03-21 | Position: Interactive Generative Video as Next-Generation Game Engine | Jiwen Yu et.al. | 2503.17359 | null |
2025-03-21 | OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement | Yihe Deng et.al. | 2503.17352 | null |
2025-03-21 | Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs | Reem Gody et.al. | 2503.17336 | null |
2025-03-21 | CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities | Yuxuan Zhu et.al. | 2503.17332 | **[link](https://github.com/uiuc-kang-lab/cve-bench)** |
2025-03-21 | LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language | Kun Chu et.al. | 2503.17309 | null |
2025-03-21 | Bugdar: AI-Augmented Secure Code Review for GitHub Pull Requests | John Naulty et.al. | 2503.17302 | null |
2025-03-21 | Offline Model-Based Optimization: Comprehensive Review | Minsu Kim et.al. | 2503.17286 | null |
2025-03-21 | CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement | Gaifan Zhang et.al. | 2503.17279 | null |
2025-03-21 | Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras | Shuang Guo et.al. | 2503.17262 | null |
2025-03-21 | SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging | Aladin Djuhera et.al. | 2503.17239 | null |
2025-03-21 | FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs | Albert Sawczyn et.al. | 2503.17229 | null |
2025-03-21 | Neuro-Symbolic Scene Graph Conditioning for Synthetic Image Dataset Generation | Giacomo Savazzi et.al. | 2503.17224 | null |
2025-03-21 | Automating Adjudication of Cardiovascular Events Using Large Language Models | Sonish Sivarajkumar et.al. | 2503.17222 | null |
2025-03-21 | TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning | Sheng Wang et.al. | 2503.17195 | null |
2025-03-21 | LLMs Love Python: A Study of LLMs' Bias for Programming Languages and Libraries | Lukas Twist et.al. | 2503.17181 | null |
2025-03-21 | D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens | Panpan Wang et.al. | 2503.17155 | null |
2025-03-21 | Modifying Large Language Model Post-Training for Diverse Creative Writing | John Joon Young Chung et.al. | 2503.17126 | null |
2025-03-21 | Large Language Model Compression via the Nested Activation-Aware Decomposition | Jun Lu et.al. | 2503.17101 | null |
2025-03-21 | A Study into Investigating Temporal Robustness of LLMs | Jonas Wallat et.al. | 2503.17073 | null |