2025-12-01

Title: Cacheback: Speculative Decoding With Nothing But Cache

Title: 47B Mixture-of-Experts Beats 671B Dense Models on Chinese Medical Examinations

Title: CSV-Decode: Certifiable Sub-Vocabulary Decoding for Efficient Large Language Model Inference

Title: Evaluating Embedding Generalization: How LLMs, LoRA, and SLERP Shape Representational Geometry

Title: Insight-A: Attribution-aware for Multimodal Misinformation Detection

Title: A General Highly Accurate Online Planning Method Integrating Large Language Models into Nested Rollout Policy Adaptation for Dialogue Tasks

Title: Lost in the Pipeline: How Well Do Large Language Models Handle Data Preparation?

Title: Quantifying and Mitigating Selection Bias in LLMs: A Transferable LoRA Fine-Tuning and Efficient Majority Voting Approach

Title: Addressing Stereotypes in Large Language Models: A Critical Examination and Mitigation

Title: EulerESG: Automating ESG Disclosure Analysis with LLMs

Title: An Optimized Machine Learning Classifier for Detecting Fake Reviews Using Extracted Features

Title: CrossCheck-Bench: Diagnosing Compositional Failures in Multimodal Conflict Resolution

Title: When Harmless Words Harm: A New Threat to LLM Safety via Conceptual Triggers

Title: PeerCoPilot: A Language Model-Powered Assistant for Behavioral Health Organizations

Title: German General Personas: A Survey-Derived Persona Prompt Collection for Population-Aligned LLM Studies

Title: AD-CDO: A Lightweight Ontology for Representing Eligibility Criteria in Alzheimer's Disease Clinical Trials

Title: PromptTailor: Multi-turn Intent-Aligned Prompt Synthesis for Lightweight LLMs

Title: Goal-Directed Search Outperforms Goal-Agnostic Memory Compression in Long-Context Memory Tasks

Title: Affective Multimodal Agents with Proactive Knowledge Grounding for Emotionally Aligned Marketing Dialogue

Title: Identifying Quantum Structure in AI Language: Evidence for Evolutionary Convergence of Human and Artificial Cognition

Title: HUMORCHAIN: Theory-Guided Multi-Stage Reasoning for Interpretable Multimodal Humor Generation

Title: RoSA: Enhancing Parameter-Efficient Fine-Tuning via RoPE-aware Selective Adaptation in Large Language Models

Title: Asking LLMs to Verify First is Almost Free Lunch

Title: R2Q: Towards Robust 2-Bit Large Language Models via Residual Refinement Quantization

Title: Polarity-Aware Probing for Quantifying Latent Alignment in Language Models

Title: Decoding inner speech with an end-to-end brain-to-text neural interface

Title: A Multiscale Geometric Method for Capturing Relational Topic Alignment

Title: EduMod-LLM: A Modular Approach for Designing Flexible and Transparent Educational Assistants

Title: A Lightweight Approach to Detection of AI-Generated Texts Using Stylometric Features

Title: DELTA: Language Diffusion-based EEG-to-Text Architecture

Title: Building Domain-Specific Small Language Models via Guided Data Generation

Title: Proactive Defense: Compound AI for Detecting Persuasion Attacks and Measuring Inoculation Effectiveness

Title: SO-Bench: A Structural Output Evaluation of Multimodal LLMs

Title: Semantics as a Shield: Label Disguise Defense (LDD) against Prompt Injection in LLM Sentiment Classification

Title: Extracting Disaster Impacts and Impact Related Locations in Social Media Posts Using Large Language Models

Title: Dissecting the Ledger: Locating and Suppressing "Liar Circuits" in Financial Large Language Models

Title: A Longitudinal Measurement of Privacy Policy Evolution for Large Language Models

Title: Orchestrating Dual-Boundaries: An Arithmetic Intensity Inspired Acceleration Framework for Diffusion Language Models

Title: fMRI-LM: Towards a Universal Foundation Model for Language-Aligned fMRI Understanding

Title: LLMs for Low-Resource Dialect Translation Using Context-Aware Prompting: A Case Study on Sylheti

Title: Factors That Support Grounded Responses in LLM Conversations: A Rapid Review

Title: Adaptive Detection of Polymorphic Malware: Leveraging Mutation Engines and YARA Rules for Enhanced Security

Title: Categorical Framework for Quantum-Resistant Zero-Trust AI Security

Title: Physics-Informed Spiking Neural Networks via Conservative Flux Quantization

Title: Advanced Data Collection Techniques in Cloud Security: A Multi-Modal Deep Learning Autoencoder Approach

Title: The Double-Edged Nature of the Rashomon Set for Trustworthy Machine Learning

Title: Beyond Membership: Limitations of Add/Remove Adjacency in Differential Privacy

Title: Unsupervised Anomaly Detection for Smart IoT Devices: Performance and Resource Comparison

Title: FLAWS: A Benchmark for Error Identification and Localization in Scientific Papers

Title: Improving Score Reliability of Multiple Choice Benchmarks with Consistency Evaluation and Altered Answer Choices

Title: Towards a Foundation Model for Partial Differential Equations Across Physics Domains

Title: Saddle-Free Guidance: Improved On-Manifold Sampling without Labels or Additional Training

Title: Closed-Loop Transformers: Autoregressive Modeling as Iterative Latent Equilibrium

Title: Physically Interpretable Representation Learning with Gaussian Mixture Variational AutoEncoder (GM-VAE)

Title: UniArt: Unified 3D Representation for Generating 3D Articulated Objects with Open-Set Articulation

Title: Breaking the Illusion: Consensus-Based Generative Mitigation of Adversarial Illusions in Multi-Modal Embeddings

Title: Standardized Threat Taxonomy for AI Security, Governance, and Regulatory Compliance

Title: Adaptive Parameter Optimization for Robust Remote Photoplethysmography

Title: Multi-Modal Machine Learning for Early Trust Prediction in Human-AI Interaction Using Face Image and GSR Bio Signals

Title: Exploring Dynamic Properties of Backdoor Training Through Information Bottleneck

Title: Prompted Policy Search: Reinforcement Learning through Linguistic and Numerical Reasoning in LLMs

Title: A Comparative Study of LLM Prompting and Fine-Tuning for Cross-genre Authorship Attribution on Chinese Lyrics

Title: Does the Model Say What the Data Says? A Simple Heuristic for Model Data Alignment

Title: Modeling Quantum Autoencoder Trainable Kernel for IoT Anomaly Detection

Title: Heterogeneous Multi-Agent Reinforcement Learning with Attention for Cooperative and Scalable Feature Transformation

Title: Deep Learning Architectures for Code-Modulated Visual Evoked Potentials Detection

Title: AmodalGen3D: Generative Amodal 3D Object Reconstruction from Sparse Unposed Views

Title: ABLE: Using Adversarial Pairs to Construct Local Models for Explaining Model Predictions

Title: DeepGI: Explainable Deep Learning for Gastrointestinal Image Classification

Title: CTR Prediction on Alibaba's Taobao Advertising Dataset Using Traditional and Deep Learning Models

Title: MOTIF-RF: Multi-template On-chip Transformer Synthesis Incorporating Frequency-domain Self-transfer Learning for RFIC Design Automation

Title: Start Making Sense(s): A Developmental Probe of Attention Specialization Using Lexical Ambiguity

Title: DialBench: Towards Accurate Reading Recognition of Pointer Meter using Large Foundation Models

Title: PPBoost: Progressive Prompt Boosting for Text-Driven Medical Image Segmentation

Title: A Safety and Security Framework for Real-World Agentic Systems

Title: Can Multi-Modal LLMs Provide Live Step-by-Step Task Guidance?

Title: GECKO: Securing Digital Assets Through(out) the Physical World (Extended Technical Report)

Title: StreamFlow: Theory, Algorithm, and Implementation for High-Efficiency Rectified Flow Generation

Title: AfriStereo: A Culturally Grounded Dataset for Evaluating Stereotypical Bias in Large Language Models

Title: POLARIS: Cross-Domain Access Control via Verifiable Identity and Policy-Based Authorization

Title: Intra-Class Probabilistic Embeddings for Uncertainty Estimation in Vision-Language Models

Title: Layover or Direct Flight: Rethinking Audio-Guided Image Segmentation

Title: Calibration-Free EEG-based Driver Drowsiness Detection with Online Test-Time Adaptation

Title: Early Risk Prediction with Temporally and Contextually Grounded Clinical Language Processing

Title: SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model

Title: Distillability of LLM Security Logic: Predicting Attack Success Rate of Outline Filling Attack via Ranking Regression

Title: Evaluating the Robustness of Large Language Model Safety Guardrails Against Adversarial Attacks

Title: ICM-SR: Image-Conditioned Manifold Regularization for Image Super-Resoultion

Title: OralGPT-Omni: A Versatile Dental Multimodal Large Language Model

Title: Convergence Dynamics of Over-Parameterized Score Matching for a Single Gaussian

Title: ARES: Anomaly Recognition Model For Edge Streams

Title: A Fast and Flat Federated Learning Method via Weighted Momentum and Sharpness-Aware Minimization

Title: Binary-30K: A Heterogeneous Dataset for Deep Learning in Binary Analysis and Malware Detection

Title: WorldWander: Bridging Egocentric and Exocentric Worlds in Video Generation

Title: Decomposed Trust: Exploring Privacy, Adversarial Robustness, Fairness, and Ethics of Low-Rank LLMs

Title: MRI-Based Brain Age Estimation with Supervised Contrastive Learning of Continuous Representation

Title: MoE3D: Mixture of Experts meets Multi-Modal 3D Understanding

Title: Energy Efficient Sleep Mode Optimization in 5G mmWave Networks via Multi Agent Deep Reinforcement Learning

Title: A Hybrid Theory and Data-driven Approach to Persuasion Detection with Large Language Models

Title: IVGAE: Handling Incomplete Heterogeneous Data with a Variational Graph Autoencoder

Title: Privacy-preserving formal concept analysis: A homomorphic encryption-based concept construction

Title: PROMPTMINER: Black-Box Prompt Stealing against Text-to-Image Generative Models via Reinforcement Learning and Fuzz Optimization

Title: Cue3D: Quantifying the Role of Image Cues in Single-Image 3D Generation

Title: A Variational Manifold Embedding Framework for Nonlinear Dimensionality Reduction

Title: Probabilistic Digital Twin for Misspecified Structural Dynamical Systems via Latent Force Modeling and Bayesian Neural Networks

Title: EASL: Multi-Emotion Guided Semantic Disentanglement for Expressive Sign Language Generation

Title: TinyLLM: Evaluation and Optimization of Small Language Models for Agentic Tasks on Edge Devices

Title: C$^2$DLM: Causal Concept-Guided Diffusion Large Language Models

Title: RemedyGS: Defend 3D Gaussian Splatting against Computation Cost Attacks

Title: From Topology to Retrieval: Decoding Embedding Spaces with Unified Signatures

Title: A Theoretically Grounded Hybrid Ensemble for Reliable Detection of LLM-Generated Text

Title: IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer

Title: Partially Shared Concept Bottleneck Models

Title: BrepGPT: Autoregressive B-rep Generation with Voronoi Half-Patch

Title: Guiding the Inner Eye: A Framework for Hierarchical and Flexible Visual Grounded Reasoning

Title: Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information

Title: Designing Instance-Level Sampling Schedules via REINFORCE with James-Stein Shrinkage

Title: Personalized 3D Spatiotemporal Trajectory Privacy Protection with Differential and Distortion Geo-Perturbation

Title: MTR-VP: Towards End-to-End Trajectory Planning through Context-Driven Image Encoding and Multiple Trajectory Prediction

Title: Shoe Style-Invariant and Ground-Aware Learning for Dense Foot Contact Estimation

Title: HybridWorldSim: A Scalable and Controllable High-fidelity Simulator for Autonomous Driving

Title: Department-Specific Security Awareness Campaigns: A Cross-Organizational Study of HR and Accounting

Title: Controllable 3D Object Generation with Single Image Prompt

Title: PULSE-ICU: A Pretrained Unified Long-Sequence Encoder for Multi-task Prediction in Intensive Care Units

Title: Real-PGDN: A Two-level Classification Method for Full-Process Recognition of Newly Registered Pornographic and Gambling Domain Names

Title: 3D-Consistent Multi-View Editing by Diffusion Guidance

Title: From Compound Figures to Composite Understanding: Developing a Multi-Modal LLM from Biomedical Literature with Medical Multiple-Image Benchmarking and Validation

Title: Bridging 3D Deep Learning and Curation for Analysis and High-Quality Segmentation in Practice

Title: Creating Blank Canvas Against AI-enabled Image Forgery

Title: TTSnap: Test-Time Scaling of Diffusion Models via Noise-Aware Pruning

Title: Semantic Anchoring for Robust Personalization in Text-to-Image Diffusion Models

Title: Toward Diffusible High-Dimensional Latent Spaces: A Frequency Perspective

Title: UMind-VL: A Generalist Ultrasound Vision-Language Model for Unified Grounded Perception and Comprehensive Interpretation

Title: Beyond Query-Level Comparison: Fine-Grained Reinforcement Learning for Text-to-SQL with Automated Interpretable Critiques

Title: Silence Speaks Volumes: A New Paradigm for Covert Communication via History Timing Patterns

Title: Can Protective Watermarking Safeguard the Copyright of 3D Gaussian Splatting?

Title: DriveVGGT: Visual Geometry Transformer for Autonomous Driving

Title: FedRE: A Representation Entanglement Framework for Model-Heterogeneous Federated Learning

Title: TreeCoder: Systematic Exploration and Optimisation of Decoding and Constraints for LLM Code Generation

Title: The Collapse of Patches

Title: The Hidden Cost of Approximation in Online Mirror Descent

Title: Adaptive tumor growth forecasting via neural & universal ODEs

Title: Structure is Supervision: Multiview Masked Autoencoders for Radiology

Title: FLUX: Efficient Descriptor-Driven Clustered Federated Learning under Arbitrary Distribution Shifts

Title: Small Object Detection for Birds with Swin Transformer

Title: Token-Level Marginalization for Multi-Label LLM Classifiers

Title: Sentiment Analysis Of Shopee Product Reviews Using Distilbert

Title: SingleQuant: Efficient Quantization of Large Language Models in a Single Pass

Title: Enhancing the Security of Rollup Sequencers using Decentrally Attested TEEs

Title: Prompt-based Consistent Video Colorization

Title: Keyless Entry: Breaking and Entering eMMC RPMB with EMFI

Title: Unexplored flaws in multiple-choice VQA evaluations

Title: Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment

Title: INSIGHT: An Interpretable Neural Vision-Language Framework for Reasoning of Generative Artifacts

Title: AnchorFlow: Training-Free 3D Editing via Latent Anchor-Aligned Flows

Title: Efficient-Husformer: Efficient Multimodal Transformer Hyperparameter Optimization for Stress and Cognitive Loads

Title: SuRe: Surprise-Driven Prioritised Replay for Continual LLM Learning

Title: Mapping Clinical Doubt: Locating Linguistic Uncertainty in LLMs

Title: UAV-MM3D: A Large-Scale Synthetic Benchmark for 3D Perception of Unmanned Aerial Vehicles with Multi-Modal Data

Title: DiffStyle360: Diffusion-Based 360° Head Stylization via Style Fusion Attention

Title: Exposing Vulnerabilities in RL: A Novel Stealthy Backdoor Attack through Reward Poisoning

Title: Extending Quantum-Safe Communications to Real-World Networks: An Adaptive Security Framework

Title: Wukong's 72 Transformations: High-fidelity Textured 3D Morphing via Flow Models

Title: Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation

Title: SkeletonAgent: An Agentic Interaction Framework for Skeleton-based Action Recognition

Title: FastFHE: Packing-Scalable and Depthwise-Separable CNN Inference Over FHE

Title: PISA: Prioritized Invariant Subgraph Aggregation

Title: ABounD: Adversarial Boundary-Driven Few-Shot Learning for Multi-Class Anomaly Detection

Title: GEO-Detective: Unveiling Location Privacy Risks in Images with LLM Agents

Title: Do You See What I Say? Generalizable Deepfake Detection based on Visual Speech Recognition

Title: Benchmarking machine learning models for multi-class state recognition in double duantum dot data

Title: Beyond Real versus Fake Towards Intent-Aware Video Analysis

Title: ITS3D: Inference-Time Scaling for Text-Guided 3D Diffusion Models

Title: Gaussians on Fire: High-Frequency Reconstruction of Flames

Title: RoadSceneBench: A Lightweight Benchmark for Mid-Level Road Scene Understanding

Title: Rethinking Cross-Generator Image Forgery Detection through DINOv3

Title: Adversarial Flow Models

Title: Enhancing Trustworthiness with Mixed Precision: Benchmarks, Opportunities, and Challenges

Title: AI killed the video star. Audio-driven diffusion model for expressive talking head generation

Title: What Shape Is Optimal for Masks in Text Removal?

Title: Joint Speech and Text Training for LLM-Based End-to-End Spoken Dialogue State Tracking

Title: Privacy-Utility-Bias Trade-offs for Privacy-Preserving Recommender Systems

Title: List-Decodable Regression via Expander Sketching

Title: CoT4AD: A Vision-Language-Action Model with Explicit Chain-of-Thought Reasoning for Autonomous Driving

Title: Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration

Title: Diff-ICMH: Harmonizing Machine and Human Vision in Image Compression with Generative Prior

Title: Bringing Your Portrait to 3D Presence

Title: Text Condition Embedded Regression Network for Automated Dental Abutment Design

Title: Smarter, not Bigger: Fine-Tuned RAG-Enhanced LLMs for Automotive HIL Testing

Title: The Multiclass Score-Oriented Loss (MultiSOL) on the Simplex

Title: AnoRefiner: Anomaly-Aware Group-Wise Refinement for Zero-Shot Industrial Anomaly Detection

Title: LLM-Cave: A benchmark and light environment for large language models reasoning and decision-making system

Title: GazeTrack: High-Precision Eye Tracking Based on Regularization and Spatial Computing

Title: MG-Nav: Dual-Scale Visual Navigation via Sparse Spatial Memory

Title: Improving LLM-based Ontology Matching with fine-tuning on synthetic data

Title: Stable-Drift: A Patient-Aware Latent Drift Replay Method for Stabilizing Representations in Continual Learning

Title: Federated Learning Survey: A Multi-Level Taxonomy of Aggregation Techniques, Experimental Insights, and Future Frontiers

Title: REASONEDIT: Towards Reasoning-Enhanced Image Editing Models

Title: Flow Density Control: Generative Optimization Beyond Entropy-Regularized Fine-Tuning

Title: GeoZero: Incentivizing Reasoning from Scratch on Geospatial Scenes

Title: Spatially Aware Dictionary-Free Eigenfunction Identification for Modeling and Control of Nonlinear Dynamical Systems

Title: Automated Design Optimization via Strategic Search with Large Language Models

Title: Modèles de Fondation et Ajustement : Vers une Nouvelle Génération de Modèles pour la Prévision des Séries Temporelles

Title: Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield

Title: CacheTrap: Injecting Trojans in LLMs without Leaving any Traces in Inputs or Weights

Title: Test-time scaling of diffusions with flow maps

Title: Ar2Can: An Architect and an Artist Leveraging a Canvas for Multi-Human Generation

Title: Generative Anchored Fields: Controlled Data Generation via Emergent Velocity Fields and Transport Algebra

Title: Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Title: Ghosting Your LLM: Without The Knowledge of Your Gradient and Data

Title: Splat-SAP: Feed-Forward Gaussian Splatting for Human-Centered Scene with Scale-Aware Point Map Reconstruction

Title: ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering

Title: All Centers Are at most a Few Tokens Apart: Knowledge Distillation with Domain Invariant Prompt Tuning

Title: VeriDispatcher: Multi-Model Dispatching through Pre-Inference Difficulty Prediction for RTL Generation Optimization

Title: MammoRGB: Dual-View Mammogram Synthesis Using Denoising Diffusion Probabilistic Models

Title: Modeling Romanized Hindi and Bengali: Dataset Creation and Multilingual LLM Integration

Title: Alzheimer's Disease Prediction Using EffNetViTLoRA and BiLSTM with Multimodal Longitudinal MRI Data

Title: World in a Frame: Understanding Culture Mixing as a New Challenge for Vision-Language Models

Title: PRISM: Privacy-Aware Routing for Adaptive Cloud-Edge LLM Inference via Semantic Sketch Collaboration

Title: An Efficient Privacy-preserving Intrusion Detection Scheme for UAV Swarm Networks

Title: GSpaRC: Gaussian Splatting for Real-time Reconstruction of RF Channels

Title: From Pixels to Feelings: Aligning MLLMs with Human Cognitive Perception of Images

Title: LC4-DViT: Land-cover Creation for Land-cover Classification with Deformable Vision Transformer

Title: Intelligent Neural Networks: From Layered Architectures to Graph-Organized Intelligence

Title: Mitigating Semantic Drift: Evaluating LLMs' Efficacy in Psychotherapy through MI Dialogue Summarization

Title: A Unified and Stable Risk Minimization Framework for Weakly Supervised Learning with Theoretical Guarantees

Title: Some Modalities are More Equal Than Others: Decoding and Architecting Multimodal Integration in MLLMs

Title: PerfMamba: Performance Analysis and Pruning of Selective State Space Models

Title: TARFVAE: Efficient One-Step Generative Time Series Forecasting via TARFLOW based VAE

Title: CRAwDAD: Causal Reasoning Augmentation with Dual-Agent Debate

Title: CoordSpeaker: Exploiting Gesture Captioning for Coordinated Caption-Empowered Co-Speech Gesture Generation

Title: JBE-QA: Japanese Bar Exam QA Dataset for Assessing Legal Domain Knowledge

Title: Scalable Diffusion Transformer for Conditional 4D fMRI Synthesis

Title: FEANEL: A Benchmark for Fine-Grained Error Analysis in K-12 English Writing

Title: Modeling Chaotic Pedestrian Behavior Using Chaos Indicators and Supervised Learning

Title: Adversarial Training for Process Reward Models

Title: ClearGCD: Mitigating Shortcut Learning For Robust Generalized Category Discovery

Title: DM$^3$T: Harmonizing Modalities via Diffusion for Multi-Object Tracking

Title: From Points to Clouds: Learning Robust Semantic Distributions for Multi-modal Prompts

Title: Leveraging Textual Compositional Reasoning for Robust Change Captioning

Title: See, Rank, and Filter: Important Word-Aware Clip Filtering via Scene Understanding for Moment Retrieval and Highlight Detection

Title: ViGG: Robust RGB-D Point Cloud Registration using Visual-Geometric Mutual Guidance

Title: EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model

Title: Robust Image Self-Recovery against Tampering using Watermark Generation with Pixel Shuffling

Title: One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfe

Title: Visual Puns from Idioms: An Iterative LLM-T2IM-MLLM Framework

Title: Do We Need Perfect Data? Leveraging Noise for Domain Generalized Segmentation

Title: RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video

Title: Experts are all you need: A Composable Framework for Large Language Model Inference

Title: Contrastive Heliophysical Image Pretraining for Solar Dynamics Observatory Records

Title: A Trainable Centrality Framework for Modern Data

Title: Taming the Light: Illumination-Invariant Semantic 3DGS-SLAM

Title: Training-Free Loosely Speculative Decoding: Accepting Semantically Correct Drafts Beyond Exact Match

Title: BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation

Title: McSc: Motion-Corrective Preference Alignment for Video Generation with Self-Critic Hierarchical Reasoning

Title: Pooling Attention: Evaluating Pretrained Transformer Embeddings for Deception Classification

Title: Ovis-Image Technical Report

Title: Convolutional Feature Noise Reduction for 2D Cardiac MR Image Segmentation

Title: MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation

Title: Guiding Visual Autoregressive Models through Spectrum Weakening

Title: Optimizer Sensitivity In Vision Transformerbased Iris Recognition: Adamw Vs Sgd Vs Rmsprop

Title: MrGS: Multi-modal Radiance Fields with 3D Gaussian Splatting for RGB-Thermal Novel View Synthesis

Title: A Modular Framework for Rapidly Building Intrusion Predictors

Title: Masked Diffusion for Generative Recommendation

Title: A Game-Theoretic Approach for Adversarial Information Fusion in Distributed Sensor Networks

Title: Delta-XAI: A Unified Framework for Explaining Prediction Changes in Online Time Series Monitoring

Title: Social Perceptions of English Spelling Variation on Twitter: A Comparative Analysis of Human and LLM Responses

Title: GOATex: Geometry & Occlusion-Aware Texturing

Title: Decoding the Past: Explainable Machine Learning Models for Dating Historical Texts

Title: Evaluating the Clinical Impact of Generative Inpainting on Bone Age Estimation

Title: Buffer replay enhances the robustness of multimodal learning under missing-modality

Title: SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models

Title: Accent Placement Models for Rigvedic Sanskrit Text

Title: Mind Reading or Misreading? LLMs on the Big Five Personality Test

Title: NumeriKontrol: Adding Numeric Control to Diffusion Transformers for Instruction-based Image Editing

Title: db-SP: Accelerating Sparse Attention for Visual Generative Models with Dual-Balanced Sequence Parallelism

Title: Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM

Title: Freeze, Diffuse, Decode: Geometry-Aware Adaptation of Pretrained Transformer Embeddings for Antimicrobial Peptide Design

Title: DNA-Prior: Unsupervised Denoise Anything via Dual-Domain Prior

Title: DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation

Title: Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models

Title: InstanceV: Instance-Level Video Generation

Title: Cascaded Robust Rectification for Arbitrary Document Images

Title: REVEAL: Reasoning-enhanced Forensic Evidence Analysis for Explainable AI-generated Image Detection

Title: Estimating the Event-Related Potential from Few EEG Trials

Title: Energy-Efficient Vision Transformer Inference for Edge-AI Deployment

Title: PowerCLIP: Powerset Alignment for Contrastive Pre-Training

Title: Fast Multi-view Consistent 3D Editing with Video Priors

Title: Are LLMs Good Safety Agents or a Propaganda Engine?

Title: Identification of Malicious Posts on the Dark Web Using Supervised Machine Learning

Title: Listwise Preference Optimization with Element-wise Confusions for Aspect Sentiment Quad Prediction

Title: Clustering Malware at Scale: A First Full-Benchmark Study

Title: Vision Bridge Transformer at Scale

Title: Quantifying the Privacy-Utility Trade-off in GPS-based Daily Stress Recognition using Semantic Features

Title: Zero-Shot Multi-Criteria Visual Quality Inspection for Semi-Controlled Industrial Environments via Real-Time 3D Digital Twin Simulation

Title: Instruction Tuning of Large Language Models for Tabular Data Generation-in One Day

Title: Robust 3DGS-based SLAM via Adaptive Kernel Smoothing

Title: DAONet-YOLOv8: An Occlusion-Aware Dual-Attention Network for Tea Leaf Pest and Disease Detection

Title: TWEO: Transformers Without Extreme Outliers Enables FP8 Training And Quantization For Dummies

Title: Unlocking Multilingual Reasoning Capability of LLMs and LVLMs through Representation Engineering

Title: SDE-Attention: Latent Attention in SDE-RNNs for Irregularly Sampled Time Series with Missing Data

Title: Towards Understanding Transformers in Learning Random Walks

Title: Synthetic Industrial Object Detection: GenAI vs. Feature-Based Methods

Title: Learning to Predict Aboveground Biomass from RGB Images with 3D Synthetic Scenes

Title: One-Shot Secure Aggregation: A Hybrid Cryptographic Protocol for Private Federated Learning in IoT

Title: BanglaSentNet: An Explainable Hybrid Deep Learning Framework for Multi-Aspect Sentiment Analysis with Cross-Domain Transfer Learning

Title: Simultaneous Image Quality Improvement and Artefacts Correction in Accelerated MRI

Title: Beyond Curve Fitting: Neuro-Symbolic Agents for Context-Aware Epidemic Forecasting

Title: MCP vs RAG vs NLWeb vs HTML: A Comparison of the Effectiveness and Efficiency of Different Agent Interfaces to the Web (Technical Report)

Title: Closing the Generalization Gap in Parameter-efficient Federated Edge Learning

Title: Transformer-Driven Triple Fusion Framework for Enhanced Multimodal Author Intent Classification in Low-Resource Bangla

Title: Machine Learning for Scientific Visualization: Ensemble Data Analysis

Title: Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models

Title: UniGeoSeg: Towards Unified Open-World Segmentation for Geospatial Scenes

Title: Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach

Title: Flow Straighter and Faster: Efficient One-Step Generative Modeling via MeanFlow on Rectified Trajectories

Title: Distributed Dynamic Associative Memory via Online Convex Optimization

Title: A Hierarchical Computer Vision Pipeline for Physiological Data Extraction from Bedside Monitors

Title: SimScale: Learning to Drive via Real-World Simulation at Scale

Title: Optimizing Multimodal Language Models through Attention-based Interpretability

Title: DEAL-300K: Diffusion-based Editing Area Localization with a 300K-Scale Dataset and Frequency-Prompted Baseline

Title: Learning-Augmented Online Bipartite Matching in the Random Arrival Order Model

Title: FedSGT: Exact Federated Unlearning via Sequential Group-based Training

Title: Quantized-Tinyllava: a new multimodal foundation model enables efficient split learning

Title: MANTA: Physics-Informed Generalized Underwater Object Tracking

Title: Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities

Title: Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model

Title: ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts

Title: Physics-Informed Neural Networks for Thermophysical Property Retrieval

Title: Object-Centric Data Synthesis for Category-level Object Detection

Title: SmallWorlds: Assessing Dynamics Understanding of World Models in Isolated Environments

Title: Visual Generation Tuning

Title: ThetaEvolve: Test-time Learning on Open Problems

Title: AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement

Title: Video-CoM: Interactive Video Reasoning via Chain of Manipulations

Title: Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models