2025-12-01

Title: Addressing Stereotypes in Large Language Models: A Critical Examination and Mitigation

Title: HUMORCHAIN: Theory-Guided Multi-Stage Reasoning for Interpretable Multimodal Humor Generation

Title: EduMod-LLM: A Modular Approach for Designing Flexible and Transparent Educational Assistants

Title: DELTA: Language Diffusion-based EEG-to-Text Architecture

Title: Proactive Defense: Compound AI for Detecting Persuasion Attacks and Measuring Inoculation Effectiveness

Title: Orchestrating Dual-Boundaries: An Arithmetic Intensity Inspired Acceleration Framework for Diffusion Language Models

Title: fMRI-LM: Towards a Universal Foundation Model for Language-Aligned fMRI Understanding

Title: Advanced Data Collection Techniques in Cloud Security: A Multi-Modal Deep Learning Autoencoder Approach

Title: Unsupervised Anomaly Detection for Smart IoT Devices: Performance and Resource Comparison

Title: Towards a Foundation Model for Partial Differential Equations Across Physics Domains

Title: Saddle-Free Guidance: Improved On-Manifold Sampling without Labels or Additional Training

Title: Closed-Loop Transformers: Autoregressive Modeling as Iterative Latent Equilibrium

Title: UniArt: Unified 3D Representation for Generating 3D Articulated Objects with Open-Set Articulation

Title: Breaking the Illusion: Consensus-Based Generative Mitigation of Adversarial Illusions in Multi-Modal Embeddings

Title: Prompted Policy Search: Reinforcement Learning through Linguistic and Numerical Reasoning in LLMs

Title: Modeling Quantum Autoencoder Trainable Kernel for IoT Anomaly Detection

Title: AmodalGen3D: Generative Amodal 3D Object Reconstruction from Sparse Unposed Views

Title: WalkCLIP: Multimodal Learning for Urban Walkability Prediction

Title: DialBench: Towards Accurate Reading Recognition of Pointer Meter using Large Foundation Models

Title: PPBoost: Progressive Prompt Boosting for Text-Driven Medical Image Segmentation

Title: StreamFlow: Theory, Algorithm, and Implementation for High-Efficiency Rectified Flow Generation

Title: ICM-SR: Image-Conditioned Manifold Regularization for Image Super-Resolution

Title: Convergence Dynamics of Over-Parameterized Score Matching for a Single Gaussian

Title: ARES: Anomaly Recognition Model For Edge Streams

Title: WorldWander: Bridging Egocentric and Exocentric Worlds in Video Generation

Title: MRI-Based Brain Age Estimation with Supervised Contrastive Learning of Continuous Representation

Title: PROMPTMINER: Black-Box Prompt Stealing against Text-to-Image Generative Models via Reinforcement Learning and Fuzz Optimization

Title: Cue3D: Quantifying the Role of Image Cues in Single-Image 3D Generation

Title: Benchmarking In-context Experiential Learning Through Repeated Product Recommendations

Title: Autonomous labeling of surgical resection margins using a foundation model

Title: EASL: Multi-Emotion Guided Semantic Disentanglement for Expressive Sign Language Generation

Title: C$^2$DLM: Causal Concept-Guided Diffusion Large Language Models

Title: Real-Time Long Horizon Air Quality Forecasting via Group-Relative Policy Optimization

Title: BrepGPT: Autoregressive B-rep Generation with Voronoi Half-Patch

Title: Designing Instance-Level Sampling Schedules via REINFORCE with James-Stein Shrinkage

Title: HybridWorldSim: A Scalable and Controllable High-fidelity Simulator for Autonomous Driving

Title: Controllable 3D Object Generation with Single Image Prompt

Title: PULSE-ICU: A Pretrained Unified Long-Sequence Encoder for Multi-task Prediction in Intensive Care Units

Title: 3D-Consistent Multi-View Editing by Diffusion Guidance

Title: Bridging 3D Deep Learning and Curation for Analysis and High-Quality Segmentation in Practice

Title: TTSnap: Test-Time Scaling of Diffusion Models via Noise-Aware Pruning

Title: Semantic Anchoring for Robust Personalization in Text-to-Image Diffusion Models

Title: Toward Diffusible High-Dimensional Latent Spaces: A Frequency Perspective

Title: UMind-VL: A Generalist Ultrasound Vision-Language Model for Unified Grounded Perception and Comprehensive Interpretation

Title: Beyond Query-Level Comparison: Fine-Grained Reinforcement Learning for Text-to-SQL with Automated Interpretable Critiques

Title: Structure is Supervision: Multiview Masked Autoencoders for Radiology

Title: Token-Level Marginalization for Multi-Label LLM Classifiers

Title: Prompt-based Consistent Video Colorization

Title: Test Time Training for AC Power Flow Surrogates via Physics and Operational Constraint Refinement

Title: Cleaning the Pool: Progressive Filtering of Unlabeled Pools in Deep Active Learning

Title: Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment

Title: INSIGHT: An Interpretable Neural Vision-Language Framework for Reasoning of Generative Artifacts

Title: AnchorFlow: Training-Free 3D Editing via Latent Anchor-Aligned Flows

Title: TS2Vec-Ensemble: An Enhanced Self-Supervised Framework for Time Series Forecasting

Title: DiffStyle360: Diffusion-Based 360° Head Stylization via Style Fusion Attention

Title: Wukong's 72 Transformations: High-fidelity Textured 3D Morphing via Flow Models

Title: ABounD: Adversarial Boundary-Driven Few-Shot Learning for Multi-Class Anomaly Detection

Title: Beyond Real versus Fake Towards Intent-Aware Video Analysis

Title: ITS3D: Inference-Time Scaling for Text-Guided 3D Diffusion Models

Title: Hybrid, Unified and Iterative: A Novel Framework for Text-based Person Anomaly Retrieval

Title: Rethinking Cross-Generator Image Forgery Detection through DINOv3

Title: Adversarial Flow Models

Title: AI killed the video star. Audio-driven diffusion model for expressive talking head generation

Title: What Shape Is Optimal for Masks in Text Removal?

Title: Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration

Title: Diff-ICMH: Harmonizing Machine and Human Vision in Image Compression with Generative Prior

Title: Bringing Your Portrait to 3D Presence

Title: Text Condition Embedded Regression Network for Automated Dental Abutment Design

Title: AnoRefiner: Anomaly-Aware Group-Wise Refinement for Zero-Shot Industrial Anomaly Detection

Title: REASONEDIT: Towards Reasoning-Enhanced Image Editing Models

Title: Flow Density Control: Generative Optimization Beyond Entropy-Regularized Fine-Tuning

Title: Modèles de Fondation et Ajustement : Vers une Nouvelle Génération de Modèles pour la Prévision des Séries Temporelles [Foundation Models and Fine-Tuning: Toward a New Generation of Models for Time Series Forecasting]

Title: Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield

Title: Emergent Extreme-View Geometry in 3D Foundation Models

Title: Test-time scaling of diffusions with flow maps

Title: Ar2Can: An Architect and an Artist Leveraging a Canvas for Multi-Human Generation

Title: Generative Anchored Fields: Controlled Data Generation via Emergent Velocity Fields and Transport Algebra

Title: Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Title: Splat-SAP: Feed-Forward Gaussian Splatting for Human-Centered Scene with Scale-Aware Point Map Reconstruction

Title: MammoRGB: Dual-View Mammogram Synthesis Using Denoising Diffusion Probabilistic Models

Title: Alzheimer's Disease Prediction Using EffNetViTLoRA and BiLSTM with Multimodal Longitudinal MRI Data

Title: World in a Frame: Understanding Culture Mixing as a New Challenge for Vision-Language Models

Title: LC4-DViT: Land-cover Creation for Land-cover Classification with Deformable Vision Transformer

Title: TARFVAE: Efficient One-Step Generative Time Series Forecasting via TARFLOW based VAE

Title: CoordSpeaker: Exploiting Gesture Captioning for Coordinated Caption-Empowered Co-Speech Gesture Generation

Title: Scalable Diffusion Transformer for Conditional 4D fMRI Synthesis

Title: DM$^3$T: Harmonizing Modalities via Diffusion for Multi-Object Tracking

Title: From Points to Clouds: Learning Robust Semantic Distributions for Multi-modal Prompts

Title: EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model

Title: One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer

Title: Do We Need Perfect Data? Leveraging Noise for Domain Generalized Segmentation

Title: RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video

Title: BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation

Title: McSc: Motion-Corrective Preference Alignment for Video Generation with Self-Critic Hierarchical Reasoning

Title: Ovis-Image Technical Report

Title: Guiding Visual Autoregressive Models through Spectrum Weakening

Title: Masked Diffusion for Generative Recommendation

Title: GOATex: Geometry & Occlusion-Aware Texturing

Title: Evaluating the Clinical Impact of Generative Inpainting on Bone Age Estimation

Title: NumeriKontrol: Adding Numeric Control to Diffusion Transformers for Instruction-based Image Editing

Title: db-SP: Accelerating Sparse Attention for Visual Generative Models with Dual-Balanced Sequence Parallelism

Title: Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM

Title: Freeze, Diffuse, Decode: Geometry-Aware Adaptation of Pretrained Transformer Embeddings for Antimicrobial Peptide Design

Title: DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation

Title: InstanceV: Instance-Level Video Generation

Title: REVEAL: Reasoning-enhanced Forensic Evidence Analysis for Explainable AI-generated Image Detection

Title: Fast Multi-view Consistent 3D Editing with Video Priors

Title: Vision Bridge Transformer at Scale

Title: Pathryoshka: Compressing Pathology Foundation Models via Multi-Teacher Knowledge Distillation with Nested Embeddings

Title: Tourism Question Answer System in Indian Language using Domain-Adapted Foundation Models

Title: Synthetic Industrial Object Detection: GenAI vs. Feature-Based Methods

Title: Time Series Forecasting via Direct Per-Step Probability Distribution Modeling

Title: Beyond Curve Fitting: Neuro-Symbolic Agents for Context-Aware Epidemic Forecasting

Title: Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models

Title: Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach

Title: Flow Straighter and Faster: Efficient One-Step Generative Modeling via MeanFlow on Rectified Trajectories

Title: Scaling HuBERT for African Languages: From Base to Large and XL

Title: DEAL-300K: Diffusion-based Editing Area Localization with a 300K-Scale Dataset and Frequency-Prompted Baseline

Title: VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction

Title: Quantized-Tinyllava: a new multimodal foundation model enables efficient split learning

Title: LFM2 Technical Report

Title: Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model

Title: ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts

Title: Physics-Informed Neural Networks for Thermophysical Property Retrieval

Title: Object-Centric Data Synthesis for Category-level Object Detection

Title: SmallWorlds: Assessing Dynamics Understanding of World Models in Isolated Environments

Title: Visual Generation Tuning

Title: ThetaEvolve: Test-time Learning on Open Problems

Title: AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement