2025-04-22

Title: Enhancing Ultra-Low-Bit Quantization of Large Language Models Through Saliency-Aware Partial Retraining

Title: NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning

Title: Open-Medical-R1: How to Choose Data for RLVR Training at Medicine Domain

Title: Generative System Dynamics in Recurrent Neural Networks

Title: Prognosis Of Lithium-Ion Battery Health with Hybrid EKF-CNN+LSTM Model Using Differential Capacity

Title: ToolRL: Reward is All Tool Learning Needs

Title: CONTINA: Confidence Interval for Traffic Demand Prediction with Coverage Guarantee

Title: Adversarial Resilience against Clean-Label Attacks in Realizable and Noisy Settings

Title: Multiscale Tensor Summation Factorization as a New Neural Network Layer (MTS Layer) for Multidimensional Data Processing

Title: CacheFormer: High Attention-Based Segment Caching

Title: QuatE-D: A Distance-Based Quaternion Model for Knowledge Graph Embedding

Title: One Jump Is All You Need: Short-Cutting Transformers for Early Exit Prediction with One Jump to Fit All Exit Levels

Title: Entropy Rectifying Guidance for Diffusion and Flow Models

Title: Gradual Binary Search and Dimension Expansion : A general method for activation quantization in LLMs

Title: PC-DeepNet: A GNSS Positioning Error Minimization Framework Using Permutation-Invariant Deep Neural Network

Title: Scaling LLaNA: Advancing NeRF-Language Understanding Through Large-Scale Training

Title: Fashion-RAG: Multimodal Fashion Image Editing via Retrieval-Augmented Generation

Title: Post Quantum Cryptography (PQC) Signatures Without Trapdoors

Title: Large Language Bayes

Title: LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models

Title: MEQA: A Meta-Evaluation Framework for Question & Answer LLM Benchmarks

Title: A synthetic dataset of French electric load curves with temperature conditioning

Title: CAOTE: KV Caching through Attention Output Error based Token Eviction

Title: Occlusion-Ordered Semantic Instance Segmentation

Title: Benchmarking Differentially Private Tabular Data Synthesis

Title: DoomArena: A framework for Testing AI Agents Against Evolving Security Threats

Title: Contextual Embedding-based Clustering to Identify Topics for Healthcare Service Improvement

Title: Towards Scale-Aware Low-Light Enhancement via Structure-Guided Transformer Design

Title: LogicTree: Structured Proof Exploration for Coherent and Rigorous Logical Reasoning with Large Language Models

Title: Retinex-guided Histogram Transformer for Mask-free Shadow Removal

Title: Leakage and Interpretability in Concept-Based Models

Title: VideoPASTA: 7K Preference Pairs That Matter for Video-LLM Alignment

Title: Point-Driven Interactive Text and Image Layer Editing Using Diffusion Models

Title: Lightweight Road Environment Segmentation using Vector Quantization

Title: PEFT A2Z: Parameter-Efficient Fine-Tuning Survey for Large Language and Vision Models

Title: Detecting Zero-Day Web Attacks with an Ensemble of LSTM, GRU, and Stacked Autoencoders

Title: BMRL: Bi-Modal Guided Multi-Perspective Representation Learning for Zero-Shot Deepfake Attribution

Title: HFBRI-MAE: Handcrafted Feature Based Rotation-Invariant Masked Autoencoder for 3D Point Cloud Analysis

Title: Rethinking Target Label Conditioning in Adversarial Attacks: A 2D Tensor-Guided Generative Approach

Title: Segment Any Crack: Deep Semantic Segmentation Adaptation for Crack Detection

Title: ThyroidEffi 1.0: A Cost-Effective System for High-Performance Multi-Class Thyroid Carcinoma Classification

Title: Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations

Title: Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D

Title: SConU: Selective Conformal Uncertainty in Large Language Models

Title: ROFBS$α$: Real Time Backup System Decoupled from ML Based Ransomware Detection

Title: Self-Correction Makes LLMs Better Parsers

Title: A Physics-guided Multimodal Transformer Path to Weather and Climate Sciences

Title: Hypothetical Documents or Knowledge Leakage? Rethinking LLM-based Query Expansion

Title: Segregation and Context Aggregation Network for Real-time Cloud Segmentation

Title: FedC4: Graph Condensation Meets Client-Client Collaboration for Efficient and Private Federated Graph Learning

Title: Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models

Title: Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis

Title: DConAD: A Differencing-based Contrastive Representation Learning Framework for Time Series Anomaly Detection

Title: Decomposition-based multi-scale transformer framework for time series anomaly detection

Title: Bias Analysis and Mitigation through Protected Attribute Detection and Regard Classification

Title: Understanding the Repeat Curse in Large Language Models from a Feature Perspective

Title: From Cyber Security Incident Management to Cyber Security Crisis Management in the European Union

Title: Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection

Title: SimplifyMyText: An LLM-Based System for Inclusive Plain Language Text Simplification

Title: Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale

Title: Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation

Title: A Novel Frequency-Spatial Domain Aware Network for Fast Thermal Prediction in 2.5D ICs

Title: Single Document Image Highlight Removal via A Large-Scale Real-World Dataset and A Location-Aware Network

Title: Towards Explainable Fake Image Detection with Multi-Modal Large Language Models

Title: Any Image Restoration via Efficient Spatial-Frequency Degradation Adaptation

Title: ColorVein: Colorful Cancelable Vein Biometrics

Title: Visual Consensus Prompting for Co-Salient Object Detection

Title: Cross-attention for State-based model RWKV-7

Title: Generative emulation of chaotic dynamics with coherent prior

Title: Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction

Title: Mixed-Precision Conjugate Gradient Solvers with RL-Driven Precision Tuning

Title: RAMCT: Novel Region-adaptive Multi-channel Tracker with Iterative Tikhonov Regularization for Thermal Infrared Tracking

Title: CLIP-Powered Domain Generalization and Domain Adaptation: A Comprehensive Survey

Title: SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM

Title: Probing the Subtle Ideological Manipulation of Large Language Models

Title: From Missing Pieces to Masterpieces: Image Completion with Context-Adaptive Diffusion

Title: Learning and Generating Diverse Residential Load Patterns Using GAN with Weakly-Supervised Training and Weight Selection

Title: Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization

Title: FGSGT: Saliency-Guided Siamese Network Tracker Based on Key Fine-Grained Feature Information for Thermal Infrared Target Tracking

Title: DCFG: Diverse Cross-Channel Fine-Grained Feature Learning and Progressive Fusion Siamese Tracker for Thermal Infrared Target Tracking

Title: ScaloWork: Useful Proof-of-Work with Distributed Pool Mining

Title: Visual Prompting for One-shot Controllable Video Editing without Inversion

Title: Multispectral airborne laser scanning for tree species classification: a benchmark of machine learning and deep learning algorithms

Title: Manipulating Multimodal Agents via Cross-Modal Prompt Injection

Title: Improving RL Exploration for LLM Reasoning through Retrospective Replay

Title: Accelerating LLM Inference with Flexible N:M Sparsity via A Fully Digital Compute-in-Memory Accelerator

Title: Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models

Title: Diverse Prompts: Illuminating the Prompt Space of Large Language Models with MAP-Elites

Title: Do You Really Need Public Data? Surrogate Public Data for Differential Privacy on Tabular Data

Title: Efficient Spiking Point Mamba for Point Cloud Analysis

Title: Bottom-Up Synthesis of Knowledge-Grounded Task-Oriented Dialogues with Iteratively Self-Refined Prompts

Title: Publicly Verifiable Secret Sharing: Generic Constructions and Lattice-Based Instantiations in the Standard Model

Title: LOOPE: Learnable Optimal Patch Order in Positional Embeddings for Vision Transformers

Title: Balancing Fairness and Performance in Healthcare AI: A Gradient Reconciliation Approach

Title: Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models

Title: SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation

Title: Exploring Pseudo-Token Approaches in Transformer Neural Processes

Title: How Do Mobile Applications Enhance Security? An Exploratory Analysis of Use Cases and Provided Information

Title: Adversarial Attack for RGB-Event based Visual Object Tracking

Title: ResNetVLLM-2: Addressing ResNetVLLM's Multi-Modal Hallucinations

Title: ResNetVLLM -- Multi-modal Vision LLM for the Video Understanding Task

Title: Application of Deep Reinforcement Learning for Intrusion Detection in Internet of Things: A Systematic Review

Title: LoRe: Personalizing LLMs via Low-Rank Reward Modeling

Title: WT-BCP: Wavelet Transform based Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation

Title: Neglected Risks: The Disturbing Reality of Children's Images in Datasets and the Urgent Call for Accountability

Title: Causal Disentanglement for Robust Long-tail Medical Image Generation

Title: ParaPO: Aligning Language Models to Reduce Verbatim Reproduction of Pre-training Data

Title: CoLoTa: A Dataset for Entity-based Commonsense Reasoning over Long-Tail Knowledge

Title: LGD: Leveraging Generative Descriptions for Zero-Shot Referring Image Segmentation

Title: Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis

Title: Vision-Centric Representation-Efficient Fine-Tuning for Robust Universal Foreground Segmentation

Title: STARS: Sparse Learning Correlation Filter with Spatio-temporal Regularization and Super-resolution Reconstruction for Thermal Infrared Target Tracking

Title: FairSteer: Inference Time Debiasing for LLMs with Dynamic Activation Steering

Title: Functional Abstraction of Knowledge Recall in Large Language Models

Title: Fast Plaintext-Ciphertext Matrix Multiplication from Additively Homomorphic Encryption

Title: Less is More: Adaptive Coverage for Synthetic Training Data

Title: DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning

Title: On Dimension-Free Transformer: An Application of STP to AI

Title: Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction

Title: SlimPipe: Memory-Thrifty and Efficient Pipeline Parallelism for Long-Context LLM Training

Title: Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding

Title: Causality for Natural Language Processing

Title: SUDO: Enhancing Text-to-Image Diffusion Models with Self-Supervised Direct Preference Optimization

Title: FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models

Title: BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation

Title: Towards Model Resistant to Transferable Adversarial Examples via Trigger Activation

Title: VGNC: Reducing the Overfitting of Sparse-view 3DGS via Validation-guided Gaussian Number Control

Title: REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models

Title: SMTT: Novel Structured Multi-task Tracking with Graph-Regularized Sparse Representation for Robust Thermal Infrared Target Tracking

Title: NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models

Title: Using street view imagery and deep generative modeling for estimating the health of urban forests

Title: Generative Auto-Bidding with Value-Guided Explorations

Title: a1: Steep Test-time Scaling Law via Environment Augmented Generation

Title: MP-Mat: A 3D-and-Instance-Aware Human Matting and Editing Framework with Multiplane Representation

Title: No Imputation of Missing Values In Tabular Data Classification Using Incremental Learning

Title: VM-BHINet:Vision Mamba Bimanual Hand Interaction Network for 3D Interacting Hand Mesh Recovery From a Single RGB Image

Title: Translation Analytics for Freelancers: I. Introduction, Data Preparation, Baseline Evaluations

Title: A Hierarchical Framework for Measuring Scientific Paper Innovation via Large Language Models

Title: Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts

Title: MSAD-Net: Multiscale and Spatial Attention-based Dense Network for Lung Cancer Classification

Title: Harnessing Generative LLMs for Enhanced Financial Event Entity Extraction Performance

Title: NVSMask3D: Hard Visual Prompting with Camera Pose Interpolation for 3D Open Vocabulary Instance Segmentation

Title: Relation-R1: Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relational Comprehension

Title: Surrogate Fitness Metrics for Interpretable Reinforcement Learning

Title: BLACKOUT: Data-Oblivious Computation with Blinded Capabilities

Title: LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs

Title: A Case Study Exploring the Current Landscape of Synthetic Medical Record Generation with Commercial LLMs

Title: EmoSEM: Segment and Explain Emotion Stimuli in Visual Art

Title: Frequency-domain Learning with Kernel Prior for Blind Image Deblurring

Title: Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens

Title: Efficient Federated Split Learning for Large Language Models over Communication Networks

Title: Trans-Zero: Self-Play Incentivizes Large Language Models for Multilingual Translation Without Parallel Data

Title: Evaluating Temporal Plasticity in Foundation Time Series Models for Incremental Fine-tuning

Title: Seurat: From Moving Points to Depth

Title: FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models

Title: Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark

Title: Learning Critically: Selective Self Distillation in Federated Learning on Non-IID Data

Title: Quantitative Clustering in Mean-Field Transformer Models

Title: Connecting Parameter Magnitudes and Hessian Eigenspaces at Scale using Sketched Methods

Title: Evaluating BERTopic on Open-Ended Data: A Case Study with Belgian Dutch Daily Narratives

Title: Time Frequency Analysis of EMG Signal for Gesture Recognition using Fine grained Features

Title: Med-2D SegNet: A Light Weight Deep Neural Network for Medical 2D Image Segmentation

Title: Pairwise or Pointwise? Evaluating Feedback Protocols for Bias in LLM-Based Evaluation

Title: TAPIP3D: Tracking Any Point in Persistent 3D Geometry

Title: SuperCL: Superpixel Guided Contrastive Learning for Medical Image Segmentation Pre-training

Title: PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines

Title: AltGDmin: Alternating GD and Minimization for Partly-Decoupled (Federated) Optimization

Title: Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches

Title: Establishing Workload Identity for Zero Trust CI/CD: From Secrets to SPIFFE-Based Authentication

Title: Decoupling Identity from Access: Credential Broker Patterns for Secure CI/CD

Title: A Combinatorial Theory of Dropout: Subnetworks, Graph Geometry, and Generalization

Title: Disentangling Linguistic Features with Dimension-Wise Analysis of Vector Embeddings

Title: Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions

Title: Novel Concept-Oriented Synthetic Data approach for Training Generative AI-Driven Crystal Grain Analysis Using Diffusion Model

Title: How Effective Can Dropout Be in Multiple Instance Learning ?

Title: When Cloud Removal Meets Diffusion Model in Remote Sensing

Title: Enhanced Data-driven Topology Design Methodology with Multi-level Mesh and Correlation-based Mutation for Stress-related Multi-objective Optimization

Title: Edge-boosted graph learning for functional brain connectivity analysis

Title: Verifying Robust Unlearning: Probing Residual Knowledge in Unlearned Models

Title: Automatic Evaluation Metrics for Document-level Translation: Overview, Challenges and Trends

Title: Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment

Title: On Self-improving Token Embeddings

Title: vApps: Verifiable Applications at Internet Scale

Title: CSI2Dig: Recovering Digit Content from Smartphone Loudspeakers Using Channel State Information

Title: A Basic Evaluation of Neural Networks Trained with the Error Diffusion Learning Algorithm

Title: What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale

Title: ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages

Title: Distribution-aware Dataset Distillation for Efficient Image Restoration

Title: Protecting Your Voice: Temporal-aware Robust Watermarking

Title: Reliable Multi-Modal Object Re-Identification via Modality-Aware Graph Reasoning

Title: Object-Level Verbalized Confidence Calibration in Vision-Language Models via Semantic Perturbation

Title: Transparentize the Internal and External Knowledge Utilization in LLMs with Trustworthy Citation

Title: Natural Fingerprints of Large Language Models

Title: Collaborative Enhancement Network for Low-quality Multi-spectral Vehicle Re-identification

Title: Impact of Latent Space Dimension on IoT Botnet Detection Performance: VAE-Encoder Versus ViT-Encoder

Title: Towards Fuzzing Zero-Knowledge Proof Circuits (Short Paper)

Title: Some Optimizers are More Equal: Understanding the Role of Optimizers in Group Fairness

Title: Zero Day Malware Detection with Alpha: Fast DBI with Transformer Models for Real World Application

Title: WMKA-Net: A Weighted Multi-Kernel Attention NetworkMethod for Retinal Vessel Segmentation

Title: Latent Bayesian Optimization via Autoregressive Normalizing Flows

Title: Retrieval Augmented Generation Evaluation in the Era of Large Language Models: A Comprehensive Survey

Title: Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation

Title: CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs

Title: POLYRAG: Integrating Polyviews into Retrieval-Augmented Generation for Medical Applications

Title: GenCLIP: Generalizing CLIP Prompts for Zero-shot Anomaly Detection

Title: Fast Adversarial Training with Weak-to-Strong Spatial-Temporal Consistency in the Frequency Domain on Videos

Title: TWIG: Two-Step Image Generation using Segmentation Masks in Diffusion Models

Title: Causal DAG Summarization (Full Version)

Title: PIV-FlowDiffuser:Transfer-learning-based denoising diffusion models for PIV

Title: Efficient Document Retrieval with G-Retriever

Title: MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core

Title: Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues

Title: A Security Framework for General Blockchain Layer 2 Protocols

Title: Evaluating LLMs on Chinese Topic Constructions: A Research Proposal Inspired by Tian et al. (2024)

Title: Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization

Title: aiXamine: LLM Safety and Security Simplified

Title: Efficient Pretraining Length Scaling

Title: Dual Utilization of Perturbation for Stream Data Publication under Local Differential Privacy

Title: Insert Anything: Image Insertion via In-Context Editing in DiT

Title: Stay Hungry, Stay Foolish: On the Extended Reading Articles Generation with LLMs

Title: Gaussian Shading++: Rethinking the Realistic Deployment Challenge of Performance-Lossless Image Watermark for Diffusion Models

Title: DistilQwen2.5: Industrial Practices of Training Distilled Open Lightweight Language Models

Title: DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation

Title: SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation

Title: A Call for New Recipes to Enhance Spatial Reasoning in MLLMs

Title: RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search

Title: ScanEdit: Hierarchically-Guided Functional 3D Scan Editing

Title: Testing LLMs' Capabilities in Annotating Translations Based on an Error Typology Designed for LSP Translation: First Experiments with ChatGPT

Title: Structure-guided Diffusion Transformer for Low-Light Image Enhancement

Title: Mining Characteristics of Vulnerable Smart Contracts Across Lifecycle Stages

Title: Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL

Title: Federated Latent Factor Model for Bias-Aware Recommendation with Privacy-Preserving

Title: Rethinking the Potential of Multimodality in Collaborative Problem Solving Diagnosis with Large Language Models

Title: VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation

Title: Fast-Slow Co-advancing Optimizer: Toward Harmonious Adversarial Training of GAN

Title: Kuwain 1.5B: An Arabic SLM via Language Injection

Title: Robust and Real-time Surface Normal Estimation from Stereo Disparities using Affine Transformations

Title: MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video

Title: EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models

Title: GIFDL: Generated Image Fluctuation Distortion Learning for Enhancing Steganographic Security

Title: Landmark-Free Preoperative-to-Intraoperative Registration in Laparoscopic Liver Resection

Title: Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration

Title: The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks

Title: Survey of Loss Augmented Knowledge Tracing

Title: An Efficient Aerial Image Detection with Variable Receptive Fields

Title: Audio-Visual Class-Incremental Learning for Fish Feeding intensity Assessment in Aquaculture

Title: DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution

Title: FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image

Title: Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform

Title: Automated Measurement of Eczema Severity with Self-Supervised Learning

Title: Extending the ElGamal Cryptosystem to the Third Group of Units of $\Z_{n}$

Title: Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges

Title: How Global Calibration Strengthens Multiaccuracy

Title: Compute-Optimal LLMs Provably Generalize Better With Scale

Title: A Causal Convolutional Low-rank Representation Model for Imputation of Water Quality Data

Title: EvalAgent: Discovering Implicit Evaluation Criteria from the Web

Title: A Deep Learning Framework for Sequence Mining with Bidirectional LSTM and Multi-Scale Attention

Title: A Review on Privacy in DAG-Based DLTs

Title: Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions

Title: Conformalized-KANs: Uncertainty Quantification with Coverage Guarantees for Kolmogorov-Arnold Networks (KANs) in Scientific Machine Learning

Title: MR. Guard: Multilingual Reasoning Guardrail using Curriculum Learning

Title: Single-loop Algorithms for Stochastic Non-convex Optimization with Weakly-Convex Constraints

Title: A Refreshment Stirred, Not Shaken (III): Can Swapping Be Differentially Private?

Title: Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators

Title: Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation

Title: Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

Title: Diffusion Bridge Models for 3D Medical Image Translation

Title: Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Title: VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Title: Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs

Title: StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians