2024-12-04

Title: MALT: Improving Reasoning with Multi-Agent LLM Training

Title: A Novel Generative Multi-Task Representation Learning Approach for Predicting Postoperative Complications in Cardiac Surgery Patients

Title: Free Process Rewards without Process Labels

Title: HybridMQA: Exploring Geometry-Texture Interactions for Colored Mesh Quality Assessment

Title: NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training

Title: GNN-based Auto-Encoder for Short Linear Block Codes: A DRL Approach

Title: CLERF: Contrastive LEaRning for Full Range Head Pose Estimation

Title: AccDiffusion v2: Towards More Accurate Higher-Resolution Diffusion Extrapolation

Title: Evaluating the Impact of Data Augmentation on Predictive Model Performance

Title: OmniCreator: Self-Supervised Unified Generation with Universal Editing

Title: Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis

Title: LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models

Title: 3D representation in 512-Byte: Variational tokenizer is the key for autoregressive 3D generation

Title: An Automated Data Mining Framework Using Autoencoders for Feature Extraction and Dimensionality Reduction

Title: CubeFormer: A Simple yet Effective Baseline for Lightweight Image Super-Resolution

Title: Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models

Title: Fast LiDAR Data Generation with Rectified Flows

Title: VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation

Title: Diffusion Implicit Policy for Unpaired Scene-aware Motion Synthesis

Title: Composing Open-domain Vision with RAG for Ocean Monitoring and Conservation

Title: Sustainable Self-evolution Adversarial Training

Title: PCIM: Learning Pixel Attributions via Pixel-wise Channel Isolation Mixing in High Content Imaging

Title: GQWformer: A Quantum-based Transformer for Graph Representation Learning

Title: Viewpoint Consistency in 3D Generation via Attention and CLIP Guidance

Title: Enhanced Photovoltaic Power Forecasting: An iTransformer and LSTM-Based Model Integrating Temporal and Covariate Interactions

Title: HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset

Title: Controlling the Latent Diffusion Model for Generative Image Shadow Removal via Residual Generation

Title: SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models

Title: Amodal Depth Anything: Amodal Depth Estimation in the Wild

Title: GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing

Title: DP-2Stage: Adapting Language Models as Differentially Private Tabular Data Generators

Title: WEM-GAN: Wavelet transform based facial expression manipulation

Title: Unveiling Concept Attribution in Diffusion Models

Title: OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Title: Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback

Title: Continual Learning of Personalized Generative Face Models with Experience Replay

Title: Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation

Title: AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction

Title: SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance

Title: FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation

Title: Taming Scalable Visual Tokenizer for Autoregressive Image Generation

Title: Diffusion-based Visual Anagram as Multi-task Learning

Title: Motion Prompting: Controlling Video Generation with Motion Trajectories