2024-05-30

Title: Towards Open Domain Text-Driven Synthesis of Multi-Person Motions

Title: Anomaly detection for the identification of volcanic unrest in satellite imagery

Title: Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication

Title: Scalable Surrogate Verification of Image-based Neural Network Control Systems using Composition and Unrolling

Title: PureGen: Universal Data Purification for Train-Time Poison Defense via Generative Model Dynamics

Title: A Theoretical Understanding of Self-Correction through In-context Alignment

Title: When and How Does In-Distribution Label Help Out-of-Distribution Detection?

Title: ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models

Title: Mitigating Object Hallucination via Data Augmented Contrastive Tuning

Title: Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities

Title: Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering

Title: Learning Diffeomorphism for Image Registration with Time-Continuous Networks using Semigroup Regularization

Title: Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction

Title: SketchDeco: Decorating B&W Sketches with Colour

Title: Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning

Title: T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback

Title: Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI

Title: Leveraging Many-To-Many Relationships for Defending Against Visual-Language Adversarial Attacks

Title: On the Role of Attention Masks and LayerNorm in Transformers

Title: SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation

Title: MindSemantix: Deciphering Brain Visual Experiences with a Brain-Language Model

Title: Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching

Title: Evaluating Zero-Shot GPT-4V Performance on 3D Visual Question Answering Benchmarks

Title: MEGA: Masked Generative Autoencoder for Human Mesh Recovery

Title: Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation

Title: Anomaly Detection by Context Contrasting

Title: Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization

Title: Leveraging Time-Series Foundation Models in Smart Agriculture for Soil Moisture Forecasting

Title: Federated Continual Learning Goes Online: Leveraging Uncertainty for Modality-Agnostic Class-Incremental Learning

Title: ParsEval: Evaluation of Parsing Behavior using Real-world Out-in-the-wild X.509 Certificates

Title: Enhancing Vision-Language Model with Unmasked Token Alignment

Title: Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and Design

Title: Faithful Chart Summarization with ChaTS-Pi

Title: Poseidon: Efficient Foundation Models for PDEs

Title: PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering

Title: A Study of Plasticity Loss in On-Policy Deep Reinforcement Learning

Title: Does learning the right latent variables necessarily improve in-context learning?

Title: Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning

Title: Going beyond compositional generalization, DDPMs can produce zero-shot interpolation

Title: $E^{3}$Gen: Efficient, Expressive and Editable Avatars Generation

Title: ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning

Title: Weak Generative Sampler to Efficiently Sample Invariant Distribution of Stochastic Differential Equation

Title: Programmable Motion Generation for Open-Set Motion Control Tasks

Title: Neural Isometries: Taming Transformations for Equivariant ML

Title: Nearest Neighbor Speculative Decoding for LLM Generation and Attribution

Title: Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models

Title: Multi-Modal Generative Embedding Model

Title: X-VILA: Cross-Modality Alignment for Large Language Model