2024-12-13

Title: A feature refinement module for light-weight semantic segmentation network

Title: A Deep Semantic Segmentation Network with Semantic and Contextual Refinements

Title: Distinguishing Scams and Fraud with Ensemble Learning

Title: LatentQA: Teaching LLMs to Decode Activations Into Natural Language

Title: From MLP to NeoMLP: Leveraging Self-Attention for Neural Fields

Title: Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions

Title: In-Context Learning with Topological Information for Knowledge Graph Completion

Title: Proactive Adversarial Defense: Harnessing Prompt Tuning in Vision-Language Models to Detect Unseen Backdoored Images

Title: Integrating Optimization Theory with Deep Learning for Wireless Network Design

Title: Beyond Knowledge Silos: Task Fingerprinting for Democratization of Medical Imaging AI

Title: Security Properties for Open-Source Hardware Designs

Title: LLaVA-Zip: Adaptive Visual Token Compression with Intrinsic Image Information

Title: ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder

Title: Bayesian optimized deep ensemble for uncertainty quantification of deep neural networks: a system safety case study on sodium fast reactor thermal stratification modeling

Title: Reward-based Blockchain Infrastructure for 3D IC Supply Chain Provenance

Title: Generative Modeling with Explicit Memory

Title: Coverage-based Fairness in Multi-document Summarization

Title: HARP: A challenging human-annotated math reasoning benchmark

Title: Large Concept Models: Language Modeling in a Sentence Representation Space

Title: Exploring Large Language Models on Cross-Cultural Values in Connection with Training Methodology

Title: ViUniT: Visual Unit Tests for More Robust Visual Programming

Title: A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions

Title: Inference-Time Diffusion Model Distillation

Title: Towards modeling evolving longitudinal health trajectories with a transformer-based deep learning model

Title: SMMF: Square-Matricized Momentum Factorization for Memory-Efficient Optimization

Title: AI-assisted Knowledge Discovery in Biomedical Literature to Support Decision-making in Precision Oncology

Title: Federated Foundation Models on Heterogeneous Time Series

Title: Goal-Conditioned Supervised Learning for Multi-Objective Recommendation

Title: Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression

Title: Sensing for Space Safety and Sustainability: A Deep Learning Approach with Vision Transformers

Title: QFAM: Mitigating QUIC Handshake Flooding Attacks Through Crypto Challenges

Title: Optimized Gradient Clipping for Noisy Label Learning

Title: MoSLD: An Extremely Parameter-Efficient Mixture-of-Shared LoRAs for Multi-Task Learning

Title: Selective Visual Prompting in Vision Mamba

Title: Mojito: Motion Trajectory and Intensity Control for Video Generation

Title: Align, Generate, Learn: A Novel Closed-Loop Framework for Cross-Lingual In-Context Learning

Title: BA-ORABE: Blockchain-Based Auditable Registered Attribute-Based Encryption With Reliable Outsourced Decryption

Title: AFFAKT: A Hierarchical Optimal Transport based Method for Affective Facial Knowledge Transfer in Video Deception Detection

Title: Deep Learning Model Security: Threats and Defenses

Title: Reasoning-Aware Query-Focused Summarization over Multi-Table Data

Title: RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Title: Elevating Flow-Guided Video Inpainting with Reference Generation

Title: Assessing the Robustness of Retrieval-Augmented Generation Systems in K-12 Educational Question Answering with Knowledge Discrepancies

Title: CBCMS: A Compliance Management System for Cross-Border Data Transfer

Title: MS2Mesh-XR: Multi-modal Sketch-to-Mesh Generation in XR Environments

Title: A physics-informed transformer neural operator for learning generalized solutions of initial boundary value problems

Title: What Makes Cryptic Crosswords Challenging for LLMs?

Title: Arbitrary-steps Image Super-resolution via Diffusion Inversion

Title: STEAM: Squeeze and Transform Enhanced Attention Module

Title: Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model

Title: Learning and Current Prediction of PMSM Drive via Differential Neural Networks

Title: RingFormer: A Ring-Enhanced Graph Transformer for Organic Solar Cell Property Prediction

Title: Dialogue Language Model with Large-Scale Persona Data Engineering

Title: ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty

Title: Beyond Confusion: A Fine-grained Dialectical Examination of Human Activity Recognition Benchmark Datasets

Title: Motif Guided Graph Transformer with Combinatorial Skeleton Prototype Learning for Skeleton-Based Person Re-Identification

Title: Mining Word Boundaries from Speech-Text Parallel Data for Cross-domain Chinese Word Segmentation

Title: Multi-Task Learning with LLMs for Implicit Sentiment Analysis: Data-level and Task-level Automatic Weight Learning

Title: Dial-In LLM: Human-Aligned Dialogue Intent Clustering with LLM-in-the-loop

Title: ContextHOI: Spatial Context Learning for Human-Object Interaction Detection

Title: Hyperbolic-constraint Point Cloud Reconstruction from Single RGB-D Images

Title: PhishIntel: Toward Practical Deployment of Reference-based Phishing Detection

Title: Go With the Flow: Fast Diffusion for Gaussian Mixture Models

Title: An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques

Title: SVasP: Self-Versatility Adversarial Style Perturbation for Cross-Domain Few-Shot Learning

Title: Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning

Title: Neural Networks for Threshold Dynamics Reconstruction

Title: Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method

Title: Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph Completion

Title: OriginPruner: Leveraging Method Origins for Guided Call Graph Pruning

Title: The Utility and Complexity of In- and Out-of-Distribution Machine Unlearning

Title: LVMark: Robust Watermark for latent video diffusion models

Title: Evaluating the Potential of In-Memory Processing to Accelerate Homomorphic Encryption

Title: Evaluating Adversarial Attacks on Traffic Sign Classifiers beyond Standard Baselines

Title: When Text Embedding Meets Large Language Model: A Comprehensive Survey

Title: DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization

Title: ReFF: Reinforcing Format Faithfulness in Language Models across Varied Tasks

Title: On the effectiveness of Rotation-Equivariance in U-Net: A Benchmark for Image Segmentation

Title: RAD: Region-Aware Diffusion Models for Image Inpainting

Title: ExpRDiff: Short-exposure Guided Diffusion Model for Realistic Local Motion Deblurring

Title: MVC-VPR: Mutual Learning of Viewpoint Classification and Visual Place Recognition

Title: CleanComedy: Creating Friendly Humor through Generative Techniques

Title: Enhancing Implicit Neural Representations via Symmetric Power Transformation

Title: USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation

Title: Building a Privacy Web with SPIDEr -- Secure Pipeline for Information De-Identification with End-to-End Encryption

Title: DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification

Title: Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering

Title: Uplift modeling with continuous treatments: A predict-then-optimize approach

Title: VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation

Title: Make Satire Boring Again: Reducing Stylistic Bias of Satirical Corpus by Utilizing Generative LLMs

Title: GeLoRA: Geometric Adaptive Ranks For Efficient LoRA Fine-tuning

Title: When Can Memorization Improve Fairness?

Title: FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection

Title: Multi-client Functional Encryption for Set Intersection with Non-monotonic Access Structures in Federated Learning

Title: LatentSync: Audio Conditioned Latent Diffusion Models for Lip Sync

Title: Towards Understanding the Robustness of LLM-based Evaluations under Perturbations

Title: Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine

Title: Learning to Solve Domain-Specific Calculation Problems with Knowledge-Intensive Programs Generator

Title: CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs

Title: Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices

Title: Transfer Learning of RSSI to Improve Indoor Localisation Performance

Title: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression

Title: Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component Evaluation

Title: FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation

Title: Are Conditional Latent Diffusion Models Effective for Image Restoration?

Title: Auto-Regressive Moving Diffusion Models for Time Series Forecasting

Title: MaskTerial: A Foundation Model for Automated 2D Material Flake Detection

Title: Training LayoutLM from Scratch for Efficient Named-Entity Recognition in the Insurance Domain

Title: DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

Title: Causal Graphical Models for Vision-Language Compositional Understanding

Title: Word Sense Linking: Disambiguating Outside the Sandbox

Title: A comprehensive interpretable machine learning framework for Mild Cognitive Impairment and Alzheimer's disease diagnosis

Title: Diffusion Model with Representation Alignment for Protein Inverse Folding

Title: UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer

Title: MultiEYE: Dataset and Benchmark for OCT-Enhanced Retinal Disease Recognition from Fundus Images

Title: Unifying AI Tutor Evaluation: An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors

Title: Towards Robust and Fair Vision Learning in Open-World Environments

Title: MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning

Title: ATPrompt: Textual Prompt Learning with Embedded Attributes

Title: A Semi Black-Box Adversarial Bit-Flip Attack with Limited DNN Model Information

Title: The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective

Title: OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs

Title: STORM: A Spatio-Temporal Factor Model Based on Dual Vector Quantized Variational Autoencoders for Financial Trading

Title: A Novel Ensemble-Based Deep Learning Model with Explainable AI for Accurate Kidney Disease Diagnosis

Title: Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Title: Vision Transformers for Efficient Indoor Pathloss Radio Map Prediction

Title: GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency

Title: Efficient and Comprehensive Feature Extraction in Large Vision-Language Model for Clinical Pathology Analysis

Title: Can Modern LLMs Act as Agent Cores in Radiology~Environments?

Title: Sail into the Headwind: Alignment via Robust Rewards and Dynamic Labels against Reward Hacking

Title: SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing

Title: Exemplar Masking for Multimodal Incremental Learning

Title: Video Creation by Demonstration

Title: Does Representation Matter? Exploring Intermediate Layers in Large Language Models

Title: Obfuscated Activations Bypass LLM Latent-Space Defenses

Title: JuStRank: Benchmarking LLM Judges for System Ranking

Title: DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction

Title: FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction

Title: Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders

Title: Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion

Title: InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Title: LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors

Title: Do Multimodal Large Language Models See Like Humans?

Title: SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Title: Feat2GS: Probing Visual Foundation Models with Gaussian Splatting

Title: Spectral Image Tokenizer

Title: FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers

Title: Olympus: A Universal Task Router for Computer Vision Tasks

Title: Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG

Title: EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Title: SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Title: Learning Camera Movement Control from Real-World Drone Videos

Title: LoRACLR: Contrastive Adaptation for Customization of Diffusion Models

Title: OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation

Title: GenEx: Generating an Explorable World

Title: Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors

Title: FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

Title: Doe-1: Closed-Loop Autonomous Driving with Large World Model