2024-11-27

Title: Leveraging Conversational Generative AI for Anomaly Detection in Digital Substations

Title: Enhancing LLMs for Power System Simulations: A Feedback-driven Multi-agent Framework

Title: SafeLight: Enhancing Security in Optical Convolutional Neural Network Accelerators

Title: Conditional Text-to-Image Generation with Reference Guidance

Title: TPIE: Topology-Preserved Image Editing With Text Instructions

Title: Learn2Synth: Learning Optimal Data Synthesis Using Hypergradients

Title: Importance-based Token Merging for Diffusion Models

Title: Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks

Title: Revelio: Interpreting and leveraging semantic information in diffusion models

Title: EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion

Title: "Moralized" Multi-Step Jailbreak Prompts: Black-Box Testing of Guardrails in Large Language Models for Verbal Attacks

Title: Multi-Reranker: Maximizing performance of retrieval-augmented generation in the FinanceRAG challenge

Title: Towards Satellite Image Road Graph Extraction: A Global-Scale Dataset and A Novel Method

Title: ChemSafetyBench: Benchmarking LLM Safety on Chemistry Domain

Title: Federated Learning in Chemical Engineering: A Tutorial on a Framework for Privacy-Preserving Collaboration Across Distributed Data Sources

Title: Classifier-Free Guidance inside the Attraction Basin May Cause Memorization

Title: LoBAM: LoRA-Based Backdoor Attack on Model Merging

Title: FollowGen: A Scaled Noise Conditional Diffusion Model for Car-Following Trajectory Prediction

Title: LetsTalk: Latent Diffusion Transformer for Talking Video Synthesis

Title: AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks

Title: PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation

Title: Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy

Title: Visual Counter Turing Test (VCT^2): Discovering the Challenges for AI-Generated Image Detection and Introducing Visual AI Index (V_AI)

Title: LibraGrad: Balancing Gradient Flow for Universally Better Vision Transformer Attributions

Title: Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning

Title: Hide in Plain Sight: Clean-Label Backdoor for Auditing Membership Inference

Title: SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction

Title: Revisiting DDIM Inversion for Controlling Defect Generation by Disentangling the Background

Title: In-Context Experience Replay Facilitates Safety Red-Teaming of Text-to-Image Diffusion Models

Title: VidHal: Benchmarking Temporal Hallucinations in Vision LLMs

Title: MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing

Title: SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models

Title: GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis

Title: NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model

Title: UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

Title: Scaling Laws for Black box Adversarial Attacks

Title: CoCoNO: Attention Contrast-and-Complete for Initial Noise Optimization in Text-to-Image Synthesis

Title: TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction

Title: Leveraging the Power of MLLMs for Gloss-Free Sign Language Translation

Title: Learning Predictive Checklists with Probabilistic Logic Programming

Title: What can LLM tell us about cities?

Title: From Diffusion to Resolution: Leveraging 2D Diffusion Models for 3D Super-Resolution Task

Title: Phase-Informed Tool Segmentation for Manual Small-Incision Cataract Surgery

Title: Towards Efficient Model-Heterogeneity Federated Learning for Large Models

Title: Enhancing Answer Reliability Through Inter-Model Consensus of Large Language Models

Title: Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image

Title: Controllable Human Image Generation with Personalized Multi-Garments

Title: Abnormality-Driven Representation Learning for Radiology Imaging

Title: Blockchain Meets LLMs: A Living Survey on Bidirectional Integration

Title: Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observation

Title: Fine-Tuning LLMs with Noisy Data for Political Argument Generation

Title: XAI and Android Malware Models

Title: Enhancing In-Hospital Mortality Prediction Using Multi-Representational Learning with LLM-Generated Expert Summaries

Title: Pathways on the Image Manifold: Image Editing via Video Generation

Title: DetailGen3D: Generative 3D Geometry Enhancement via Data-Dependent Flow

Title: Beyond Sight: Towards Cognitive Alignment in LVLM via Enriched Visual Knowledge

Title: Decision Making under the Exponential Family: Distributionally Robust Optimisation with Bayesian Ambiguity Sets

Title: Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing

Title: Open Vocabulary Monocular 3D Object Detection

Title: SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE

Title: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering

Title: PreF3R: Pose-Free Feed-Forward 3D Gaussian Splatting from Variable-length Image Sequence

Title: Explainable AI Approach using Near Misses Analysis

Title: Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding

Title: A SAM-guided and Match-based Semi-Supervised Segmentation Framework for Medical Imaging

Title: Probing the limitations of multimodal language models for chemistry and materials research

Title: MotionWavelet: Human Motion Prediction via Wavelet Manifold Learning

Title: ZoomLDM: Latent Diffusion Model for multi-scale image generation

Title: SEMU-Net: A Segmentation-based Corrector for Fabrication Process Variations of Nanophotonics with Microscopic Images

Title: ExpTest: Automating Learning Rate Searching and Tuning with Insights from Linearized Neural Networks

Title: EvoChain: a Recovery Approach for Permissioned Blockchain Applications

Title: Teaching Smaller Language Models To Generalise To Unseen Compositional Questions (Full Thesis)

Title: Decentralized Storage And Self-Sovereign Identity For Document-Based Claims

Title: CMAViT: Integrating Climate, Management, and Remote Sensing Data for Crop Yield Estimation with Multimodel Vision Transformers

Title: Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models

Title: Tree Transformers are an Ineffective Model of Syntactic Constituency

Title: Curvature Informed Furthest Point Sampling

Title: Can a Single Tree Outperform an Entire Forest?

Title: HOPE: Homomorphic Order-Preserving Encryption for Outsourced Databases -- A Stateless Approach

Title: TED-VITON: Transformer-Empowered Diffusion Models for Virtual Try-On

Title: RED: Robust Environmental Design

Title: Multimodal Alignment and Fusion: A Survey

Title: Free$^2$Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models

Title: Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation

Title: PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation

Title: ThreatModeling-LLM: Automating Threat Modeling using Large Language Models for Banking System

Title: A generalised novel loss function for computational fluid dynamics

Title: SCASeg: Strip Cross-Attention for Efficient Semantic Segmentation

Title: Graph Structure Learning with Bi-level Optimization

Title: Relations, Negations, and Numbers: Looking for Logic in Generative Text-to-Image Models

Title: Don't Command, Cultivate: An Exploratory Study of System-2 Alignment

Title: Contrastive CFG: Improving CFG in Diffusion Models by Contrasting Positive and Negative Concepts

Title: ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction

Title: Efficient LLM Inference with I/O-Aware Partial KV Cache Recomputation

Title: LESS: Efficient Log Storage System Based on Learned Model and Minimum Attribute Tree

Title: PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution

Title: Learning from Noisy Labels via Conditional Distributionally Robust Optimization

Title: Star Attention: Efficient LLM Inference over Long Sequences

Title: Advancing Content Moderation: Evaluating Large Language Models for Detecting Sensitive Content Across Text, Images, and Videos

Title: DOGE: Towards Versatile Visual Document Grounding and Referring

Title: From Machine Learning to Machine Unlearning: Complying with GDPR's Right to be Forgotten while Maintaining Business Value of Predictive Models

Title: Improving Resistance to Noisy Label Fitting by Reweighting Gradient in SAM

Title: Crack Detection in Infrastructure Using Transfer Learning, Spatial Attention, and Genetic Algorithm Optimization

Title: Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation

Title: Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation

Title: Enhancing Lane Segment Perception and Topology Reasoning with Crowdsourcing Trajectory Priors

Title: OSDFace: One-Step Diffusion Model for Face Restoration

Title: MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection

Title: Learning Monotonic Attention in Transducer for Streaming Generation

Title: ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting

Title: LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization

Title: An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models

Title: E-Trojans: Ransomware, Tracking, DoS, and Data Leaks on Battery-powered Embedded Systems

Title: PhysMotion: Physics-Grounded Dynamics From a Single Image

Title: Strategic Prompting for Conversational Tasks: A Comparative Analysis of Large Language Models Across Diverse Conversational Tasks

Title: LampMark: Proactive Deepfake Detection via Training-Free Landmark Perceptual Watermarks

Title: Scaling nnU-Net for CBCT Segmentation

Title: MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution

Title: Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning

Title: GraphSubDetector: Time Series Subsequence Anomaly Detection via Density-Aware Adaptive Graph Neural Network

Title: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting

Title: MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers

Title: MLI-NeRF: Multi-Light Intrinsic-Aware Neural Radiance Fields

Title: From Graph Diffusion to Graph Classification

Title: Grounding-IQA: Multimodal Language Grounding Model for Image Quality Assessment

Title: Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration

Title: DiffSLT: Enhancing Diversity in Sign Language Translation via Diffusion Model

Title: DGNN-YOLO: Dynamic Graph Neural Networks with YOLO11 for Small Object Detection and Tracking in Traffic Surveillance

Title: APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents

Title: Disentangled Interpretable Representation for Efficient Long-term Time Series Forecasting

Title: HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator

Title: A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs

Title: BadScan: An Architectural Backdoor Attack on Visual State Space Models

Title: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling

Title: Privacy Preserving Federated Unsupervised Domain Adaptation with Application to Age Prediction from DNA Methylation Data

Title: Task Progressive Curriculum Learning for Robust Visual Question Answering

Title: GrokFormer: Graph Fourier Kolmogorov-Arnold Transformers

Title: ER2Score: LLM-based Explainable and Customizable Metric for Assessing Radiology Reports with Reward-Control Loss

Title: Meaningless is better: hashing bias-inducing words in LLM prompts improves performance in logical reasoning and statistical learning

Title: in-Car Biometrics (iCarB) Datasets for Driver Recognition: Face, Fingerprint, and Voice

Title: Reward Incremental Learning in Text-to-Image Generation

Title: A Framework for the Security and Privacy of Biometric System Constructions under Defined Computational Assumptions

Title: InsightEdit: Towards Better Instruction Following for Image Editing

Title: MotionLLaMA: A Unified Framework for Motion Synthesis and Comprehension

Title: Different Bias Under Different Criteria: Assessing Bias in LLMs with a Fact-Based Approach

Title: Assessing Vulnerability in Smart Contracts: The Role of Code Complexity Metrics in Security Analysis

Title: Real-Time Multimodal Signal Processing for HRI in RoboCup: Understanding a Human Referee

Title: Joint Combinatorial Node Selection and Resource Allocations in the Lightning Network using Attention-based Reinforcement Learning

Title: DWCL: Dual-Weighted Contrastive Learning for Multi-View Clustering

Title: SAM-MPA: Applying SAM to Few-shot Medical Image Segmentation using Mask Propagation and Auto-prompting

Title: Fairness And Performance In Harmony: Data Debiasing Is All You Need

Title: The Extractive-Abstractive Spectrum: Uncovering Verifiability Trade-offs in LLM Generations

Title: RealTraj: Towards Real-World Pedestrian Trajectory Forecasting

Title: MFF-FTNet: Multi-scale Feature Fusion across Frequency and Temporal Domains for Time Series Forecasting

Title: AnchorCrafter: Animate CyberAnchors Selling Your Products via Human-Object Interacting Video Generation

Title: Robust Bayesian Optimization via Localized Online Conformal Prediction

Title: Can LLMs be Good Graph Judger for Knowledge Graph Construction?

Title: NumGrad-Pull: Numerical Gradient Guided Tri-plane Representation for Surface Reconstruction from Point Clouds

Title: One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models

Title: CoA: Chain-of-Action for Generative Semantic Labels

Title: Multimodal Outer Arithmetic Block Dual Fusion of Whole Slide Images and Omics Data for Precision Oncology

Title: DRiVE: Diffusion-based Rigging Empowers Generation of Versatile and Expressive Characters

Title: Self-supervised Video Instance Segmentation Can Boost Geographic Entity Alignment in Historical Maps

Title: Rewiring Techniques to Mitigate Oversquashing and Oversmoothing in GNNs: A Survey

Title: Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Title: Maximally Separated Active Learning

Title: Support Vector Machine for Person Classification Using the EEG Signals

Title: A Graph Neural Network deep-dive into successful counterattacks

Title: VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Title: PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning

Title: Spatially Visual Perception for End-to-End Robotic Learning

Title: WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Title: SoK: Decentralized AI (DeAI)

Title: Learning 3D Representations from Procedural 3D Programs

Title: Adversarial Bounding Boxes Generation (ABBG) Attack against Visual Object Trackers

Title: Towards Precise Scaling Laws for Video Diffusion Transformers

Title: Learning New Concepts, Remembering the Old: A Novel Continual Learning

Title: Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory

Title: TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba

Title: COBRA: A Continual Learning Approach to Vision-Brain Understanding

Title: Time-Series Forecasting in Smart Manufacturing Systems: An Experimental Evaluation of the State-of-the-art Algorithms

Title: SuperMat: Physically Consistent PBR Material Estimation at Interactive Rates

Title: Pushing the Limits of Large Language Model Quantization via the Linearity Theorem

Title: HSI-Drive v2.0: More Data for New Challenges in Scene Understanding for Autonomous Driving

Title: FTMoMamba: Motion Generation with Frequency and Text State Space Models

Title: IMPROVE: Improving Medical Plausibility without Reliance on Human Validation - An Enhanced Prototype-Guided Diffusion Framework

Title: Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning

Title: AI-Augmented Ethical Hacking: A Practical Examination of Manual Exploitation and Privilege Escalation in Linux Environments

Title: Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving

Title: A Bilayer Segmentation-Recombination Network for Accurate Segmentation of Overlapping C. elegans

Title: Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey

Title: RTL-Breaker: Assessing the Security of LLMs against Backdoor Attacks on HDL Code Generation

Title: Learning Explainable Treatment Policies with Clinician-Informed Representations: A Practical Approach

Title: A Distractor-Aware Memory for Visual Object Tracking with SAM2

Title: From Fairness to Infinity: Outcome-Indistinguishable (Omni)Prediction in Evolving Graphs

Title: Pre-training for Action Recognition with Automatically Generated Fractal Datasets

Title: Multi-Objective Reinforcement Learning for Automated Resilient Cyber Defence

Title: VideoDirector: Precise Video Editing via Text-to-Video Models

Title: What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics

Title: Can artificial intelligence predict clinical trial outcomes?

Title: HyperSeg: Towards Universal Visual Segmentation with Large Language Model

Title: Scaling Speech-Text Pre-training with Synthetic Interleaved Data

Title: Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation

Title: Accelerating Vision Diffusion Transformers with Skip Branches

Title: Machine Learning and Multi-source Remote Sensing in Forest Carbon Stock Estimation: A Review

Title: Data-driven development of cycle prediction models for lithium metal batteries using multi modal mining

Title: Learning Chemical Reaction Representation with Reactant-Product Alignment

Title: On Limitations of LLM as Annotator for Low Resource Languages

Title: A robust image encryption scheme based on new 4-D hyperchaotic system and elliptic curve

Title: Explainable AI for Classifying UTI Risk Groups Using a Real-World Linked EHR and Pathology Lab Dataset

Title: SAMWISE: Infusing wisdom in SAM2 for Text-Driven Video Segmentation

Title: DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting

Title: Linguistic Laws Meet Protein Sequences: A Comparative Analysis of Subword Tokenization Methods

Title: Synthetic Data Generation with LLM for Improved Depression Prediction

Title: SketchAgent: Language-Driven Sequential Sketch Generation

Title: Push the Limit of Multi-modal Emotion Recognition by Prompting LLMs with Receptive-Field-Aware Attention Weighting

Title: Enhancing Character-Level Understanding in LLMs through Token Internal Structure Learning

Title: RealSeal: Revolutionizing Media Authentication with Real-Time Realism Scoring

Title: Attamba: Attending To Multi-Token States

Title: Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

Title: GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration

Title: Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Title: Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats

Title: ScribbleLight: Single Image Indoor Relighting with Scribbles

Title: StableAnimator: High-Quality Identity-Preserving Human Image Animation