Search Results for author: Xin Wang

Found 475 papers, 164 papers with code

OIE@OIA: an Adaptable and Efficient Open Information Extraction Framework

no code implementations ACL 2022 Xin Wang, Minlong Peng, Mingming Sun, Ping Li

OIE@OIA follows the methodology of Open Information eXpression (OIX): parsing a sentence to an Open Information Annotation (OIA) Graph and then adapting the OIA graph to different OIE tasks with simple rules.

Open Information Extraction Sentence

A Predicate-Function-Argument Annotation of Natural Language for Open-Domain Information eXpression

no code implementations EMNLP 2020 Mingming Sun, Wenyue Hua, Zoey Liu, Xin Wang, Kangjie Zheng, Ping Li

Based on the same platform of OIX, the OIE strategies are reusable, and people can select a set of strategies to assemble their algorithm for a specific task so that the adaptability may be significantly increased.

Open Information Extraction Sentence

Dependency Position Encoding for Relation Extraction

no code implementations Findings (NAACL) 2022 Qiushi Guo, Xin Wang, Dehong Gao

Leveraging the dependency tree of the input sentence is able to improve the model performance for relation extraction.

Position Relation +2

Efficient Sharpness-aware Minimization for Molecular Graph Transformer Models

1 code implementation ICLR 2024 Yili Wang, Kaixiong Zhou, Ninghao Liu, Ying Wang, Xin Wang

Sharpness-aware minimization (SAM) has received increasing attention in computer vision since it can effectively eliminate the sharp local minima from the training trajectory and mitigate generalization degradation.

Molecular Property Prediction

Tele-FLM Technical Report

no code implementations25 Apr 2024 Xiang Li, Yiqun Yao, Xin Jiang, Xuezhi Fang, Chao Wang, Xinzhang Liu, Zihan Wang, Yu Zhao, Xin Wang, Yuyao Huang, Shuangyong Song, Yongxiang Li, Zheng Zhang, Bo Zhao, Aixin Sun, Yequan Wang, Zhongjiang He, Zhongyuan Wang, Xuelong Li, Tiejun Huang

Large language models (LLMs) have showcased profound capabilities in language understanding and generation, facilitating a wide array of applications.

Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models

no code implementations24 Apr 2024 Xu Shen, Yili Wang, Kaixiong Zhou, Shirui Pan, Xin Wang

In this work, we propose to detect OOD molecules by adopting an auxiliary diffusion model-based framework, which compares similarities between input molecules and reconstructed graphs.

Denoising Graph Reconstruction +1

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation

1 code implementation23 Apr 2024 Xuanhua He, Quande Liu, Shengju Qian, Xin Wang, Tao Hu, Ke Cao, Keyu Yan, Man Zhou, Jie Zhang

Based on this pipeline, a random face reference training method is further devised to precisely capture the ID-relevant embeddings from reference images, thus improving the fidelity and generalization capacity of our model for ID-specific video generation.

Attribute Video Generation

FLARE: A New Federated Learning Framework with Adjustable Learning Rates over Resource-Constrained Wireless Networks

no code implementations23 Apr 2024 Bingnan Xiao, Jingjing Zhang, Wei Ni, Xin Wang

Wireless federated learning (WFL) suffers from heterogeneity prevailing in the data distributions, computing powers, and channel conditions of participating devices.

Federated Learning Scheduling

LLM-Enhanced Causal Discovery in Temporal Domain from Interventional Data

no code implementations23 Apr 2024 Peiwen Li, Xin Wang, Zeyang Zhang, Yuan Meng, Fang Shen, Yue Li, Jialong Wang, Yang Li, Wenweu Zhu

In the field of Artificial Intelligence for Information Technology Operations, causal discovery is pivotal for operation and maintenance of graph construction, facilitating downstream industrial tasks such as root cause analysis.

Causal Discovery graph construction

Mechanisms promoting biodiversity in ecosystems

no code implementations23 Apr 2024 Ju Kang, Yiyuan Niu, Xin Wang

Explaining biodiversity is a central focus in theoretical ecology.

FedGreen: Carbon-aware Federated Learning with Model Size Adaptation

no code implementations23 Apr 2024 Ali Abbasi, Fan Dong, Xin Wang, Henry Leung, Jiayu Zhou, Steve Drew

Federated learning (FL) provides a promising collaborative framework to build a model from distributed clients, and this work investigates the carbon emission of the FL process.

Federated Learning Model Compression

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

no code implementations22 Apr 2024 Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Parul Chopra, Allie Del Giorno, Gustavo de Rosa, Matthew Dixon, Ronen Eldan, Dan Iter, Amit Garg, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Jamie Huynh, Mojan Javaheripi, Xin Jin, Piero Kauffmann, Nikos Karampatziakis, Dongwoo Kim, Mahoud Khademi, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Chen Liang, Weishung Liu, Eric Lin, Zeqi Lin, Piyush Madan, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, Reid Pryzant, Heyang Qin, Marko Radmilac, Corby Rosset, Sambudha Roy, Olatunji Ruwase, Olli Saarikivi, Amin Saied, Adil Salim, Michael Santacroce, Shital Shah, Ning Shang, Hiteshi Sharma, Xia Song, Masahiro Tanaka, Xin Wang, Rachel Ward, Guanhua Wang, Philipp Witte, Michael Wyatt, Can Xu, Jiahang Xu, Sonali Yadav, Fan Yang, ZiYi Yang, Donghan Yu, Chengruidong Zhang, Cyril Zhang, Jianwen Zhang, Li Lyna Zhang, Yi Zhang, Yue Zhang, Yunan Zhang, Xiren Zhou

We introduce phi-3-mini, a 3. 8 billion parameter language model trained on 3. 3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3. 5 (e. g., phi-3-mini achieves 69% on MMLU and 8. 38 on MT-bench), despite being small enough to be deployed on a phone.

Language Modelling

Texture-aware and Shape-guided Transformer for Sequential DeepFake Detection

no code implementations22 Apr 2024 Yunfei Li, Jiaran Zhou, Xin Wang, Junyu Dong, Yuezun Li

In this paper, we propose a novel Texture-aware and Shape-guided Transformer to enhance detection performance.

DeepFake Detection Face Swapping

Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images

1 code implementation19 Apr 2024 Santosh, Li Lin, Irene Amerini, Xin Wang, Shu Hu

Diffusion models (DMs) have revolutionized image generation, producing high-quality images with applications spanning various fields.

Image Generation

The VoicePrivacy 2024 Challenge Evaluation Plan

1 code implementation3 Apr 2024 Natalia Tomashenko, Xiaoxiao Miao, Pierre Champion, Sarina Meyer, Xin Wang, Emmanuel Vincent, Michele Panariello, Nicholas Evans, Junichi Yamagishi, Massimiliano Todisco

The task of the challenge is to develop a voice anonymization system for speech data which conceals the speaker's voice identity while protecting linguistic content and emotional states.

Qibo: A Large Language Model for Traditional Chinese Medicine

no code implementations24 Mar 2024 Heyi Zhang, Xin Wang, Zhaopeng Meng, Yongzhe Jia, Dawei Xu

Furthermore, we develop the Qibo-benchmark, a specialized tool for evaluating the performance of LLMs, which is a specialized tool for evaluating the performance of LLMs in the TCM domain.

Language Modelling Large Language Model

Exploring the Potential of Large Language Models in Graph Generation

no code implementations21 Mar 2024 Yang Yao, Xin Wang, Zeyang Zhang, Yijian Qin, Ziwei Zhang, Xu Chu, Yuekui Yang, Wenwu Zhu, Hong Mei

In this paper, we propose LLM4GraphGen to explore the ability of LLMs for graph generation with systematical task designs and extensive experiments.

Drug Discovery Graph Generation +1

When Do We Not Need Larger Vision Models?

1 code implementation19 Mar 2024 Baifeng Shi, Ziyang Wu, Maolin Mao, Xin Wang, Trevor Darrell

Our results show that a multi-scale smaller model has comparable learning capacity to a larger model, and pre-training smaller models with S$^2$ can match or even exceed the advantage of larger models.

Depth Estimation

MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations

1 code implementation16 Mar 2024 Hanlei Zhang, Xin Wang, Hua Xu, Qianrui Zhou, Kai Gao, Jianhua Su, jinyue Zhao, Wenrui Li, Yanting Chen

We believe that MIntRec2. 0 will serve as a valuable resource, providing a pioneering foundation for research in human-machine conversational interactions, and significantly facilitating related applications.

Multimodal Intent Recognition

Robust Light-Weight Facial Affective Behavior Recognition with CLIP

1 code implementation14 Mar 2024 Li Lin, Sarah Papabathini, Xin Wang, Shu Hu

Human affective behavior analysis aims to delve into human expressions and behaviors to deepen our understanding of human emotions.

Robust COVID-19 Detection in CT Images with CLIP

1 code implementation13 Mar 2024 Li Lin, Yamini Sri Krubha, Zhenhuan Yang, Cheng Ren, Thuc Duy Le, Irene Amerini, Xin Wang, Shu Hu

In the realm of medical imaging, particularly for COVID-19 detection, deep learning models face substantial challenges such as the necessity for extensive computational resources, the paucity of well-annotated datasets, and a significant amount of unlabeled data.

SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression

1 code implementation12 Mar 2024 Xin Wang, Yu Zheng, Zhongwei Wan, Mi Zhang

The advancements in Large Language Models (LLMs) have been hindered by their substantial sizes, which necessitate LLM compression methods for practical deployment.

Language Modelling Large Language Model +1

UAV-Enabled Asynchronous Federated Learning

no code implementations11 Mar 2024 Zhiyuan Zhai, Xiaojun Yuan, Xin Wang, Huiyuan Yang

To exploit unprecedented data generation in mobile edge networks, federated learning (FL) has emerged as a promising alternative to the conventional centralized machine learning (ML).

Federated Learning

Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer

no code implementations11 Mar 2024 Siddhant Satyanaik, Aditya Murali, Deepak Alapatt, Xin Wang, Pietro Mascagni, Nicolas Padoy

Purpose: Advances in deep learning have resulted in effective models for surgical video analysis; however, these models often fail to generalize across medical centers due to domain shift caused by variations in surgical workflow, camera setups, and patient demographics.

Anatomy Disentanglement +3

Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts

1 code implementation NeurIPS 2023 Zeyang Zhang, Xin Wang, Ziwei Zhang, Zhou Qin, Weigao Wen, Hui Xue, Haoyang Li, Wenwu Zhu

In this paper, we discover that there exist cases with distribution shifts unobservable in the time domain while observable in the spectral domain, and propose to study distribution shifts on dynamic graphs in the spectral domain for the first time.

Link Prediction Node Classification

Unsupervised Graph Neural Architecture Search with Disentangled Self-supervision

no code implementations NeurIPS 2023 Zeyang Zhang, Xin Wang, Ziwei Zhang, Guangyao Shen, Shiqi Shen, Wenwu Zhu

To address the challenge, we propose a novel Disentangled Self-supervised Graph Neural Architecture Search (DSGAS) model, which is able to discover the optimal architectures capturing various latent graph factors in a self-supervised fashion based on unlabeled graph data.

Disentanglement Neural Architecture Search

Electrocardiogram Instruction Tuning for Report Generation

no code implementations7 Mar 2024 Zhongwei Wan, Che Liu, Xin Wang, Chaofan Tao, Hui Shen, Zhenwu Peng, Jie Fu, Rossella Arcucci, Huaxiu Yao, Mi Zhang

Electrocardiogram (ECG) serves as the primary non-invasive diagnostic tool for cardiac conditions monitoring, are crucial in assisting clinicians.

Parameterized quantum comb and simpler circuits for reversing unknown qubit-unitary operations

no code implementations6 Mar 2024 Yin Mo, Lei Zhang, Yu-Ao Chen, Yingjian Liu, Tengxiang Lin, Xin Wang

Quantum comb is an essential tool for characterizing complex quantum protocols in quantum information processing.

Quantum Machine Learning

Neural Radiance Fields in Medical Imaging: Challenges and Next Steps

no code implementations26 Feb 2024 Xin Wang, Shu Hu, Heng Fan, Hongtu Zhu, Xin Li

Neural Radiance Fields (NeRF), as a pioneering technique in computer vision, offer great potential to revolutionize medical imaging by synthesizing three-dimensional representations from the projected two-dimensional image data.

Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening

no code implementations22 Feb 2024 Zhenrong Shen, Manman Fei, Xin Wang, Jiangdong Cai, Sheng Wang, Lichi Zhang, Qian Wang

In the first Global Image Generation stage, a Normal Image Generator is designed to generate cytopathological images full of normal cervical cells.

Cell Detection Data Augmentation +1

HyCubE: Efficient Knowledge Hypergraph 3D Circular Convolutional Embedding

no code implementations14 Feb 2024 Zhao Li, Xin Wang, JianXin Li, Wenbin Guo, Jun Zhao

Existing knowledge hypergraph embedding methods mainly focused on improving model performance, but their model structures are becoming more complex and redundant.

hypergraph embedding

Rethinking Propagation for Unsupervised Graph Domain Adaptation

1 code implementation8 Feb 2024 Meihan Liu, Zeyu Fang, Zhen Zhang, Ming Gu, Sheng Zhou, Xin Wang, Jiajun Bu

Motivated by our empirical analysis, we reevaluate the role of GNNs in graph domain adaptation and uncover the pivotal role of the propagation process in GNNs for adapting to different graph domains.

Domain Adaptation

Failure Analysis in Next-Generation Critical Cellular Communication Infrastructures

no code implementations6 Feb 2024 Siguo Bi, Xin Yuan, Shuyan Hu, Kai Li, Wei Ni, Ekram Hossain, Xin Wang

The advent of communication technologies marks a transformative phase in critical infrastructure construction, where the meticulous analysis of failures becomes paramount in achieving the fundamental objectives of continuity, security, and availability.

Revisiting VAE for Unsupervised Time Series Anomaly Detection: A Frequency Perspective

1 code implementation5 Feb 2024 Zexin Wang, Changhua Pei, Minghua Ma, Xin Wang, Zhihan Li, Dan Pei, Saravan Rajmohan, Dongmei Zhang, QIngwei Lin, Haiming Zhang, Jianhui Li, Gaogang Xie

To ensure an accurate AD, FCVAE exploits an innovative approach to concurrently integrate both the global and local frequency features into the condition of Conditional Variational Autoencoder (CVAE) to significantly increase the accuracy of reconstructing the normal data.

Anomaly Detection Time Series +1

Artificial Intelligence in Image-based Cardiovascular Disease Analysis: A Comprehensive Survey and Future Outlook

no code implementations4 Feb 2024 Xin Wang, Hongtu Zhu

Our review encompasses these modalities, giving a broad perspective on the diverse imaging techniques integrated with AI for CVD analysis.

Masked Conditional Diffusion Model for Enhancing Deepfake Detection

no code implementations1 Feb 2024 Tiewen Chen, Shanmin Yang, Shu Hu, Zhenghan Fang, Ying Fu, Xi Wu, Xin Wang

this paper present we put a new insight into diffusion model-based data augmentation, and propose a Masked Conditional Diffusion Model (MCDM) for enhancing deepfake detection.

Data Augmentation DeepFake Detection +1

Uncertainty-Aware Explainable Recommendation with Large Language Models

no code implementations31 Jan 2024 Yicui Peng, Hao Chen, ChingSheng Lin, Guo Huang, Jinrong Hu, Hui Guo, Bin Kong, Shu Hu, Xi Wu, Xin Wang

Providing explanations within the recommendation system would boost user satisfaction and foster trust, especially by elaborating on the reasons for selecting recommended items tailored to the user.

Explainable Recommendation Multi-Task Learning

Active Generation Network of Human Skeleton for Action Recognition

no code implementations30 Jan 2024 Long Liu, Xin Wang, Fangming Li, Jiayu Chen

To solve those problems, We propose a novel active generative network (AGN), which can adaptively learn various action categories by motion style transfer to generate new actions when the data for a particular action is only a single sample or few samples.

Action Generation Action Recognition +4

Detecting Multimedia Generated by Large AI Models: A Survey

1 code implementation22 Jan 2024 Li Lin, Neeraj Gupta, Yue Zhang, Hainan Ren, Chun-Hao Liu, Feng Ding, Xin Wang, Xin Li, Luisa Verdoliva, Shu Hu

The rapid advancement of Large AI Models (LAIMs), particularly diffusion models and large language models, has marked a new era where AI-generated multimedia is increasingly integrated into various aspects of daily life.

Efficient Image Super-Resolution via Symmetric Visual Attention Network

no code implementations17 Jan 2024 Chengxu Wu, Qinrui Fan, Shu Hu, Xi Wu, Xin Wang, Jing Hu

An important development direction in the Single-Image Super-Resolution (SISR) algorithms is to improve the efficiency of the algorithms.

Image Super-Resolution

IoT in the Era of Generative AI: Vision and Challenges

no code implementations3 Jan 2024 Xin Wang, Zhongwei Wan, Arvin Hekmati, Mingyu Zong, Samiul Alam, Mi Zhang, Bhaskar Krishnamachari

Equipped with sensing, networking, and computing capabilities, Internet of Things (IoT) such as smartphones, wearables, smart speakers, and household robots have been seamlessly weaved into our daily lives.

Federated Learning Prompt Engineering

Grounding-Prompter: Prompting LLM with Multimodal Information for Temporal Sentence Grounding in Long Videos

no code implementations28 Dec 2023 Houlun Chen, Xin Wang, Hong Chen, Zihan Song, Jia Jia, Wenwu Zhu

To tackle these challenges, in this work we propose a Grounding-Prompter method, which is capable of conducting TSG in long videos through prompting LLM with multimodal information.

Denoising In-Context Learning +3

PokeMQA: Programmable knowledge editing for Multi-hop Question Answering

1 code implementation23 Dec 2023 Hengrui Gu, Kaixiong Zhou, Xiaotian Han, Ninghao Liu, Ruobing Wang, Xin Wang

Multi-hop question answering (MQA) is one of the challenging tasks to evaluate machine's comprehension and reasoning abilities, where large language models (LLMs) have widely achieved the human-comparable performance.

Answer Generation knowledge editing +3

LLM4VG: Large Language Models Evaluation for Video Grounding

no code implementations21 Dec 2023 Wei Feng, Xin Wang, Hong Chen, Zeyang Zhang, Zihan Song, Yuwei Zhou, Wenwu Zhu

Recently, researchers have attempted to investigate the capability of LLMs in handling videos and proposed several video LLM models.

Image Captioning Video Grounding +1

In2SET: Intra-Inter Similarity Exploiting Transformer for Dual-Camera Compressive Hyperspectral Imaging

no code implementations20 Dec 2023 Xin Wang, Lizhi Wang, Xiangtian Ma, Maoqing Zhang, Lin Zhu, Hua Huang

Dual-Camera Compressed Hyperspectral Imaging (DCCHI) offers the capability to reconstruct 3D Hyperspectral Image (HSI) by fusing compressive and Panchromatic (PAN) image, which has shown great potential for snapshot hyperspectral imaging in practice.

ConvD: Attention Enhanced Dynamic Convolutional Embeddings for Knowledge Graph Completion

no code implementations11 Dec 2023 Wenbin Guo, Zhao Li, Xin Wang, Zirui Chen

In this paper, we propose a novel dynamic convolutional embedding model ConvD for knowledge graph completion, which directly reshapes the relation embeddings into multiple internal convolution kernels to improve the external convolution kernels of the traditional convolutional embedding model.

Entity Embeddings Relation

Detection and Mitigation of Position Spoofing Attacks on Cooperative UAV Swarm Formations

no code implementations6 Dec 2023 Siguo Bi, Kai Li, Shuyan Hu, Wei Ni, Cong Wang, Xin Wang

Detecting spoofing attacks on the positions of unmanned aerial vehicles (UAVs) within a swarm is challenging.

Position

Efficient Large Language Models: A Survey

3 code implementations6 Dec 2023 Zhongwei Wan, Xin Wang, Che Liu, Samiul Alam, Yu Zheng, Jiachen Liu, Zhongnan Qu, Shen Yan, Yi Zhu, Quanlu Zhang, Mosharaf Chowdhury, Mi Zhang

Large Language Models (LLMs) have demonstrated remarkable capabilities in important tasks such as natural language understanding, language generation, and complex reasoning and have the potential to make a substantial impact on our society.

Natural Language Understanding Text Generation

Virtual Quantum Markov Chains

no code implementations4 Dec 2023 Yu-Ao Chen, Chengkai Zhu, Keming He, Mingrui Jing, Xin Wang

In this work, we propose the concept of virtual quantum Markov chains (VQMCs), focusing on scenarios where subsystems retain classical information about global systems from measurement statistics.

VTimeLLM: Empower LLM to Grasp Video Moments

1 code implementation30 Nov 2023 Bin Huang, Xin Wang, Hong Chen, Zihan Song, Wenwu Zhu

Large language models (LLMs) have shown remarkable text understanding capabilities, which have been extended as Video LLMs to handle video data for comprehending visual details.

Dense Video Captioning Video-based Generative Performance Benchmarking (Consistency) +5

Out-of-Distribution Generalized Dynamic Graph Neural Network for Human Albumin Prediction

no code implementations27 Nov 2023 Zeyang Zhang, Xingwang Li, Fei Teng, Ning Lin, Xueling Zhu, Xin Wang, Wenwu Zhu

We first model human albumin prediction as a dynamic graph regression problem to model the dynamics and patient relationship.

Graph Attention Graph Regression +1

OFDMA-F$^2$L: Federated Learning With Flexible Aggregation Over an OFDMA Air Interface

no code implementations25 Nov 2023 Shuyan Hu, Xin Yuan, Wei Ni, Xin Wang, Ekram Hossain, H. Vincent Poor

Federated learning (FL) can suffer from a communication bottleneck when deployed in mobile networks, limiting participating clients and deterring FL convergence.

Federated Learning

Out-of-Distribution Generalized Dynamic Graph Neural Network with Disentangled Intervention and Invariance Promotion

no code implementations24 Nov 2023 Zeyang Zhang, Xin Wang, Ziwei Zhang, Haoyang Li, Wenwu Zhu

In this paper, we propose Disentangled Intervention-based Dynamic graph Attention networks with Invariance Promotion (I-DIDA) to handle spatio-temporal distribution shifts in dynamic graphs by discovering and utilizing invariant patterns, i. e., structures and features whose predictive abilities are stable across distribution shifts.

Graph Attention

Self-organized biodiversity in biotic resource systems

no code implementations23 Nov 2023 Ju Kang, Shijie Zhang, Yiyuan Niu, Xin Wang

What determines biodiversity in nature is a prominent issue in ecology, especially in biotic resource systems that are typically devoid of cross-feeding.

Adversarial Prompt Tuning for Vision-Language Models

1 code implementation19 Nov 2023 Jiaming Zhang, Xingjun Ma, Xin Wang, Lingyu Qiu, Jiaqi Wang, Yu-Gang Jiang, Jitao Sang

With the rapid advancement of multimodal learning, pre-trained Vision-Language Models (VLMs) such as CLIP have demonstrated remarkable capacities in bridging the gap between visual and language modalities.

Adversarial Robustness

MeLo: Low-rank Adaptation is Better than Fine-tuning for Medical Image Diagnosis

1 code implementation14 Nov 2023 Yitao Zhu, Zhenrong Shen, Zihao Zhao, Sheng Wang, Xin Wang, Xiangyu Zhao, Dinggang Shen, Qian Wang

By fixing the weight of ViT models and only adding small low-rank plug-ins, we achieve competitive results on various diagnosis tasks across different imaging modalities using only a few trainable parameters.

Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models

no code implementations10 Nov 2023 Siao Tang, Xin Wang, Hong Chen, Chaoyu Guan, Zewen Wu, Yansong Tang, Wenwu Zhu

In this paper, we propose a novel post-training quantization method PCR (Progressive Calibration and Relaxing) for text-to-image diffusion models, which consists of a progressive calibration strategy that considers the accumulated quantization error across timesteps, and an activation relaxing strategy that improves the performance with negligible cost.

Quantization

UMedNeRF: Uncertainty-aware Single View Volumetric Rendering for Medical Neural Radiance Fields

no code implementations10 Nov 2023 Jing Hu, Qinrui Fan, Shu Hu, Siwei Lyu, Xi Wu, Xin Wang

In the field of clinical medicine, computed tomography (CT) is an effective medical imaging modality for the diagnosis of various pathologies.

Computed Tomography (CT)

3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud

no code implementations8 Nov 2023 Jianchao Ci, Xin Wang, David Rapado-Rincón, Akshay K. Burusa, Gert Kootstra

A 21 comprehensive evaluation was conducted in a commercial greenhouse to gain insight into the 22 performance of different parts of the method.

3D Pose Estimation Keypoint Detection

Lightweight Diffusion Models with Distillation-Based Block Neural Architecture Search

no code implementations8 Nov 2023 Siao Tang, Xin Wang, Hong Chen, Chaoyu Guan, Yansong Tang, Wenwu Zhu

When retraining the searched architecture, we adopt a dynamic joint loss to maintain the consistency between supernet training and subnet retraining, which also provides informative objectives for each block and shortens the paths of gradient propagation.

Neural Architecture Search

Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models

no code implementations7 Nov 2023 Longteng Zhang, Xiang Liu, Zeyu Li, Xinglin Pan, Peijie Dong, Ruibo Fan, Rui Guo, Xin Wang, Qiong Luo, Shaohuai Shi, Xiaowen Chu

For end users, our benchmark and findings help better understand different optimization techniques, training and inference frameworks, together with hardware platforms in choosing configurations for deploying LLMs.

Quantization

VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning

no code implementations2 Nov 2023 Hong Chen, Xin Wang, Guanning Zeng, YiPeng Zhang, Yuwei Zhou, Feilin Han, Wenwu Zhu

The video generator is further customized for the given multiple subjects by the proposed Disen-Mix Finetuning and Human-in-the-Loop Re-finetuning strategy, which can tackle the attribute binding problem of multi-subject generation.

Attribute Text-to-Video Generation +1

A Systematic Review for Transformer-based Long-term Series Forecasting

no code implementations31 Oct 2023 Liyilei Su, Xumin Zuo, Rui Li, Xin Wang, Heng Zhao, Bingding Huang

Various variants have enabled transformer architecture to effectively handle long-term time series forecasting (LTSF) tasks.

Time Series Time Series Forecasting

Towards Generalized Multi-stage Clustering: Multi-view Self-distillation

no code implementations29 Oct 2023 Jiatai Wang, Zhiwei Xu, Xin Wang, Tao Li

MVC aims at exploring common semantics and pseudo-labels from multiple views and clustering in a self-supervised manner.

Clustering Contrastive Learning +1

Hierarchical Mutual Information Analysis: Towards Multi-view Clustering in The Wild

no code implementations28 Oct 2023 Jiatai Wang, Zhiwei Xu, Xuewen Yang, Xin Wang

Multi-view clustering (MVC) can explore common semantics from unsupervised views generated by different sources, and thus has been extensively used in applications of practical computer vision.

Clustering

Disentangled Representation Learning with Large Language Models for Text-Attributed Graphs

no code implementations27 Oct 2023 Yijian Qin, Xin Wang, Ziwei Zhang, Wenwu Zhu

Text-attributed graphs (TAGs) are prevalent on the web and research over TAGs such as citation networks, e-commerce networks and social networks has attracted considerable attention in the web community.

Representation Learning

LLM4DyG: Can Large Language Models Solve Spatial-Temporal Problems on Dynamic Graphs?

no code implementations26 Oct 2023 Zeyang Zhang, Xin Wang, Ziwei Zhang, Haoyang Li, Yijian Qin, Wenwu Zhu

Our main observations are: 1) LLMs have preliminary spatial-temporal understanding abilities on dynamic graphs, 2) Dynamic graph tasks show increasing difficulties for LLMs as the graph size and density increase, while not sensitive to the time span and data generation mechanism, 3) the proposed DST2 prompting method can help to improve LLMs' spatial-temporal understanding abilities on dynamic graphs for most tasks.

Self-triggered Consensus Control of Multi-agent Systems from Data

no code implementations19 Oct 2023 Yifei Li, Xin Wang, Jian Sun, Gang Wang, Jie Chen

In the presence of external disturbances, a model-based STC scheme is put forth for $\mathcal{H}_{\infty}$-consensus of MASs, serving as a baseline for the data-driven STC.

Provable Advantage of Parameterized Quantum Circuit in Function Approximation

no code implementations11 Oct 2023 Zhan Yu, Qiuhao Chen, Yuling Jiao, Yinan Li, Xiliang Lu, Xin Wang, Jerry Zhijian Yang

To achieve this, we utilize techniques from quantum signal processing and linear combinations of unitaries to construct PQCs that implement multivariate polynomials.

Quantum Machine Learning

Decentralized Federated Learning via MIMO Over-the-Air Computation: Consensus Analysis and Performance Optimization

no code implementations8 Oct 2023 Zhiyuan Zhai, Xiaojun Yuan, Xin Wang

We conduct a general convergence analysis to quantitatively capture the influence of aggregation weight and communication error on the MIMO OA-DFL performance in \emph{ad hoc} networks.

Distributed Optimization Federated Learning

Controlling Neural Style Transfer with Deep Reinforcement Learning

no code implementations30 Sep 2023 Chengming Feng, Jing Hu, Xin Wang, Shu Hu, Bin Zhu, Xi Wu, Hongtu Zhu, Siwei Lyu

Controlling the degree of stylization in the Neural Style Transfer (NST) is a little tricky since it usually needs hand-engineering on hyper-parameters.

reinforcement-learning Reinforcement Learning (RL) +1

Collaborative Watermarking for Adversarial Speech Synthesis

no code implementations26 Sep 2023 Lauri Juvela, Xin Wang

Advances in neural speech synthesis have brought us technology that is not only close to human naturalness, but is also capable of instant voice cloning with little data, and is highly accessible with pre-trained models available.

Speaker Verification Speech Synthesis +2

Statistical Analysis of Quantum State Learning Process in Quantum Neural Networks

1 code implementation NeurIPS 2023 Hao-Kai Zhang, Chenghong Zhu, Mingrui Jing, Xin Wang

As a quantum analog of probability distribution learning, quantum state learning is theoretically and practically essential in quantum machine learning.

Quantum Machine Learning

Image-to-Image Translation with Deep Reinforcement Learning

1 code implementation24 Sep 2023 Xin Wang, Ziwei Luo, Jing Hu, Chengming Feng, Shu Hu, Bin Zhu, Xi Wu, Xin Li, Siwei Lyu

The key feature in the RL-I2IT framework is to decompose a monolithic learning process into small steps with a lightweight model to progressively transform a source image successively to a target image.

Auxiliary Learning Decision Making +3

WiCV@CVPR2023: The Eleventh Women In Computer Vision Workshop at the Annual CVPR Conference

no code implementations22 Sep 2023 Doris Antensteiner, Marah Halawa, Asra Aslam, Ivaxi Sheth, Sachini Herath, Ziqi Huang, Sunnie S. Y. Kim, Aparna Akula, Xin Wang

In this paper, we present the details of Women in Computer Vision Workshop - WiCV 2023, organized alongside the hybrid CVPR 2023 in Vancouver, Canada.

On-the-Fly SfM: What you capture is What you get

1 code implementation21 Sep 2023 Zongqian Zhan, Rui Xia, Yifei Yu, Yibo Xu, Xin Wang

Over the last decades, ample achievements have been made on Structure from motion (SfM).

Image Registration Image Retrieval +1

For A More Comprehensive Evaluation of 6DoF Object Pose Tracking

no code implementations14 Sep 2023 Yang Li, Fan Zhong, Xin Wang, Shuangbing Song, Jiachen Li, Xueying Qin, Changhe Tu

The limitations of previous scoring methods and error metrics are analyzed, based on which we introduce our improved evaluation methods.

Pose Tracking

Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end?

1 code implementation12 Sep 2023 Xin Wang, Junichi Yamagishi

While many datasets use spoofed data generated by speech synthesis systems, it was recently found that data vocoded by neural vocoders were also effective as the spoofed training data.

Self-Supervised Learning Speech Synthesis

Outlier Robust Adversarial Training

1 code implementation10 Sep 2023 Shu Hu, Zhenhuan Yang, Xin Wang, Yiming Ying, Siwei Lyu

Theoretically, we show that the learning objective of ORAT satisfies the $\mathcal{H}$-consistency in binary classification, which establishes it as a proper surrogate to adversarial 0/1 loss.

Adversarial Attack Binary Classification

Control-Oriented Modeling and Layer-to-Layer Spatial Control of Powder Bed Fusion Processes

no code implementations8 Sep 2023 Xin Wang, Bumsoo Park, Robert G. Landers, Sandipan Mishra, Douglas A. Bristow

However, due to inherent process variability, it is still very costly and time consuming to certify the process and the part.

DRAG: Divergence-based Adaptive Aggregation in Federated learning on Non-IID Data

no code implementations4 Sep 2023 Feng Zhu, Jingjing Zhang, Shengyun Liu, Xin Wang

Local stochastic gradient descent (SGD) is a fundamental approach in achieving communication efficiency in Federated Learning (FL) by allowing individual workers to perform local updates.

Federated Learning

BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks

1 code implementation31 Aug 2023 Qiang Huang, Jiawei Jiang, Xi Susie Rao, Ce Zhang, Zhichao Han, Zitao Zhang, Xin Wang, Yongjun He, Quanqing Xu, Yang Zhao, Chuang Hu, Shuo Shang, Bo Du

To handle graphs in which features or connectivities are evolving over time, a series of temporal graph neural networks (TGNNs) have been proposed.

Link Prediction Node Classification

Graph Meets LLMs: Towards Large Graph Models

1 code implementation28 Aug 2023 Ziwei Zhang, Haoyang Li, Zeyang Zhang, Yijian Qin, Xin Wang, Wenwu Zhu

In order to promote applying large models for graphs forward, we present a perspective paper to discuss the challenges and opportunities associated with developing large graph models.

A Survey on Fairness in Large Language Models

no code implementations20 Aug 2023 Yingji Li, Mengnan Du, Rui Song, Xin Wang, Ying Wang

Large Language Models (LLMs) have shown powerful performance and development prospects and are widely deployed in the real world.

Fairness

Unsupervised Multiplex Graph Learning with Complementary and Consistent Information

1 code implementation3 Aug 2023 Liang Peng, Xin Wang, Xiaofeng Zhu

Unsupervised multiplex graph learning (UMGL) has been shown to achieve significant effectiveness for different downstream tasks by exploring both complementary information and consistent information among multiple graphs.

Graph Learning Representation Learning

SphereNet: Learning a Noise-Robust and General Descriptor for Point Cloud Registration

no code implementations18 Jul 2023 Guiyu Zhao, Zhentao Guo, Xin Wang, Hongbin Ma

However, most methods are susceptible to noise and have poor generalization ability on unseen datasets.

Point Cloud Registration

Mixed-Precision Quantization with Cross-Layer Dependencies

no code implementations11 Jul 2023 Zihao Deng, Xin Wang, Sayeh Sharify, Michael Orshansky

Quantization assigning the same bit-width to all layers leads to large accuracy degradation at low precision and is wasteful at high precision settings.

Quantization

Unlocking the Potential of Deep Learning in Peak-Hour Series Forecasting

1 code implementation4 Jul 2023 Zhenwei Zhang, Xin Wang, Jingyuan Xie, Heling Zhang, Yuantao Gu

Unlocking the potential of deep learning in Peak-Hour Series Forecasting (PHSF) remains a critical yet underexplored task in various domains.

Time Series Time Series Forecasting

An Explainable Deep Framework: Towards Task-Specific Fusion for Multi-to-One MRI Synthesis

1 code implementation3 Jul 2023 Luyi Han, Tianyu Zhang, Yunzhi Huang, Haoran Dou, Xin Wang, Yuan Gao, Chunyao Lu, Tan Tao, Ritse Mann

Multi-sequence MRI is valuable in clinical settings for reliable diagnosis and treatment prognosis, but some sequences may be unusable or missing for various reasons.

Over-The-Air Federated Learning: Status Quo, Open Challenges, and Future Directions

no code implementations3 Jul 2023 Bingnan Xiao, Xichen Yu, Wei Ni, Xin Wang, H. Vincent Poor

The development of applications based on artificial intelligence and implemented over wireless networks is increasingly rapidly and is expected to grow dramatically in the future.

Federated Learning

Efficient Search and Detection of Relevant Plant Parts using Semantics-Aware Active Vision

no code implementations16 Jun 2023 Akshay K. Burusa, Joost Scholten, David Rapado Rincon, Xin Wang, Eldert J. van Henten, Gert Kootstra

To automate harvesting and de-leafing of tomato plants using robots, it is important to search and detect the relevant plant parts, namely tomatoes, peduncles, and petioles.

Towards single integrated spoofing-aware speaker verification embeddings

1 code implementation30 May 2023 Sung Hwan Mun, Hye-jin Shim, Hemlata Tak, Xin Wang, Xuechen Liu, Md Sahidullah, Myeonghun Jeong, Min Hyun Han, Massimiliano Todisco, Kong Aik Lee, Junichi Yamagishi, Nicholas Evans, Tomi Kinnunen, Nam Soo Kim, Jee-weon Jung

Second, competitive performance should be demonstrated compared to the fusion of automatic speaker verification (ASV) and countermeasure (CM) embeddings, which outperformed single embedding solutions by a large margin in the SASV2022 challenge.

Speaker Verification

Controllable Text-to-Image Generation with GPT-4

no code implementations29 May 2023 Tianjun Zhang, Yi Zhang, Vibhav Vineet, Neel Joshi, Xin Wang

Control-GPT works by querying GPT-4 to write TikZ code, and the generated sketches are used as references alongside the text instructions for diffusion models (e. g., ControlNet) to generate photo-realistic images.

Instruction Following Text-to-Image Generation

Range-Based Equal Error Rate for Spoof Localization

1 code implementation28 May 2023 Lin Zhang, Xin Wang, Erica Cooper, Nicholas Evans, Junichi Yamagishi

To properly measure misclassified ranges and better evaluate spoof localization performance, we upgrade point-based EER to range-based EER.

Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval

1 code implementation26 May 2023 Zheng Li, Caili Guo, Xin Wang, Zerun Feng, Yanjun Wang

Given a query caption, the goal is to rank candidate images by relevance, from large to small.

Retrieval Text Retrieval

Gorilla: Large Language Model Connected with Massive APIs

1 code implementation24 May 2023 Shishir G. Patil, Tianjun Zhang, Xin Wang, Joseph E. Gonzalez

Large Language Models (LLMs) have seen an impressive wave of advances recently, with models now excelling in a variety of tasks, such as mathematical reasoning and program synthesis.

Hallucination Language Modelling +4

Federated Learning Model Aggregation in Heterogenous Aerial and Space Networks

no code implementations24 May 2023 Fan Dong, Ali Abbasi, Henry Leung, Xin Wang, Jiayu Zhou, Steve Drew

Direct sharing of the data distribution may be prohibitive due to the additional private information that is sent from the clients.

Federated Learning Privacy Preserving

TOAST: Transfer Learning via Attention Steering

1 code implementation24 May 2023 Baifeng Shi, Siyu Gai, Trevor Darrell, Xin Wang

We introduce Top-Down Attention Steering (TOAST), a novel transfer learning algorithm that keeps the pre-trained backbone frozen, selects task-relevant features in the output, and feeds those features back to the model to steer the attention to the task-specific features.

Fine-Grained Image Classification Instruction Following +2

Efficient information recovery from Pauli noise via classical shadow

no code implementations6 May 2023 Yifei Chen, Zhan Yu, Chenghong Zhu, Xin Wang

The rapid advancement of quantum computing has led to an extensive demand for effective techniques to extract classical information from quantum systems, particularly in fields like quantum machine learning and quantum chemistry.

Quantum Machine Learning

Clothes Grasping and Unfolding Based on RGB-D Semantic Segmentation

no code implementations5 May 2023 Xingyu Zhu, Xin Wang, Jonathan Freer, Hyung Jin Chang, Yixing Gao

These methods often utilize physics engines to synthesize depth images to reduce the cost of real labeled data collection.

Data Augmentation Semantic Segmentation

DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation

1 code implementation5 May 2023 Hong Chen, YiPeng Zhang, Simin Wu, Xin Wang, Xuguang Duan, Yuwei Zhou, Wenwu Zhu

To tackle the problems, we propose DisenBooth, an identity-preserving disentangled tuning framework for subject-driven text-to-image generation.

Denoising Disentanglement +1

DELTA: Dynamic Embedding Learning with Truncated Conscious Attention for CTR Prediction

no code implementations3 May 2023 Chen Zhu, Liang Du, Hong Chen, Shuang Zhao, Zixun Sun, Xin Wang, Wenwu Zhu

To tackle this problem, inspired by the Global Workspace Theory in conscious processing, which posits that only a specific subset of the product features are pertinent while the rest can be noisy and even detrimental to human-click behaviors, we propose a CTR model that enables Dynamic Embedding Learning with Truncated Conscious Attention for CTR prediction, termed DELTA.

Click-Through Rate Prediction

Enhancing Video Super-Resolution via Implicit Resampling-based Alignment

1 code implementation arXiv 2024 Kai Xu, Ziwei Yu, Xin Wang, Michael Bi Mi, Angela Yao

We show that bilinear interpolation inherently attenuates high-frequency information while an MLP-based coordinate network can approximate more frequencies.

Video Super-Resolution

Harnessing the Power of Text-image Contrastive Models for Automatic Detection of Online Misinformation

no code implementations19 Apr 2023 Hao Chen, Peng Zheng, Xin Wang, Shu Hu, Bin Zhu, Jinrong Hu, Xi Wu, Siwei Lyu

As growing usage of social media websites in the recent decades, the amount of news articles spreading online rapidly, resulting in an unprecedented scale of potentially fraudulent information.

Contrastive Learning Misinformation +1

Arbitrary Reduction of MRI Inter-slice Spacing Using Hierarchical Feature Conditional Diffusion

no code implementations16 Apr 2023 Xin Wang, Zhenrong Shen, Zhiyun Song, Sheng Wang, Mengjun Liu, Lichi Zhang, Kai Xuan, Qian Wang

Magnetic resonance (MR) images collected in 2D scanning protocols typically have large inter-slice spacing, resulting in high in-plane resolution but reduced through-plane resolution.

Super-Resolution

A Clustering Framework for Unsupervised and Semi-supervised New Intent Discovery

1 code implementation16 Apr 2023 Hanlei Zhang, Hua Xu, Xin Wang, Fei Long, Kai Gao

New intent discovery is of great value to natural language processing, allowing for a better understanding of user needs and providing friendly services.

Clustering Intent Discovery +3

CIMI4D: A Large Multimodal Climbing Motion Dataset under Human-scene Interactions

no code implementations CVPR 2023 Ming Yan, Xin Wang, Yudi Dai, Siqi Shen, Chenglu Wen, Lan Xu, Yuexin Ma, Cheng Wang

The core of this dataset is a blending optimization process, which corrects for the pose as it drifts and is affected by the magnetic conditions.

Pose Prediction

Domain Adaptive Semantic Segmentation by Optimal Transport

no code implementations29 Mar 2023 Yaqian Guo, Xin Wang, Ce Li, Shihui Ying

Second, we utilize OT to achieve a more robust alignment of source and target domains in output space, where the OT plan defines a well attention mechanism to improve the adaptation of the model.

Autonomous Driving Domain Adaptation +2

Top-Down Visual Attention from Analysis by Synthesis

1 code implementation CVPR 2023 Baifeng Shi, Trevor Darrell, Xin Wang

In this paper, we consider top-down attention from a classic Analysis-by-Synthesis (AbS) perspective of vision.

Retrieval Semantic Segmentation +1

Damage detection of high-speed railway box girder using train-induced dynamic responses

no code implementations23 Mar 2023 Xin Wang, Yi Zhuo, Shunlong Li

This paper proposes a damage detection method based on the train-induced responses of high-speed railway box girder.

A Survey of Graph Prompting Methods: Techniques, Applications, and Challenges

no code implementations13 Mar 2023 Xuansheng Wu, Kaixiong Zhou, Mingchen Sun, Xin Wang, Ninghao Liu

In particular, we introduce the basic concepts of graph prompt learning, organize the existing work of designing graph prompting functions, and describe their applications and future challenges.

Optimal Beamforming for MIMO DFRC Systems with Transmit Covariance Constraints

no code implementations6 Mar 2023 Chenhao Yang, Xin Wang, Wei Ni, Yi Jiang

Under this approach, we reveal that the optimal receive beamforming is given by the classic MMSE one and the optimal transmit beamforming design amounts to solving an orthogonal Procrustes problem, thereby allowing for closed-form solutions to subproblems in each BCD step and fast convergence of the proposed algorithm to a high-quality (near-optimal) overall beamforming design.

Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching

no code implementations1 Mar 2023 Zheng Li, Caili Guo, Xin Wang, Zerun Feng, Zhongtian Du

To alleviate the gradient vanishing problem, we propose a Selectively Hard Negative Mining (SelHN) strategy, which chooses whether to mine hard negative samples according to the gradient vanishing condition.

Image-text matching Text Matching

RIS-Assisted Jamming Rejection and Path Planning for UAV-Borne IoT Platform: A New Deep Reinforcement Learning Framework

no code implementations10 Feb 2023 Shuyan Hu, Xin Yuan, Wei Ni, Xin Wang, Abbas Jamalipour

This paper presents a new deep reinforcement learning (DRL)-based approach to the trajectory planning and jamming rejection of an unmanned aerial vehicle (UAV) for the Internet-of-Things (IoT) applications.

Trajectory Planning

Unsupervised Deep Learning for IoT Time Series

no code implementations7 Feb 2023 Ya Liu, Yingjie Zhou, Kai Yang, Xin Wang

IoT time series analysis has found numerous applications in a wide variety of areas, ranging from health informatics to network security.

Clustering Representation Learning +3

Curriculum Graph Machine Learning: A Survey

no code implementations6 Feb 2023 Haoyang Li, Xin Wang, Wenwu Zhu

To the best of our knowledge, this paper is the first survey for curriculum graph machine learning.

Model Optimization

IMPORTANT-Net: Integrated MRI Multi-Parameter Reinforcement Fusion Generator with Attention Network for Synthesizing Absent Data

1 code implementation3 Feb 2023 Tianyu Zhang, Tao Tan, Luyi Han, Xin Wang, Yuan Gao, Jonas Teuwen, Regina Beets-Tan, Ritse Mann

Then the multi-parameter fusion with attention module enables the interaction of the encoded information from different parameters through a set of algorithmic strategies, and applies different weights to the information through the attention mechanism after information fusion to obtain refined representation information.

Lesion Classification Lesion Detection

Synthesis-based Imaging-Differentiation Representation Learning for Multi-Sequence 3D/4D MRI

1 code implementation1 Feb 2023 Luyi Han, Tao Tan, Tianyu Zhang, Yunzhi Huang, Xin Wang, Yuan Gao, Jonas Teuwen, Ritse Mann

Multi-sequence MRIs can be necessary for reliable diagnosis in clinical practice due to the complimentary information within sequences.

Representation Learning

The Power of External Memory in Increasing Predictive Model Capacity

no code implementations31 Jan 2023 Cenk Baykal, Dylan J Cutler, Nishanth Dikkala, Nikhil Ghosh, Rina Panigrahy, Xin Wang

One way of introducing sparsity into deep networks is by attaching an external table of parameters that is sparsely looked up at different layers of the network.

Language Modelling

Attacking Important Pixels for Anchor-free Detectors

no code implementations26 Jan 2023 Yunxu Xie, Shu Hu, Xin Wang, Quanyu Liao, Bin Zhu, Xi Wu, Siwei Lyu

Existing adversarial attacks on object detection focus on attacking anchor-based detectors, which may not work well for anchor-free detectors.

Adversarial Attack object-detection +2

HDG-ODE: A Hierarchical Continuous-Time Model for Human Pose Forecasting

1 code implementation ICCV 2023 Yucheng Xing, Xin Wang

Considering the structural-property of the skeleton data in representing human poses and the possible irregularity caused by occlusion, we propose the use of dynamic graph convolution as the basic operator.

Human Pose Forecasting

Decouple Before Interact: Multi-Modal Prompt Learning for Continual Visual Question Answering

no code implementations ICCV 2023 Zi Qian, Xin Wang, Xuguang Duan, Pengda Qin, Yuhong Li, Wenwu Zhu

Based on our formulation, we further propose MulTi-Modal PRompt LearnIng with DecouPLing bEfore InTeraction (TRIPLET), a novel approach that builds on a pre-trained vision-language model and consists of decoupled prompts and prompt interaction strategies to capture the complex interactions between modalities.

Continual Learning Language Modelling +2

You Do Not Need Additional Priors or Regularizers in Retinex-Based Low-Light Image Enhancement

no code implementations CVPR 2023 Huiyuan Fu, Wenkai Zheng, Xiangyu Meng, Xin Wang, Chuanming Wang, Huadong Ma

The Retinex-based methods require decomposing the image into reflectance and illumination components, which is a highly ill-posed problem and there is no available ground truth.

Contrastive Learning Low-Light Image Enhancement +1

Doubly Right Object Recognition: A Why Prompt for Visual Rationales

1 code implementation CVPR 2023 Chengzhi Mao, Revant Teotia, Amrutha Sundar, Sachit Menon, Junfeng Yang, Xin Wang, Carl Vondrick

We propose a ``doubly right'' object recognition benchmark, where the metric requires the model to simultaneously produce both the right labels as well as the right rationales.

Object Recognition

Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline

1 code implementation29 Nov 2022 Paul-Gauthier Noé, Xiaoxiao Miao, Xin Wang, Junichi Yamagishi, Jean-François Bonastre, Driss Matrouf

The use of modern vocoders in an analysis/synthesis pipeline allows us to investigate high-quality voice conversion that can be used for privacy purposes.

Voice Conversion

Disentangled Representation Learning

no code implementations21 Nov 2022 Xin Wang, Hong Chen, Si'ao Tang, Zihao Wu, Wenwu Zhu

Disentangled Representation Learning (DRL) aims to learn a model capable of identifying and disentangling the underlying factors hidden in the observable data in representation form.

Representation Learning

FedSiam-DA: Dual-aggregated Federated Learning via Siamese Network under Non-IID Data

no code implementations17 Nov 2022 Ming Yang, Yanhan Wang, Xin Wang, Zhenyong Zhang, Xiaoming Wu, Peng Cheng

Federated learning is a distributed learning that allows each client to keep the original data locally and only upload the parameters of the local model to the server.

Contrastive Learning Federated Learning

Super-resolution Reconstruction of Single Image for Latent features

no code implementations16 Nov 2022 Xin Wang, Jing-Ke Yan, Jing-Ye Cai, Jian-Hua Deng, Qin Qin, Yao Cheng

Single-image super-resolution (SISR) typically focuses on restoring various degraded low-resolution (LR) images to a single high-resolution (HR) image.

Denoising Image Reconstruction +2

Shared Loss between Generators of GANs

no code implementations14 Nov 2022 Xin Wang

Traditional GANs fall prey to the mode collapse problem, which means that they are unable to generate the different variations of data present in the input dataset.

LiDAL: Inter-frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation

1 code implementation11 Nov 2022 Zeyu Hu, Xuyang Bai, Runze Zhang, Xin Wang, Guangyuan Sun, Hongbo Fu, Chiew-Lan Tai

We propose LiDAL, a novel active learning method for 3D LiDAR semantic segmentation by exploiting inter-frame uncertainty among LiDAR frames.

Active Learning LIDAR Semantic Segmentation +1

Large Language Models with Controllable Working Memory

no code implementations9 Nov 2022 Daliang Li, Ankit Singh Rawat, Manzil Zaheer, Xin Wang, Michal Lukasik, Andreas Veit, Felix Yu, Sanjiv Kumar

By contrast, when the context is irrelevant to the task, the model should ignore it and fall back on its internal knowledge.

counterfactual World Knowledge

Lightweight Neural Network with Knowledge Distillation for CSI Feedback

no code implementations31 Oct 2022 Yiming Cui, Jiajia Guo, Zheng Cao, Huaze Tang, Chao-Kai Wen, Shi Jin, Xin Wang, Xiaolin Hou

Firstly, an autoencoder KD-based method is introduced by training a student autoencoder to mimic the reconstructed CSI of a pretrained teacher autoencoder.

Knowledge Distillation

Detection of Real-time DeepFakes in Video Conferencing with Active Probing and Corneal Reflection

no code implementations21 Oct 2022 Hui Guo, Xin Wang, Siwei Lyu

Specifically, we authenticate video calls by displaying a distinct pattern on the screen and using the corneal reflection extracted from the images of the call participant's face.

Spoofed training data for speech spoofing countermeasure can be efficiently created using neural vocoders

1 code implementation19 Oct 2022 Xin Wang, Junichi Yamagishi

To make better use of pairs of bona fide and spoofed data, this study introduces a contrastive feature loss that can be plugged into the standard training criterion.

InFIP: An Explainable DNN Intellectual Property Protection Method based on Intrinsic Features

no code implementations14 Oct 2022 Mingfu Xue, Xin Wang, Yinghao Wu, Shifeng Ni, Yushu Zhang, Weiqiang Liu

Since the intrinsic feature is composed of unique interpretation of the model's decision, the intrinsic feature can be regarded as fingerprint of the model.

Explainable artificial intelligence

GGViT:Multistream Vision Transformer Network in Face2Face Facial Reenactment Detection

no code implementations12 Oct 2022 Haotian Wu, Peipei Wang, Xin Wang, Ji Xiang, Rui Gong

The compression of videos on social media has destroyed some pixel details that could be used to detect forgeries.

3D Matting: A Benchmark Study on Soft Segmentation Method for Pulmonary Nodules Applied in Computed Tomography

no code implementations11 Oct 2022 Lin Wang, Xiufen Ye, Donghao Zhang, Wanji He, Lie Ju, Yi Luo, Huan Luo, Xin Wang, Wei Feng, Kaimin Song, Xin Zhao, ZongYuan Ge

In this work, we introduce the image matting into the 3D scenes and use the alpha matte, i. e., a soft mask, to describe lesions in a 3D medical image.

Binarization Image Matting

Block Format Error Bounds and Optimal Block Size Selection

no code implementations11 Oct 2022 Ilya Soloveychik, Ilya Lyubomirsky, Xin Wang, Sudeep Bhoja

This measure allows us to determine the optimal parameters, such as the block size, yielding highest accuracy.

STSyn: Speeding Up Local SGD with Straggler-Tolerant Synchronization

no code implementations6 Oct 2022 Feng Zhu, Jingjing Zhang, Xin Wang

Synchronous local stochastic gradient descent (local SGD) suffers from some workers being idle and random delays due to slow and straggling workers, as it waits for the workers to complete the same amount of local updates.

Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval

no code implementations28 Sep 2022 Zheng Li, Caili Guo, Xin Wang, Zerun Feng, Jenq-Neng Hwang, Zhongtian Du

More specifically, Triplet loss with Hard Negative mining (Triplet-HN), which is widely used in existing retrieval models to improve the discriminative ability, is easy to fall into local minima in training.

Contrastive Learning Retrieval +2

3D Matting: A Soft Segmentation Method Applied in Computed Tomography

no code implementations16 Sep 2022 Lin Wang, Xiufen Ye, Donghao Zhang, Wanji He, Lie Ju, Xin Wang, Wei Feng, Kaimin Song, Xin Zhao, ZongYuan Ge

It can be caused by many factors, such as the imaging properties, pathological anatomy, and the weak representation of the binary masks, which brings challenges to accurate 3D segmentation.

Anatomy Image Matting

Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances

no code implementations1 Sep 2022 Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi

Conventional automatic speaker verification systems can usually be decomposed into a front-end model such as time delay neural network (TDNN) for extracting speaker embeddings and a back-end model such as statistics-based probabilistic linear discriminant analysis (PLDA) or neural network-based neural PLDA (NPLDA) for similarity scoring.

Data Augmentation Speaker Verification

NeurIPS'22 Cross-Domain MetaDL competition: Design and baseline results

no code implementations31 Aug 2022 Dustin Carrión-Ojeda, Hong Chen, Adrian El Baz, Sergio Escalera, Chaoyu Guan, Isabelle Guyon, Ihsan Ullah, Xin Wang, Wenwu Zhu

We present the design and baseline results for a new challenge in the ChaLearn meta-learning series, accepted at NeurIPS'22, focusing on "cross-domain" meta-learning.

Few-Shot Image Classification Few-Shot Learning +1

NL2GDPR: Automatically Develop GDPR Compliant Android Application Features from Natural Language

no code implementations29 Aug 2022 Faysal Hossain Shezan, Yingjie Lao, Minlong Peng, Xin Wang, Mingming Sun, Ping Li

At the core, NL2GDPR is a privacy-centric information extraction model, appended with a GDPR policy finder and a policy generator.

Data-Driven Control of Distributed Event-Triggered Network Systems

no code implementations22 Aug 2022 Xin Wang, Jian Sun, Gang Wang, Frank Allgöwer, Jie Chen

The present paper deals with data-driven event-triggered control of a class of unknown discrete-time interconnected systems (a. k. a.

Efficient Climate Simulation via Machine Learning Method

no code implementations15 Aug 2022 Xin Wang, Wei Xue, Yilun Han, Guangwen Yang

We develop a user-friendly platform NeuroGCM for efficiently developing hybrid modeling in climate simulation.

GPPT: Graph Pre-training and Prompt Tuning to Generalize Graph Neural Networks

1 code implementation SIGKDD 2022 Mingchen Sun, Kaixiong Zhou, Xin He, Ying Wang, Xin Wang

Based on the pre-trained model, we propose the graph prompting function to modify the standalone node into a token pair, and reformulate the downstream node classification looking the same as edge prediction.

Few-Shot Learning Node Classification +3

Revisiting Adversarial Attacks on Graph Neural Networks for Graph Classification

no code implementations13 Aug 2022 Xin Wang, Heng Chang, Beini Xie, Tian Bian, Shiji Zhou, Daixin Wang, Zhiqiang Zhang, Wenwu Zhu

Graph neural networks (GNNs) have achieved tremendous success in the task of graph classification and its diverse downstream real-world applications.

Graph Classification

A Theoretical View on Sparsely Activated Networks

no code implementations8 Aug 2022 Cenk Baykal, Nishanth Dikkala, Rina Panigrahy, Cyrus Rashtchian, Xin Wang

After representing LSH-based sparse networks with our model, we prove that sparse networks can match the approximation power of dense networks on Lipschitz functions.

Event-triggered Consensus Control of Heterogeneous Multi-agent Systems: Model- and Data-based Analysis

no code implementations1 Aug 2022 Xin Wang, Jian Sun, Gang Wang, Jie Chen

This article deals with model- and data-based consensus control of heterogenous leader-following multi-agent systems (MASs) under an event-triggering transmission scheme.

Trajectory Planning of Cellular-Connected UAV for Communication-assisted Radar Sensing

no code implementations27 Jul 2022 Shuyan Hu, Xin Yuan, Wei Ni, Xin Wang

Being a key technology for beyond fifth-generation wireless systems, joint communication and radar sensing (JCAS) utilizes the reflections of communication signals to detect foreign objects and deliver situational awareness.

Trajectory Planning

Proving Common Mechanisms Shared by Twelve Methods of Boosting Adversarial Transferability

no code implementations24 Jul 2022 Quanshi Zhang, Xin Wang, Jie Ren, Xu Cheng, Shuyun Lin, Yisen Wang, Xiangming Zhu

This paper summarizes the common mechanism shared by twelve previous transferability-boosting methods in a unified view, i. e., these methods all reduce game-theoretic interactions between regional adversarial perturbations.

PanGu-Coder: Program Synthesis with Function-Level Language Modeling

1 code implementation22 Jul 2022 Fenia Christopoulou, Gerasimos Lampouras, Milan Gritta, Guchun Zhang, Yinpeng Guo, Zhongqi Li, Qi Zhang, Meng Xiao, Bo Shen, Lin Li, Hao Yu, Li Yan, Pingyi Zhou, Xin Wang, Yuchi Ma, Ignacio Iacobacci, Yasheng Wang, Guangtai Liang, Jiansheng Wei, Xin Jiang, Qianxiang Wang, Qun Liu

We present PanGu-Coder, a pretrained decoder-only language model adopting the PanGu-Alpha architecture for text-to-code generation, i. e. the synthesis of programming language solutions given a natural language problem description.

Code Generation Language Modelling +2

Neural-Sim: Learning to Generate Training Data with NeRF

1 code implementation22 Jul 2022 Yunhao Ge, Harkirat Behl, Jiashu Xu, Suriya Gunasekar, Neel Joshi, Yale Song, Xin Wang, Laurent Itti, Vibhav Vineet

However, existing approaches either require human experts to manually tune each scene property or use automatic methods that provide little to no control; this requires rendering large amounts of random data variations, which is slow and is often suboptimal for the target domain.

Object Detection

Scene Recognition with Objectness, Attribute and Category Learning

no code implementations20 Jul 2022 Ji Zhang, Jean-Paul Ainam, Li-hui Zhao, Wenai Song, Xin Wang

Based on the complementarity of attribute and category labels, we propose a Multi-task Attribute-Scene Recognition (MASR) network which learns a category embedding and at the same time predicts scene attributes.

Attribute Scene Classification +1

Rank-based Decomposable Losses in Machine Learning: A Survey

no code implementations18 Jul 2022 Shu Hu, Xin Wang, Siwei Lyu

Following these categories, we review the literature on rank-based aggregate losses and rank-based individual losses.

BIG-bench Machine Learning

Scaling Novel Object Detection with Weakly Supervised Detection Transformers

1 code implementation11 Jul 2022 Tyler LaBonte, Yale Song, Xin Wang, Vibhav Vineet, Neel Joshi

A critical object detection task is finetuning an existing model to detect novel objects, but the standard workflow requires bounding box annotations which are time-consuming and expensive to collect.

Multiple Instance Learning Novel Object Detection +4

Unsupervised Domain Adaptive Fundus Image Segmentation with Category-level Regularization

1 code implementation8 Jul 2022 Wei Feng, Lin Wang, Lie Ju, Xin Zhao, Xin Wang, Xiaoyu Shi, ZongYuan Ge

Existing unsupervised domain adaptation methods based on adversarial learning have achieved good performance in several medical imaging tasks.

Image Segmentation Semantic Segmentation +1

Enhanced brain structure-function tethering in transmodal cortex revealed by high-frequency eigenmodes

no code implementations7 Jul 2022 Yaqian Yang, Zhiming Zheng, Longzhao Liu, Hongwei Zheng, Yi Zhen, Yi Zheng, Xin Wang, Shaoting Tang

Specifically, low-frequency eigenmodes, which are considered sufficient to capture the essence of the functional network, contribute little to functional connectivity reconstruction in transmodal regions, resulting in structure-function decoupling along the unimodal-transmodal gradient.

NAS-Bench-Graph: Benchmarking Graph Neural Architecture Search

1 code implementation18 Jun 2022 Yijian Qin, Ziwei Zhang, Xin Wang, Zeyang Zhang, Wenwu Zhu

To the best of our knowledge, our work is the first benchmark for graph neural architecture search.

Benchmarking Neural Architecture Search

Concentration of Data Encoding in Parameterized Quantum Circuits

no code implementations16 Jun 2022 Guangxi Li, Ruilin Ye, Xuanqiang Zhao, Xin Wang

This result in particular implies that the average encoded state will concentrate on the maximally mixed state at an exponential speed on depth.

Combinatorial Optimization

A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions

1 code implementation15 Jun 2022 Sheng Zhou, Hongjia Xu, Zhuonan Zheng, Jiawei Chen, Zhao Li, Jiajun Bu, Jia Wu, Xin Wang, Wenwu Zhu, Martin Ester

Motivated by the tremendous success of deep learning in clustering, one of the most fundamental machine learning tasks, and the large number of recent advances in this direction, in this paper we conduct a comprehensive survey on deep clustering by proposing a new taxonomy of different state-of-the-art approaches.

Clustering Deep Clustering +1

Deep Learning-based Massive MIMO CSI Acquisition for 5G Evolution and 6G

no code implementations10 Jun 2022 Xin Wang, Xiaolin Hou, Lan Chen, Yoshihisa Kishiyama, Takahiro Asai

Considering its large impact on air-interface design, it will be a candidate technology for 6th generation (6G) networks, in which an air interface designed by artificial intelligence can be used.

Mitigating barren plateaus of variational quantum eigensolvers

no code implementations26 May 2022 Xia Liu, Geng Liu, Jiaxin Huang, Hao-Kai Zhang, Xin Wang

Variational quantum algorithms (VQAs) are expected to establish valuable applications on near-term quantum computers.

Spatial Attention-based Implicit Neural Representation for Arbitrary Reduction of MRI Slice Spacing

no code implementations23 May 2022 Xin Wang, Sheng Wang, Honglin Xiong, Kai Xuan, Zixu Zhuang, Mengjun Liu, Zhenrong Shen, Xiangyu Zhao, Lichi Zhang, Qian Wang

Magnetic resonance (MR) images collected in 2D clinical protocols typically have large inter-slice spacing, resulting in high in-plane resolution and reduced through-plane resolution.

Computational Efficiency Super-Resolution

Power and limitations of single-qubit native quantum neural networks

no code implementations16 May 2022 Zhan Yu, Hongshun Yao, Mujin Li, Xin Wang

Quantum neural networks (QNNs) have emerged as a leading strategy to establish applications in machine learning, chemistry, and optimization.

The VoicePrivacy 2020 Challenge Evaluation Plan

1 code implementation14 May 2022 Natalia Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier Noé, Massimiliano Todisco

The VoicePrivacy Challenge aims to promote the development of privacy preservation tools for speech technology by gathering a new community to define the tasks of interest and the evaluation methodology, and benchmarking solutions through a series of challenges.

Benchmarking

Open-Eye: An Open Platform to Study Human Performance on Identifying AI-Synthesized Faces

no code implementations13 May 2022 Hui Guo, Shu Hu, Xin Wang, Ming-Ching Chang, Siwei Lyu

In this work, we develop an online platform called Open-eye to study the human performance of AI-synthesized face detection.

Face Detection

Quantum Self-Attention Neural Networks for Text Classification

1 code implementation11 May 2022 Guangxi Li, Xuanqiang Zhao, Xin Wang

An emerging direction of quantum computing is to establish meaningful quantum applications in various fields of artificial intelligence, including natural language processing (NLP).

text-classification Text Classification

Fundamental limitations on optimization in variational quantum algorithms

no code implementations10 May 2022 Hao-Kai Zhang, Chengkai Zhu, Geng Liu, Xin Wang

Exploring quantum applications of near-term quantum devices is a rapidly growing field of quantum information science with both theoretical and practical interests.

An Edge-Cloud Integrated Framework for Flexible and Dynamic Stream Analytics

no code implementations10 May 2022 Xin Wang, Azim Khan, Jianwu Wang, Aryya Gangopadhyay, Carl E. Busart, Jade Freeman

In this paper, we study how to best leverage edge and cloud resources to achieve better accuracy and latency for stream analytics using a type of RNN model called long short-term memory (LSTM).

Cloud Computing Edge-computing +3

CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training

no code implementations Findings (NAACL) 2022 Xin Wang, Yasheng Wang, Yao Wan, Jiawei Wang, Pingyi Zhou, Li Li, Hao Wu, Jin Liu

Specifically, we first extract multiple code views using compiler tools, and learn the complementary information among them under a contrastive learning framework.

Contrastive Learning Defect Detection +2

Cannot find the paper you are looking for? You can Submit a new open access paper.