1 code implementation • 23 Apr 2024 • Xuanhua He, Quande Liu, Shengju Qian, Xin Wang, Tao Hu, Ke Cao, Keyu Yan, Man Zhou, Jie Zhang
Based on this pipeline, a random face reference training method is further devised to precisely capture the ID-relevant embeddings from reference images, thus improving the fidelity and generalization capacity of our model for ID-specific video generation.
no code implementations • 22 Apr 2024 • Tao Hu, Wenhang Ge, Yuyang Zhao, Gim Hee Lee
In this paper, we introduce X-Ray, an innovative approach to 3D generation that employs a new sequential representation, drawing inspiration from the depth-revealing capabilities of X-Ray scans to meticulously capture both the external and internal features of objects.
1 code implementation • 22 Apr 2024 • Kangzhen Yang, Tao Hu, Kexin Dai, Genggeng Chen, Yu Cao, Wei Dong, Peng Wu, Yanning Zhang, Qingsen Yan
In real-world scenarios, images captured often suffer from blurring, noise, and other forms of image degradation, and due to sensor limitations, people usually can only obtain low dynamic range images.
1 code implementation • 21 Apr 2024 • Genggeng Chen, Kexin Dai, Kangzhen Yang, Tao Hu, Xiangyu Chen, Yongqing Yang, Wei Dong, Peng Wu, Yanning Zhang, Qingsen Yan
Specifically, we employ two modules for feature extraction: shared weight modules and non-shared weight modules.
no code implementations • 2 Apr 2024 • Tao Hu, Fangzhou Hong, Zhaoxi Chen, Ziwei Liu
FashionEngine automates the 3D human production with three key components: 1) A pre-trained 3D human diffusion model that learns to model 3D humans in a semantic UV latent space from 2D image training data, which provides strong priors for diverse generation and editing tasks.
no code implementations • 1 Apr 2024 • Tao Hu, Fangzhou Hong, Ziwei Liu
2) Physical motion decoding that is designed to encourage physical motion learning by decoding the motion triplane features at timestep t to predict both spatial derivatives and temporal derivatives at the next timestep t+1 in the training stage.
no code implementations • 1 Apr 2024 • Tao Hu, Fangzhou Hong, Ziwei Liu
2) A structured 3D-aware auto-decoder that factorizes the global latent space into several semantic body parts parameterized by a set of conditional structured local NeRFs anchored to the body template, which embeds the properties learned from the 2D training data and can be decoded to render view-consistent humans under different poses and clothing styles.
no code implementations • 1 Apr 2024 • Tao Hu, Qingsen Yan, Yuankai Qi, Yanning Zhang
To address this challenge, we propose the Low-Frequency aware Diffusion (LF-Diff) model for ghost-free HDR imaging.
no code implementations • 11 Mar 2024 • Xiaogang Xu, Shu Kong, Tao Hu, Zhe Liu, Hujun Bao
Pre-trained models with large-scale training data, such as CLIP and Stable Diffusion, have demonstrated remarkable performance in various high-level computer vision tasks such as image understanding and generation from language descriptions.
no code implementations • 30 Jan 2024 • Hao Wu, Yingnan Song, Ammar Hoori, Ananya Subramaniam, Juhwan Lee, Justin Kim, Tao Hu, Sadeer Al-Kindi, Wei-Ming Huang, Chun-Ho Yun, Chung-Lieh Hung, Sanjay Rajagopalan, David L. Wilson
CCTA in conjunction with a new automated quantitative CCTP approach can augment the interpretation of CAD, enabling the distinction of ischemia due to obstructive lesions and MVD.
no code implementations • 29 Jan 2024 • Tao Hu, Joshua Freeze, Prerna Singh, Justin Kim, Yingnan Song, Hao Wu, Juhwan Lee, Sadeer Al-Kindi, Sanjay Rajagopalan, David L. Wilson, Ammar Hoori
Background: Recent studies have used basic epicardial adipose tissue (EAT) assessments (e. g., volume and mean HU) to predict risk of atherosclerosis-related, major adverse cardiovascular events (MACE).
no code implementations • 28 Jan 2024 • Yingnan Song, Hao Wu, Juhwan Lee, Justin Kim, Ammar Hoori, Tao Hu, Vladislav Zimin, Mohamed Makhlouf, Sadeer Al-Kindi, Sanjay Rajagopalan, Chun-Ho Yun, Chung-Lieh Hung, David L. Wilson
Preliminarily, PCAT features can be assessed from three main coronary arteries in non-contrast CTCS images with performance characteristics that are at the very least comparable to CCTA.
1 code implementation • 4 Jan 2024 • Xuanhua He, Tao Hu, Guoli Wang, Zejin Wang, Run Wang, Qian Zhang, Keyu Yan, Ziyi Chen, Rui Li, Chenjun Xie, Jie Zhang, Man Zhou
However, current methods often ignore the difference between cell phone RAW images and DSLR camera RGB images, a difference that goes beyond the color matrix and extends to spatial structure due to resolution variations.
no code implementations • 14 Dec 2023 • Tao Hu, Honglong Zhang, Fan Zeng, Min Du, XiangKun Du, Yue Zheng, Quanqi Li, Mengran Zhang, Dan Yang, Jihao Wu
However, temporal and spatial dimensions are extremely critical in the logistics field, and this limitation may directly affect the precision of subsidy and pricing strategies.
1 code implementation • 23 Nov 2023 • Tao Hu, William Thong, Pascal Mettes, Cees G. M. Snoek
In this paper, we propose a visual-semantic embedding network that explicitly deals with the imbalanced scenario for activity retrieval.
no code implementations • 2 Nov 2023 • Qingsen Yan, Tao Hu, Yuan Sun, Hao Tang, Yu Zhu, Wei Dong, Luc van Gool, Yanning Zhang
To address this challenge, we formulate the HDR deghosting problem as an image generation that leverages LDR features as the diffusion model's condition, consisting of the feature condition generator and the noise predictor.
no code implementations • 9 Sep 2023 • Wenjing Xie, Tao Hu, Neiwen Ling, Guoliang Xing, Shaoshan Liu, Nan Guan
Surround Radar/Lidar can provide 360-degree view sampling with the minimal cost, which are promising sensing hardware solutions for autonomous driving systems.
no code implementations • 23 Aug 2023 • Ammar Hoori, Sadeer Al-Kindi, Tao Hu, Yingnan Song, Hao Wu, Juhwan Lee, Nour Tashtish, Pingfu Fu, Robert Gilkeson, Sanjay Rajagopalan, David L. Wilson
We used a Cox model with elastic-net regularization on 2457 CT calcium score (CTCS) enriched for MACE events obtained from a large no-cost CLARIFY program (ClinicalTri-als. gov Identifier: NCT04075162).
no code implementations • 18 Aug 2023 • Shoukang Hu, Fangzhou Hong, Tao Hu, Liang Pan, Haiyi Mei, Weiye Xiao, Lei Yang, Ziwei Liu
In this work, we propose HumanLiff, the first layer-wise 3D human generative model with a unified diffusion process.
no code implementations • 27 Jun 2023 • Hao Wu, Yingnan Song, Ammar Hoori, Ananya Subramaniam, Juhwan Lee, Justin Kim, Tao Hu, Sadeer Al-Kindi, Wei-Ming Huang, Chun-Ho Yun, Chung-Lieh Hung, Sanjay Rajagopalan, David L. Wilson
HU, blood flow, and radiomics were assessed over time.
no code implementations • 22 Apr 2023 • Shaoteng Liu, Xiangyu Zhang, Tao Hu, Jiaya Jia
In each iteration, the input to VSA is one view (or multiple views) of a 3D object and the output is a synthesized image in another target pose.
no code implementations • CVPR 2023 • Tao Hu, Xiaogang Xu, Shu Liu, Jiaya Jia
Also, we present Point Encoding to build Multi-scale Radiance Fields that provide discriminative 3D point features.
1 code implementation • CVPR 2023 • Tao Hu, Xiaogang Xu, Ruihang Chu, Jiaya Jia
However, artifacts still appear in rendered images, due to the challenges in extracting continuous and discriminative 3D features from point clouds.
1 code implementation • ICCV 2023 • Wenhang Ge, Tao Hu, Haoyu Zhao, Shu Liu, Ying-Cong Chen
We show that together with a reflection direction-dependent radiance, our model achieves high-quality surface reconstruction on reflective surfaces and outperforms the state-of-the-arts by a large margin.
1 code implementation • CVPR 2023 • Kun Zhou, Wenbo Li, Yi Wang, Tao Hu, Nianjuan Jiang, Xiaoguang Han, Jiangbo Lu
Neural radiance fields (NeRF) show great success in novel view synthesis.
Ranked #1 on Novel View Synthesis on LLFF
1 code implementation • 2 Jun 2022 • Tao Hu, Shu Liu, Yilun Chen, Tiancheng Shen, Jiaya Jia
Neural Radiance Fields (NeRF) has been wildly applied to various tasks for its high-quality representation of 3D scenes.
no code implementations • CVPR 2022 • Desen Zhou, Zhichao Liu, Jian Wang, Leshan Wang, Tao Hu, Errui Ding, Jingdong Wang
To associate the predictions of disentangled decoders, we first generate a unified representation for HOI triplets with a base decoder, and then utilize it as input feature of each disentangled decoder.
3 code implementations • CVPR 2022 • Qiang Chen, Qiman Wu, Jian Wang, Qinghao Hu, Tao Hu, Errui Ding, Jian Cheng, Jingdong Wang
We propose MixFormer to find a solution.
no code implementations • CVPR 2022 • Tao Hu, Shu Liu, Yilun Chen, Tiancheng Shen, Jiaya Jia
Neural Radiance Fields (NeRF) has been wildly applied to various tasks for its high-quality representation of 3D scenes.
no code implementations • 19 Dec 2021 • Tao Hu, Tao Yu, Zerong Zheng, He Zhang, Yebin Liu, Matthias Zwicker
To handle complicated motions (e. g., self-occlusions), we then leverage the encoded information on the UV manifold to construct a 3D volumetric representation based on a dynamic pose-conditioned neural radiance field.
1 code implementation • CVPR 2022 • Binglu Wang, Tao Hu, Baoshan Li, Xiaojuan Chen, Zhijie Zhang
In this paper, we build a novel framework named GaTector to tackle the gaze object prediction problem in a unified way.
no code implementations • ICCV 2021 • Tao Hu, Kripasindhu Sarkar, Lingjie Liu, Matthias Zwicker, Christian Theobalt
We next combine the target pose image and the textures into a combined feature image, which is transformed into the output color image using a neural image translation network.
no code implementations • 28 Jul 2021 • Tao Hu, Chengjiang Long, Chunxia Xiao
Based on those constraints, a category-consistent and relativistic diverse conditional GAN (CRD-CGAN) is proposed to synthesize $K$ photo-realistic images simultaneously.
no code implementations • CVPR 2021 • Tao Hu, LiWei Wang, Xiaogang Xu, Shu Liu, Jiaya Jia
Recent single-view 3D reconstruction methods reconstruct object's shape and texture from a single image with only 2D image-level annotation.
no code implementations • 10 Nov 2020 • Peixiao Wang, Tao Hu, Hongqiang Liu, Xinyan Zhu
Therefore, in this paper, a novel framework was proposed to explore the impact of under-reporting on COVID-19 spatiotemporal distributions, and empirical analysis was carried out using infection data of healthcare workers in Wuhan and Hubei (excluding Wuhan).
no code implementations • 5 Mar 2020 • Mengxiao Hu, Jinlong Li, Maolin Hu, Tao Hu
In conditional Generative Adversarial Networks (cGANs), when two different initial noises are concatenated with the same conditional information, the distance between their outputs is relatively smaller, which makes minor modes likely to collapse into large modes.
no code implementations • 4 Mar 2020 • Tao Hu, Lichao Huang, Han Shen
Recent works in multiple object tracking use sequence model to calculate the similarity score between the detections and the previous tracklets.
1 code implementation • 22 Dec 2019 • Tao Hu, Geng Lin, Zhizhong Han, Matthias Zwicker
In this paper, we propose a novel approach for reconstructing point clouds from RGB images.
1 code implementation • 28 Nov 2019 • Tao Hu, Zhizhong Han, Matthias Zwicker
We formulate the regularization term as a consistency loss that encourages geometric consistency among multiple views, while the data term guarantees that the optimized views do not drift away too much from a learned shape descriptor.
no code implementations • ICCV 2019 • Tao Hu, Pascal Mettes, Jia-Hong Huang, Cees G. M. Snoek
To that end, we introduce a spatial similarity module that searches the spatial commonality among the given images.
no code implementations • 27 Sep 2019 • Ruisen Luo, Tao Hu, Zuodong Tang, Chen Wang, Xiaofeng Gong, Haiyan Tu
To solve the problem of inaccurate recognition of types of communication signal modulation, a RNN neural network recognition algorithm combining residual block network with attention mechanism is proposed.
1 code implementation • 2 Aug 2019 • Tao Hu, Lichao Huang, Xian-Ming Liu, Han Shen
Our tracker achieves leading performance in OTB2013, OTB2015, VOT2015, VOT2016 and LaSOT, and operates at a real-time speed of 26 FPS, which indicates our method is effective and practical.
no code implementations • 26 Jul 2019 • Tao Hu, Chengjiang Long, Leheng Zhang, Chunxia Xiao
In this paper, we propose a novel way to interpret text information by extracting visual feature presentation from multiple high-resolution and photo-realistic synthetic images generated by Text-to-image Generative Adversarial Network (GAN) to improve the performance of image labeling.
no code implementations • Proceedings of the AAAI Conference on Artificial Intelligence 2019 • Tao Hu, Pengwan Yang, Chiliang Zhang, Gang Yu, Yadong Mu, Cees G. M. Snoek
Few-shot learning is a nascent research topic, motivated by the fact that traditional deep learning methods require tremen- dous amounts of data.
Ranked #1 on Few-Shot Semantic Segmentation on Pascal5i
no code implementations • 17 Apr 2019 • Tao Hu, Zhizhong Han, Abhinav Shrivastava, Matthias Zwicker
Different from image-to-image translation network that completes each view separately, our novel network, multi-view completion net (MVCN), leverages information from all views of a 3D shape to help the completion of each single view.
4 code implementations • 26 Jan 2019 • Tao Hu, Honggang Qi, Qingming Huang, Yan Lu
Specifically, for each training image, we first generate attention maps to represent the object's discriminative parts by weakly supervised learning.
Ranked #12 on Fine-Grained Image Classification on CUB-200-2011
no code implementations • 6 Aug 2018 • Tao Hu, Jizheng Xu, Cong Huang, Honggang Qi, Qingming Huang, Yan Lu
Besides, we propose attention regularization and attention dropout to weakly supervise the generating process of attention maps.
no code implementations • 18 Mar 2018 • Tao Hu, Honggang Qi, Jizheng Xu, Qingming Huang
Only one self-iterative regressor is trained to learn the descent directions for samples from coarse stages to fine stages, and parameters are iteratively updated by the same regressor.
Ranked #16 on Face Alignment on 300W (NME_inter-pupil (%, Common) metric)
no code implementations • 2 Mar 2015 • Cengiz Pehlevan, Tao Hu, Dmitri B. Chklovskii
Such networks learn the principal subspace, in the sense of principal component analysis (PCA), by adjusting synaptic weights according to activity-dependent learning rules.
no code implementations • 2 Mar 2015 • Tao Hu, Cengiz Pehlevan, Dmitri B. Chklovskii
Here, to overcome this problem, we derive sparse dictionary learning from a novel cost-function - a regularized error of the symmetric factorization of the input's similarity matrix.
no code implementations • 28 Aug 2014 • Chao Zhang, DaCheng Tao, Tao Hu, Xiang Li
We are mainly concerned with two theoretical questions: 1) under what conditions does RMTL perform better with a smaller task sample size than STL?
no code implementations • 12 May 2014 • Tao Hu, Zaid J. Towfic, Cengiz Pehlevan, Alex Genkin, Dmitri B. Chklovskii
Here we propose to view a neuron as a signal processing device that represents the incoming streaming data matrix as a sparse vector of synaptic weights scaled by an outgoing sparse activity vector.
no code implementations • NeurIPS 2012 • Shaul Druckmann, Tao Hu, Dmitri B. Chklovskii
However, feedback inhibitory circuits are common in early sensory circuits and furthermore their dynamics may be nonlinear.
no code implementations • NeurIPS 2009 • Tao Hu, Anthony Leonardo, Dmitri B. Chklovskii
One of the central problems in neuroscience is reconstructing synaptic connectivity in neural circuits.