Table of Contents

Interests

Generally speaking, I am conducting researches on modeling and emulating the visual processing mechanism of human with computer algorithms, as well as practical problems involving such techniques, e.g. biometrics, intelligent Human-Computer Interaction and surveillance. Research areas that interest me include:

Projects

SODA-Boosting and Gender Recognition

SODA-Boosting and Gender Recognition We proposed a novel boosting family classification algorithm called SODA-Boosting (where SODA stands for Second Order Discriminant Analysis). SODA-Boosting aims at efficiently learning discriminative weak classifiers, based on linear features that can be computed in closed-form. It extends the idea of our MRC-Boosting algorithm and can serve as a generic binary classifier.

As an application, SODA-Boosting was employed in image based gender recognition. Experimental results on publicly available FERET database showed that SODA-Boosting achieved accuracy comparable to state-of-the-art approaches, and demonstrated superior performance compared to relevant boosting based algorithms. The algorithm has been integrated into our facial recognition software, capable of classifying gender in real-time from webcam feeds.

more...

Face Recognition with MRC-Boosting

Face Recognition with MRC-Boosting We show that face recognition can be modeled as a special two-class problem, namely “target detection”, where a target class should be discriminated from the surrounding clutter class. A classification algorithm called MRC-Boosting is proposed to attack the face recognition problem based on such motivation. Unlike conventional boosting approaches widely employed in the computer vision community, MRC-Boosting is computationally efficient as at each iteration the optimal feature is computed in closed-form, requiring neither exhaustive search nor time-consuming numerical optimization. Moreover, we show that MRC-Boosting is especially efficient for learning a face recognizer. As a result this algorithm provides a promising solution to face recognition, not only effective in handling large intra-personal variations (e.g. pose and lighting), but also able to efficiently learn from a large amount of training samples.

Loopy BP for Background Estimation

Loopy BP for Background Estimation Background estimation, i.e. automatic recovery of the background image from a sequence of images containing moving foreground objects, is an important module in many applications, e.g. surveillance and video segmentation. We show that background estimation can be modeled as a low level vision problem, formulated under the energy minimization framework, and solved with Loopy Belief Propagation. This leads to a simple yet effective approach for background estimation. The background can be robustly recovered even when the occluding foreground objects stay still for a long time. Furthermore, no motion information needs to be known or estimated for the foreground objects, implying that background can be recovered from a set of frames which are not consecutive temporally.

Peer Aware Silence Suppression

Peer Aware Silence Suppression A novel tandem-free solution for multiparty VoIP conferences called PASS (Peer-Aware Silence Suppression) is proposed. In contrast to conventional tandem-free solutions, PASS performs silence suppression and speaker selection in a completely distributed fashion. This configuration leads to better scalability, lower bandwidth occupation and jitter buffer delay, and higher compatibility with a wide variety of network topologies. Moreover, a novel algorithm is devised for robust silence suppression. Based on machine learning techniques, this approach reliably measures true voice activity even under complex environmental noises, resulting in accurate and transparent speaker selection.

Older Projects

Click here to see...

Publications

  • X. Xu and T. S. Huang, "A Loopy Belief Propagation Approach for Robust Background Estimation," in Proc. 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), Anchorage, Alaska, 2008, (acceptance rate 31.9%)[BibTeX][doi>][Download]
  • Y. Hu, Z. Zhang, X. Xu, Y. Fu, and T. S. Huang, "Building Large Scale 3D Face Database for Face Analysis," in Proc. MCAM 2007: International Workshop on Multimedia Content Analysis and Mining, Weihai, China, 2007, pp. 343–350.[BibTeX]
  • M. Liu, X. Xu, and T. S. Huang, "Audio-Visual Gender Recognition," in Proc. MIPPR'07: The Fourth International Symposium on Multispectral Image Processing and Pattern Recognition, Wuhan, China, 2007, (invited paper)[BibTeX]
  • X. Xu and T. S. Huang, "SODA-Boosting and Its Application to Gender Recognition," in Proc. LNCS 4778: 2007 IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG), in conjunction with ICCV, Rio de Janeiro, Brazil, 2007, pp. 193–204. (oral presentation, acceptance rate 15%)[BibTeX][doi>][Download]
  • X. Xu, L. He, D. Florêncio, and Y. Rui, "PASS: Peer-Aware Silence Suppression for Internet Voice Conferences," in Proc. 2006 IEEE International Conference on Multimedia & Expo (ICME 2006), 2006, (oral presentation, acceptance rate 22%)[BibTeX][doi>][Download]
  • J. Tu, A. Ivanovic, X. Xu, F. Li, and T. S. Huang, "Variational Shift Invariant Probabilistic PCA for Face Recognition," in Proc. 18th International Conference on Pattern Recognition (ICPR 2006), 2006,[BibTeX]
  • X. Xu, Y. Rui, and T. S. Huang, "Recognizing Faces in Recorded Meetings via MRC-Boosting," in Proc. 2006 IEEE International Conference on Multimedia & Expo (ICME 2006), 2006, (oral presentation, acceptance rate 22%)[BibTeX][doi>][Download]
  • Z. Zhang, X. Xu, and T. S. Huang, "Indecisive Classifier," in Proc. 2005 IEEE International Conference on Multimedia & Expo (ICME 2005), 2005,[BibTeX]
  • X. Xu and T. S. Huang, "Face Recognition with MRC-Boosting," in Proc. 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005, pp. 1770–1777. (acceptance rate 20.4%)[BibTeX][doi>][Download]
  • X. Xu, C. Zhang, and T. S. Huang, "Active Morphable Model: An Efficient Method for Face Analysis," in Proc. Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004, pp. 837–842. (oral presentation, acceptance rate 15.6%)[BibTeX][doi>][Download]
  • X. Xu, Z. Lin, Y. Li, and C. Zhang, "Robust Flowchart Understanding for Pen-based User Interface," in preparation, 2003.[BibTeX]
  • X. Xu, "Active Morphable Model for the Analysis and Synthesis of Human Faces (In Chinese)," Master's thesis, 2003.[BibTeX]
  • X. Xu and C. Zhang, "Multi-agent Developmental Model of Plants," in Proc. 7th Conference on Artificial Life and Robotics (AROB'02), 2002,[BibTeX]
  • X. Xu, "A Computer Login System Based on Face and Fingerprint Recognition (In Chinese)," Bachelor's thesis, 2000.[BibTeX]

Patents

Codes

Coming soon…

Links

Bibiliography

Journals

Misc