About me
I am a professor at the Department of Electronics and Information Engineering, Huazhong University of Science and Technology (HUST). I received my Ph.D. in Communication and Information Engineering from HUST. My research interests focus on Computer Vision, Pattern Recognition, and Deep Learning. I lead the Visual Computing group, which is part of the Media and Communication Lab (MC Lab), HUST. Here are some ongoing projects:
- Object Detection, Image/Person Retrieval, Contour Detection/Grouping
- OCR: Scene Text Detection and Recognition A brief survey, Slice (2014), Slice (2017)
- Shape Representation, 2D/3D Shape Matching and Retrieval
- Jan., 2019: I received the "AAAI-2019 Outstanding SPC Award".
- Dec., 2018: The source code of Mask TextSpotter was released.
- Nov., 2018: I will serve as a senior PC of IJCAI 2019.
- Sep., 2018: The 14th IAPR Int. Workshop on Document Analysis Systems (DAS'20) will be organized by HUST.
- Sep., 2018: The special issue "Scene Text Reading and its Applications" (STRA) of the journal Pattern Recognition was approved.
- July, 2018: The source code of Deep-Person and ASTER was released.
- May, 2018: Congratulations to Dr. Song Bai and Dr. Baoguang Shi! I wish them a bright future.
- May, 2018: I will serve as an area chair of CVPR 2019 & a senior PC of AAAI 2019.
- April, 2018: I will serve as an area chair of ICDAR 2019.
- Jan., 2018: The DOTA dataset was released. The code of TextBoxes++, including the whole pipeline for end-to-end text recognition, was released.
- Dec., 2017: I will serve as an area chair of ICPR 2018 and ACCV 2018.
- Nov., 2017: Keynote Speech: "Deep Neural Networks for Scene Text Reading Revisited" at ICDAR 2017, Kyoto. [ppt]
- Nov., 2017: The ICDAR 2017 Competition on Reading Chinese Scene Text in the Wild (RCTW-17) was successfully completed.
- Aug., 2017: Our paper "Ensemble Diffusion for Retrieval" was accepted by ICCV 2017 as an Oral Presentation.
- July, 2017: I was identified as a "CVPR 2017 Outstanding Reviewer".
- April, 2017: Invited talk: "Oriented Scene Text Detection Revisited" at VALSE 2017, Xiamen. [ppt]
- Jan., 2017: I became an editorial board member of the journal "Pattern Recognition".
- June, 2016: Baoguang received support from CSC to visit Cornell University. Congratulations!
- March, 2016: We won first place in the SHREC 2016 competition: Large-Scale 3D Shape Retrieval under the perturbed case.
Selected Recent Publications
Y. Xu, Y. Wang, W. Zhou, Y. Wang, Z. Yang, X. Bai. TextField: Learning A Deep Direction Field for Irregular Scene Text Detection. IEEE Trans. on Image Processing, accepted.
M. Liao, J. Zhang, Z. Wan, F. Xie, J. Liang, P. Lyu, C. Yao, X. Bai. Scene Text Recognition from Two-Dimensional Perspective. AAAI, 2019.
P. Lyu, M. Liao, C. Yao, W. Wu, X. Bai. Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes. ECCV, 2018. [code] (An end-to-end trainable neural network for end-to-end recognition of horizontal, multi-oriented and curved texts)
B. Shi, M. Yang, X. Wang, P. Lyu, C. Yao, X. Bai. ASTER: An Attentional Scene Text Recognizer with Flexible Rectification. IEEE Trans. on PAMI, accepted. [code]
S. Bai, X. Bai, Q. Tian, L. Latecki. Regularized Diffusion Process on Bidirectional Context for Object Retrieval. IEEE Trans. on PAMI, accepted. [code]
M. Liao, B. Shi, X. Bai. TextBoxes++: A Single-Shot Oriented Scene Text Detector. IEEE Trans. on Image Proc., 27(8): 3676-3690, 2018. [code]
G. Xia, X. Bai, J. Ding, Z. Zhu, S. Belongie, J. Luo, M. Datcu, M. Pelillo, L. Zhang. DOTA: A Large-scale Dataset for Object Detection in Aerial Images. CVPR, 2018. [dataset address 1][dataset address 2]
X. Bai, M. Yang, et al. Deep-Person: Learning Discriminative Deep Features for Person Re-Identification. arXiv:1711.10658. [code]
S. Bai, et al. Ensemble Diffusion for Retrieval. ICCV, 2017. (Oral Presentation, AR = 2.1%) [code]
B. Shi, X. Bai, C. Yao. An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition. IEEE Trans. on PAMI, 39(11): 2298-2304, 2017. [code][music score recognition datasets][GitHub]
Associate Editor of ACTA AUTOMATICA SINICA (2018-), Pattern Recognition (2017-), Pattern Recognition Letters (2016-), Frontiers of Computer Science, Neurocomputing (2015-)
Guest Editor of SI: Scene Text Reading and its Applications, Pattern Recognition, 2019;
Guest Editor of Special Section on Deep Learning, Journal of Computer Science and Technology, 2017;
Guest Editor of SI: Multi-Instance Learning in Pattern Recognition and Vision, Pattern Recognition, 2017;
Guest Editor of SI: Deep Learning Applications in Computer Vision, Frontiers of Computer Science, 2017;
Guest Editor of SI: Efficient Shape Representation, Matching, Ranking, and its Applications, Pattern Recognition Letters, 2016.
Program Chair of the 14th IAPR Int. Workshop on Document Analysis Systems (DAS'20), Wuhan.
General Chair of the 6th Vision and Learning Seminar (VALSE'16), Wuhan.
Program Chair of the 1st IEEE SPS Signal and Data Science Forum (SIDAS'16), Wuhan.
Organizer of the 1st Int. Workshop on Deep Learning for Document Analysis and Recognition (DLDAR'18), in conjunction with ICPR'18.
Organizer of the 1st Int. Workshop on Deep Learning for Pattern Recognition (DLPR'16), in conjunction with ICPR'16.
Organizer of the 2nd Int. Workshop on Deep Learning for Pattern Recognition (DLPR'18), in conjunction with ICPR'18.
Organizer of the 3rd Int. Workshop on Robust Reading (IWRR'18), in conjunction with ACCV'18.
Organizer of the special session on Visual Semantic Learning from Big Surveillance Data (VISA), WCCI/IJCNN'16.
Organizer of the ICPR 2018 Contest on Object Detection in Aerial Images (ODAI'18).
Organizer of the ICDAR 2017 Competition on Reading Chinese Text in the Wild (RCTW'17).
TPC of CVPR (08-18), ICCV (09-17), ECCV (10-18), NIPS (15-18), ICLR (18-19), ICML (18), AAAI (17-18), AISTATS (17-19), etc.;
Area Chair or Senior TPC of CVPR 19, AAAI 19, IJCAI (17-19), ICDAR 19, ICIP 17, MVA 17, ACCV 18, ICPR 18.
Contest Chair of ICPR 18.
Reviewer for more than 40 int. journals including TPAMI, IJCV, TIP, TKDE, Pattern Recognition, TNNLS, TMM, TCYB, TIE, TIST, TVCG, IVC, CVIU, PRL, SPL, IJDAR, IJPRAI, etc.
Awards and Honors
Most Cited Chinese Researchers, 2014-2017; National Program for Support of Top-notch Young Professionals, 2016; Program for HUST Academic Frontier Youth Team, 2016; Hongshan District Outstanding Youth, 2014; Excellent Young Scientist Foundation of NSFC 2012; New century excellent talent of Ministry of Education, 2012; Hubei Province Outstanding Doctoral Thesis, 2011; Microsoft Fellowship 2007.
- Xinwei He Ph.D. student
- Xiaolong Liu Ph.D. student
- Mingkun Yang Ph.D. student
- Minghui Liao Ph.D. student (National Master Fellowship 2017)
- Zhiyong Dou Ph.D. student
- Zhen Zhu Master student
- Tengteng Huang Master student
Song Bai Ph.D. (National PhD Fellowship 2015, 2017) Dissertation: Context-based Affinity Learning: Theory and Algorithms, defended on Feb. 3rd, 2018. Now a postdoc at Oxford.
Baoguang Shi Ph.D. Dissertation: Deep Learning-Based Methods for Text Detection and Recognition in Natural Images, defended on May 24th, 2018. Now a researcher at Microsoft, Seattle.
- Bo Wang (2010), Hubei Province Excellent Undergraduate Graduation Thesis, now at Stanford University.
- Tianyang Ma (2010), Undergraduate Student, HUST; Ph.D. Temple University; Now at Amazon Company.
- Wei Shen (2013), Ph.D. HUST, Tencent Fellowship, co-supervised with Prof. Hongyuan Wang, now a faculty member of Shanghai University
- Xinggang Wang (2014), Ph.D. HUST, Microsoft Fellowship 2012, National PhD Fellowship, co-supervised with Prof. Wenyu Liu, Ass. Prof. of HUST
- Yu Zhou (2014), Ph.D. HUST, co-supervised with Prof. Wenyu Liu, now a faculty member of Beijing University of Posts and Telecommunications
- Junwei Wang (2012), Ph.D. HUST, co-supervised with Prof. Wenyu Liu, now at 709 Institute, Wuhan
- Chunyuan Li (2011), Undergraduate Student, HUST; Now at Duke University
- Chen Shen (2013), Master HUST, now at Temple University
- Cong Rao (2014), Master HUST, National Master Fellowship, now at Temple University.
- Weichao Qiu (2014), Master HUST, now at University of California, Los Angeles (UCLA)
- Yan Wang (2011), Hubei Province Excellent Undergraduate Graduation Thesis, now at NTU
- Quanming Yao (2013), Hubei Province Excellent Undergraduate Graduation Thesis, now at HKUST
- Yueming Wang (2014), Master HUST, National Master Fellowship, now at Taobao Company, Hangzhou
- Yi Xiong (2014), Master HUST, now at Tencent Company, Shenzhen
- Changtao Wang (2013), Master HUST, worked in Baidu Company, Beijing
- Chao Cai (2013), Master HUST, now at Zhongyuan Electronic Information Corporation, Wuhan
- Le He (2015) Master HUST; now at Toutiao
- Tingwu Hou (2015) Master HUST; now at Huawei
- Xu Min (2014), Undergraduate student, HUST; Now at Tsinghua University
- Yajun Gao (2011), Undergraduate student, HUST; Master, Georgia Institute of Technology. Now at Amazon
- Zhuotun Zhu (2015), Undergraduate student, HUST, Young Microsoft Fellowship; Now at University of California, Los Angeles (UCLA)
- Zheng Zhang (2016), Master, HUST, National Master Fellowship 2015, Now at Microsoft, Suzhou.
- Chengquan Zhang (2016), Master, HUST, Now at Baidu, Shenzhen.
- Hongyang Wang (2016), Undergraduate student, HUST. Now at Carnegie Mellon University (CMU)
- Pan Chen (2016), Ph.D., HUST, co-supervised with Prof. Wenyu Liu. Now a faculty member at China University of Geosciences
- Duoyou Zhou (2017) Master, HUST, now at Toutiao
- Xiong Duan (2017) Master, HUST, now at Tencent
- Zhichao Zhou (2018) Master, HUST, National Master Fellowship 2017, now at Baidu
- Pei Xu (2018) Master, HUST, now at Tencent
- Pengyuan Lv (2018) Master, HUST, National Master Fellowship 2017, now at Tencent AI
- Lingyan Cui (2018) Master, HUST, now at Meituan
- Yang Yang (2018) Master, HUST, now at ...