Faculty

Zhao Hai Professor

MainPage: [Click here]

Office Telephone: +86-21-3420-4273

Office Address: SEIEE-3-521

Email: zhaohai@cs.sjtu.edu.cn

Lab: The China Ministry of Education (MOE) - Microsoft Key Laboratory of Intelligent Computing and System

  • Research
  • Education
  • Work Experience
  • Teaching Assignment
  • Publications
  • Project Fund
  • Awards
  • Academic Service
natural language processing, machine learning, data mining, bioinformatics and artificial intelligence
2000-2005, Doctor of Philosophy in Computer Software and Theory, Shanghai Jiao Tong University, Shanghai, China
1998-2000, Master of Philosophy in Control Theory and Control Engineering, Yanshan University, Qinhuangdao, Hebei, China
1995-1999, Bachelor of Engineering in Sensor and Instruments, Yanshan University, Qinhuangdao, Hebei, China 
2012, Visiting researcher at Machine translation lab of NICT, Japan
2011, Visiting scholar of Startrack program at Natural Language Computing Group, Microsoft Research Asia, Beijing, China
2006-2009, Research Fellow in Computational Linguistics, City University of Hong Kong
2005-2006, Visiting Work at Natural Language Computing Group, Microsoft Research Asia, Beijing, China
programming thinking and methodology
natural language processing

[2017]

  • Deng Cai, Hai Zhao*, Yang Xin, Yuzhu Wang, Zhongye Jia
    A Hybrid Model for Chinese Spelling Check,
    ACM Transactions on Asian Low-Resource Language Information Process, 2017

[2016]

  • Rui Wang, Hai Zhao*, Bao-Liang Lu, Masao Utiyama and Eiichro Sumita,
    Connecting Phrase based Statistical Machine Translation Adaptation,
    COLING-2016, pp.3135-3145, Osaka, Japan, December, 2016
    [PDF]

  • Lianhui Qin, Zhisong Zhang, and Hai Zhao*
    Implicit Discourse Relation Recognition with Context-aware Character-enhanced Embeddings,
    COLING-2016, pp.1914-1924, Osaka, Japan, December, 2016
    [PDF]

  • Lianhui Qin, Zhisong Zhang, and Hai Zhao*
    A stacking gated neural architecture for implicit discourse relation classification.
    Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.2263-2270, Austin, USA, November, 2016
    [PDF]

  • Chenxi Pang, Hai Zhao*, Zhongyi Li,
    I Can Guess What You Mean: A Monolingual Query Enhancement for Machine Translation,
    LNCS Vol.10035, CCL-2016, Yantai, China, Oct 15-16, 2016
    [PDF]

  • Zhongyi Li, Hai Zhao*, Chenxi Pang, Lili Wang, Huan Wang
    A Constituent Syntactic Parse Tree based Discourse Parser,
    CoNLL-2016 Shared Task, pp.60-64, Berlin, Germany, August 7-12, 2016

  • Lianhui Qin, Zhisong Zhang, Hai Zhao*
    Shallow Discourse Parsing using Convolutional Neural Network,
    CoNLL-2016 Shared Task, pp.70-77, Berlin, Germany, August 7-12, 2016

  • Deng Cai, Hai Zhao*
    Neural Word Segmentation Learning for Chinese ,
    ACL-2016, pp.409-420, Berlin, Germany, August 7-12, 2016
    [PDF]

  • Zhisong Zhang, Hai Zhao*, Lianhui Qin
    Probabilistic Graph-based Dependency Parsing with Convolutional Neural Network,
    ACL-2016, pp. 1382-1392, Berlin, Germany, August 7-12, 2016
    [PDF]

  • Rui Wang, Hai Zhao*, Sabine Ploux, Bao-Liang Lu, Masao Utiyama
    A Bilingual Graph-based Semantic Model for Statistical Machine Translation,
    IJCAI-2016, pp.2950-2956, New York, USA, July 9-15, 2016
    [PDF]

  • Peilu Wang, Yao Qian,Hai Zhao*, Frank K. Soong, Lei He, Ke Wu
    Learning Distributed Word Representations For Bidirectional LSTM Recurrent Neural Network,
    NAACL-2016, pp.527-533, San Diego, USA, June 12-15, 2016
    [PDF]

  • Rui Wang, Masao Utiyama, Isao Goto, Eiichiro Sumita, Hai Zhao*, Bao-Liang Lu,
    Converting Continuous-Space Language Models into N-gram Language Models with Efficient Bilingual Pruning for Statistical Machine Translation,
    ACM Transactions on Asian Low-Resource Language Information Process, Vol. 15(3), Article 11, pp.1-26, January, 2016

  • Jingyi Zhang, Masao Utiyama, Eiichro Sumita, Hai Zhao*, Graham Neubig, Satoshi Nakamura,
    Learning local word reorderings for hierarchical phrase-based statistical machine translation,
    Machine Translation, Spinger, 2016

[2015]

  • Peilu Wang, Yao Qian, Frank K. Soong, Lei He, Hai Zhao
    Word Embedding for Recurrent Neural Betwork based TTS Synthesis,
    Proc. of Acoustics, Speech and Signal Processing (ICASSP), pp. 4879-4883,
    Brisbane, Australia, 2015

  • Ge Yan, Zhao Hai, Qin Yulin et al.
    Mining National and Regional Images from Newspaper Reports (in Chinese)
    Academic Monthly, Vol.47(7): 163-170, July, 2015

  • Changge Chen, Hai Zhao*, Yang Yang
    Deceptive Opinion Spam Detection using Deep Level Linguistic Features,
    The 4th CCF Conference on Natural Language Processing & Chinese Computing(NLPCC 2015),
    October 9-13, 2015, Nanchang, China

  • Shuo Zang, Hai Zhao*, Chunyang Wu, Rui Wang,
    A Novel Word Reordering Method for Statistical Machine Translation,
    The 2015 11th International Conference on Natural Computation (ICNC’15) and the 2015 12th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD’15),
    August 15-17, 2015, Zhangjiajie, China

  • Changge Chen, Peilu Wang, Hai Zhao*,
    Shallow Discourse Parsing Using Constituent Parsing Tree,
    CoNLL 2015
    July 30, 2015, Beijing, China

  • Jingyi Zhang, Masao Utiyama, Eiichro Sumita, Hai Zhao*,
    LearningWord Reorderings for Hierarchical Phrase-based Statistical Machine Translation,
    ACL-IJCNLP 2015
    July 26-31, 2015, Beijing, China

  • Rui Wang, Hai Zhao*, Bao-Liang Lu, Masao Utiyama and Eiichiro Sumita,
    Bilingual Continuous-Space Language Model Growing for Statistical Machine Translation,
    IEEE/ACM Transactions on Audio, Speech, and Languange Processing, Vol.23(7): 1209-1220, 2015

[2014]

  • Rui Wang, Hai Zhao, Bao-Liang Lu, Masao Utiyama and Eiichro Sumita
    Neural Network Based Bilingual Language Model Growing for Statistical Machine Translation
    EMNLP 2014: 189-195, Doha, Qatar, October, 2014

  • Jingyi Zhang, Masao Utiyama and Eiichro Sumita, Hai Zhao
    Learning Hierarchical Translation Spans
    EMNLP 2014: 183-188, Doha, Qatar, October, 2014

  • Yang Xin, Hai Zhao, Yuzhu Wang and Zhongye Jia
    An Improved Graph Model for Chinese Spell Checking
    SIGHAN-2014, Wuhan, China, October, 2014

  • Xiaolin Wang, Hai Zhao, Bao-Liang Lu
    A Meta-Top-down Method for Large-scale Hierarchical Classification
    IEEE Transactions on Knowledge and Data Engineering, Vol.26(3):500-513,March 2014

  • Xiaolin Wang, Yangyang Chen, Hai Zhao, Bao-Liang Lu
    Parallelized Extreme Learning Machine Ensemble Based on Min-Max Modular Network
    Neurocomputing, Vol.128:31-41, March 2014

  • Jia, Zhongye, Hai Zhao
    A Joint Graph Model for Pinyin-to-Chinese Conversion with Typo Correction
    In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)(ACL 2014), pages 1512--1523, Baltimore, Maryland, June

  • Wang, Peilu and Jia, Zhongye and Hai Zhao
    Grammatical Error Detection and Correction using a Single Maximum Entropy Model
    Proceedings of the Eighteenth Conference on Computational Natural Language Learning (CoNLL-2014), pages 74--82, Baltimore, Maryland, June


[2013]

Xiaolin Wang, Hai Zhao, Bao-Liang Lu, A Meta-Top-down Method for Large-scale Hierarchical Classification, IEEE Transactions on Knowledge and Data Engineering, 2013


Xiaolin Wang, Yangyang Chen, Hai Zhao, Bao-Liang Lu, Parallelized Extreme Learning Machine Ensemble Based on Min-Max Modular Network, Neurocomputing, 2013


Hai Zhao, Xiaotian Zhang, and Chunyu Kit, Integrative Semantic Dependency Parsing via Efficient Large-scale Feature Selection, Journal of Artificial Intelligence Research, Volume 46:203-233, 2013


Hai Zhao, Masao Utiyama, Eiichro Sumita, and Bao-Liang Lu, An Empirical Study on Word Segmentation for Chinese Machine Translation, A. Gelbukh (Ed.): CICLing 2013, Part II, LNCS 7817, pp. 248–263, 2013


[2012]
Shaohua Yang, Hai Zhao, Xiaolin Wang and Bao-liang Lu, Spell Checking for Chinese, Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’12), pages 730-736, Istanbul, Turkey, May, 2012


Chunyang Wu and Hai Zhao, Regression with Phrase Indicators for Estimating MT Quality, Proceedings of the 7th Workshop on Statistical Machine Translation of NAACL-2012, pages 152–156,Montreal, Quebec, Canada, June 7 - 8, 2012


Heming Shou and Hai Zhao, Hybrid Rule-based Algorithm for Coreference Resolution, Proceedings of the Joint Conference on EMNLP and CoNLL, pages 118-121, Jeju Island, Korea, July, 2012


Xiaotian Zhang, Chunyang Wu and Hai Zhao, Chinese Coreference Resolution via Ordered Filtering, Proceedings of the Joint Conference on EMNLP and CoNLL, pages 95-99, Jeju Island, Korea, July, 2012


Shaohua Yang, Hai Zhao and Bao-Liang Lu. A Machine Translation Approach for Chinese Whole-Sentence Pinyin-to-Character Conversion, PACLIC-26, Bali, Indonesia, November, 2012


Xiaotian Zhang, Yao Qian, Hai Zhao, Frank Soong, Break index labeling of Mandarin text via syntactic-to-prosodic tree mapping, The 8th International Symposium on Chinese Spoken Language Processing (ISCSLP-2012), Hong Kong, December 5-8, 2012


Xiaotian Zhang, Hai Zhao and Cong Hui, A Machine Learning Approach to Convert CCGbank to Penn Treebank, the 24th International Conference on Computational Linguistics (COLING 2012), pp.535-542, Mumbai, India, 8-15 December 2012


Qiongkai Xu and Hai Zhao, Using Deep Linguistic Features for Finding Deceptive Opinion Spam, the 24th International Conference on Computational Linguistics (COLING 2012), Mumbai, India, 8-15 December 2012


Xuezhe Ma and Hai Zhao, Fourth-Order Dependency Parsing, the 24th International Conference on Computational Linguistics (COLING 2012), Mumbai, India, 8-15 December 2012




[2011]
    Xiaolin Wang, Hai Zhao and Bao-Liang Lu, Enhance Top-down method with Meta-Classification for Very Large-scale Hierarchical Classification, IJCNLP-2011, Chiang Mai, Thailand, November 9-11, 2011


    Xiaotian Zhang and Hai Zhao, Unsupervised Chinese Phrase Parsing Based on Tree Pattern Mining, The 11th Confernece of China Computational Linguistics, Luoyang, China, August 20-22, 2011


    Hai Zhao and Chunyu Kit, Integrating unsupervised and supervised word segmentation: The role of goodness measures, Information Sciences, Vol.181(1): 163-183, 2011, Elsevier


[2010]
    ZHAO Hai, Natural Language Processing as A Branch of Artificial Intelligence : The Stagnant Tech, The Seventh Young Scholar Symposium on Natural Language Processing, Shenyang, China, September 18-19, 2010


    Xuezhe Ma, Xiaotian Zhang, Hai Zhao, Bao-Liang Lu, Dependency Parser for Chinese Constituent Parsing,  CIPS-SIGHAN-2010, August, 2010, Beijing, China


    Yan Song, Chunyu Kit and Hai Zhao, Reranking with Multiple Features for Better Transliteration, NEWS-2010, pp.62-65, July, 2010, Uppsala, Sweden


    Cong Hui, Hai Zhao, Yan Song, Bao-Liang Lu, An Empirical Study on Development Set Selection Strategy for Machine Translation Learning, WMT-2010, pp.67-71, July, 2010, Uppsala, Sweden


    Shaodian Zhang, Hai Zhao, Guodong Zhou and Bao-liang Lu, Hedge Detection and Scope Finding by Sequence Labeling with Procedural Feature Selection, CoNLL-2010, pp.92-99, July, 2010, Uppsala, Sweden


    Jian Zhang, Hai Zhao, and Bao-Liang Lu,  A Comparative Study on Two Large-Scale Hierarchical Text Categorization Tasks’ Solutions, IWWIP-2010, July, 2010, Qingdao, China


    Hai Zhao, Chang-Ning Huang, Mu Li, Bao-Liang Lu, A Unified Character-Based Tagging Framework for Chinese Word Segmentation, ACM Trans. Asian Lang. Inf. Process. 9(2): 2010


    Gang Jin, Qi Kong, Jian Zhang, Xiaolin Wang, Cong Hui, Hai Zhao, and Bao-Liang Lu, Multiple Strategies for NTCIR-08 Patent Mining at BCMI, NTCIR-8, June, 2010, Tokyo, Japan


    Minzhang Huang, Hai Zhao, Bao-Liang Lu, Pruning Training Samples Using a Supervised Clustering Algorithm,     ISNN (2) 2010: 250-257, June, 2010, Shanghai, China


    Hai Zhao, Yan Song, Chunyu Kit, How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method,  LREC 2010, May, 2010, Malta




[2009]
    SONG Yan, CAI Dong-Feng, ZHANG Gui-Ping, ZHAO Hai,An Approach to Chinese Word Segmentation based on Character-Word Joint Decoding, Journal of Software, Vol.20, No.9, pp.2366-2375, 2009


    Hai Zhao, Wenliang Chen, Chunyu Kit, Semantic Dependency Parsing of NomBank and PropBank: An Efficient Integrated Approach via a Large-scale Feature Selection, EMNLP 2009: conference on Empirical Methods in Natural Language Processing, pp.30-30, Singapore, August 6-7, 2009


    Junhui Li, Guodong Zhou, Hai Zhao, Qiaoming Zhu, Peide Qian, Improving Nominal SRL in Chinese Language with Verbal SRL Information and Automatic Predicate Recognition, EMNLP 2009: conference on Empirical Methods in Natural Language Processing, pp.1280-1288, Singapore, August 6-7, 2009


    Hai Zhao, Yan Song, Chunyu Kit, and Guodong Zhou,Cross Language Dependency Parsing using a Bilingual Lexicon,  Joint conference of the 47th Annual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), pp.55-63, Singapore, August 2-5, 2009


    Hai Zhao, Chunyu Kit, and Yan Song, Character Dependency Tree based Lexical and Syntactic All-in-one Parsing for Chinese, The 10th Chinese National Conference on Computational Linguistics (CNCCL-2009), pp.82-88, Yantai, China, July 24-26, 2009


    Hai Zhao, Wenliang Chen, Jun’ichi Kazama, Kiyotaka Uchimoto, and Kentaro Torisawa, Multilingual Dependency Learning: Exploiting Rich Features for Tagging Syntactic and Semantic Dependencies, Thirteenth Conference on Computational Natural Language Learning, (CoNLL-09), pp. 61-66, Boulder, CO, USA, June 4-5, 2009


    Hai Zhao, Wenliang Chen, Chunyu Kit, and Guodong Zhou, Multilingual Dependency Learning: A Huge Feature Engineering Method to Semantic Dependency Parsing, Thirteenth Conference on Computational Natural Language Learning, (CoNLL-09), pp. 55-60, Boulder, CO, USA, June 4-5, 2009


    Hai Zhao, Character-Level Dependencies in Chinese: Usefulness and Learning, The 12th Conference of the European Chapter of the Association for Computational Linguistics, (EACL-09), pp.879-887, Athens, Greece, March 30 - April 3, 2009


    Hai Zhao and Chunyu Kit, A Simple and Efficient Model Pruning Method for Conditional Random Fields, The 22nd International Conference on the Computer Processing of Oriental Languages (ICCPOL 2009), LNCS, Vol.5459, pp.149-159, Hong Kong, March 26-27, 2009
   
[2008]
    Hai Zhao and Chunyu Kit, Parsing Syntactic and Semantic Dependencies with Two Single-Stage Maximum Entropy Models, Twelfth Conference on Computational Natural Language Learning, (CoNLL-2008), pp.203-207, Manchester, UK, August 16-17, 2008


    Hai Zhao and Chunyu Kit, Scaling Conditional Random Fields by One-Against-the-Other Decomposition, Journal of Computer Science and Technology, Vol. 23(4): 612-619, July, 2008


    Hai Zhao and Chunyu Kit, Exploiting Unlabeled Text with Different Unsupervised Segmentation Criteria for Chinese Word Segmentation, The 9th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2008), Haifa, Israel, February 17-23, 2008, Also in Research in Computing Science, Vol. 33: 93-104, 2008


    Hai Zhao and Chunyu Kit, Unsupervised Segmentation Helps Supervised Learning of Character Tagging for Word Segmentation and Named Entity Recognition, The Sixth SIGHAN Workshop on Chinese Language Processing (SIGHAN-6), pp.106-111, Hyderabad, India, January 11-12, 2008


    Hai Zhao and Chunyu Kit, An Empirical Comparison of Goodness Measures for Unsupervised Chinese Word Segmentation with a Unified Framework, The Third International Joint Conference on Natural Language Processing (IJCNLP-2008), Vol. 1: 9-16, Hyderabad, India, January 8-10, 2008

NSFC No. 61170114: Internet Pseudo-information Recognition and Evaluation based on Linguistic Characteristics Analysis
NSFC No. 60903119: Character-level Dependency Tree based Annotation and Learning for Chinese Fine Structure

ACL-2018 senior area chair on phonology, morphology and word segmentation

ACL-2017 parsing area co-chair

ACL-2016 publication co-chair

PACLIC 29 PC chair

Contact webmaster@cs.sjtu.edu.cn

Copyright @ 2013 SJTU Computer Science & Engineering All Rights Reserved