Jimmy Huang

School of Information Technology

Professor
School Director

Office: 3048 Victor Phillip Dahdaleh Building (DB)
(Formerly known as Technology Enhanced
Learning Building)
Phone: 416-736-2100 Ext: 30149
Emailjhuang@yorku.ca
Primary websitewww.yorku.ca/jhuang
Secondary websitewww.yorku.ca/jhuang/irlab/

Jimmy Huang is now a Professor at the School of Information Technology and the founding director of York’s Information Retrieval & Knowledge Management Research Lab at the York University, where he is also cross-appointed as a graduate faculty member in the programs of Computer Science and Engineering, Mathematics and Statistics, Health Informatics, Information Systems & Technology and Emergency Management. Dr. Huang has extensive industry working experience where he was awarded a CIO Achievement Award in Manulife before joining York University. Previously, he was a Post Doctoral Fellow working with Professor Nick Cercone at the School of Computer Science, University of Waterloo. He did his PhD in Information Science at City University in London, England, with Professor Stephen Robertson.

More...

Since joining York University in July 2003, he has published 81 refereed papers in top ranking journals, book chapters and international conference proceedings (such as ACM SIGIR, ACM CIKM, COLING and IEEE ICDM). In total, he has published more than 100 refereed papers. In the past three years, Dr. Huang has been Sole PIs of 19 external grants and Co- PIs of 6 external grants. He has received $1,576,796 external grants since 2007, primarily funded by Ontario Ministry of Research & Innovation, Natural Sciences and Engineering Research Council of Canada, NSERC Research Tools and Instruments Grant, Tri-Agency (SSHRC, CIHR and NSERC) Syntheses Grant, AlphaGlobal iT, CRD Grant, IBM, CIHR and Institute for Clinical Evaluative Sciences (ICES), SSHRC, Petro Canada, Mathematics of Information Technology and Complex Systems (MITACS), CGA-CanadaCAAA, Shared Hierarchical Academic Research Computing Network (SHARCNet), Information Retrieval Facility (IRF) and Ontario Research Fund - Research Excellence (ORF/RE). Since July 2003, he has supervised 9 Masters students, 7 PhD and 3 Postdoctoral Fellows. Currently, 3 Postdoctoral Fellows, 10 Ph.D. and Master students are under his supervision. He has also served on the supervisory committees of 16 Ph.D. and M.Sc. students. Jimmy Huang received the Dean’s Award for Outstanding Research in 2006, an Early Researcher

Award, formerly the Premier’s Research Excellence Awards, in 2007 for a healthcare project entitled “Analyzing and Searching Medical Data for Cost Effective Health Care”, the Petro Canada Young Innovators Award in 2008, the SHARCNET Research Fellowship Award in 2009, the MITACS Networking Award in 2010 and both the Best Paper Award and the Best Student Paper Award with his PhD student at the 32nd European Conference on Information Retrieval (ECIR 2010) in UK. He is the General Conference Chair for the 19th International ACM CIKM Conference in 2010. ACM CIKM conference is one of the most prestigious and competitive conferences in Information Retrieval, Database and Knowledge Management. He is also the General Program Chair for IEEE/ACM International Joint Conferences on Web Intelligence & Intelligent Agent Technology in 2010. He also serves as steering committee members of review and adjundication panels for Natural Sciences and Engineering Research Council of Canada (NSERC), MRI Early Researcher Awards (ERA) Program and National Science Foundation (NSF) of USA.

Area of Specialization

Information Technology

Degrees

PhD Information Science, City University in London, England
M.Eng. Computer Organization and Architecture,
B.Eng. Computer Engineering,


Research Interests

Information Technologies , Digital Library and Bioinformatics, Computational Linguistics, Data/Web/Data Mining

Selected Publications

Fuchun Peng and Xiangji Huang. "Machine Learning Approaches to Automatic Text Classification for Asian Languages", Journal of Documentation. Emerald Group Publishing Limited, U.K. ISSN: 0022-0418. Vol.63, No.3, 2007. pp.378-397.

Xiangji Huang, Qingsong Yao and Aijun An. "Applying Language Modeling to Session Identification from Database Trace Logs" (32 pages), Knowledge and Information Systems: An International Journal (KAIS). Springer-Verlag Publisher. ISSN (Printed): 0219-1377 and ISSN (Online): 0219-3116. Vol.10, No.4, 2006. pp.473-504.

Huang, X. “Comparison of Interestingness Measures for Web Usage Mining: An Empirical Study”, International Journal of Information Technology & Decision Making, 6(1):15-41, 2007. World Scientific Publishing Co., ISSN: 0219-6220. ISI Journal Impact Factor: 1.312

Bill Andreopoulos, Aijun An, Xiangji Huang and Xiaogang Wang. "Finding Molecular Complexes through Multiple Layer Clustering of Protein Interaction Networks" (22 pages), accepted by International Journal of Bioinformatics Research and Applications (IJBRA). Inderscience Publisher. ISSN (Printed): 1744-5485 and ISSN (Online): 1744-5493.

Aijun An, Shakil Khan and Xiangji Huang. "Hierarchical Grouping of Association Rules and Its Application to a Real-World Domain" (30 pages), accepted by International Journal of Systems Science (IJSS), Special Issue on Advances in Data Mining and Its Applications. Taylor & Francis Group Publisher. ISSN (Printed): 0020-7721 and ISSN (Online): 1464-5319.

Current Research Projects

Context-aware information retrieval and semantic text analysis for very large unstructured data


Project Type: Funded
Funders: 
Research Tools and Instruments (NSERC)

Personalizing and Searching Medical Data for Cost Effective Health Care


Project Type: Funded
Funders: 
Discovery Grants (NSERC)

Enter the Proposal Title in Non-Technical Language


Project Type: Funded
Funders: 
Ontario Early Researcher Award


Project Type: Funded
Funders: 
Ministry of Research and Innovation


Project Type: Funded
Funders: 
Minor Research Grant (York Internal Grant)


Project Type: Funded
Funders: 
Junior Faculty Fund (York Internal Grant)


Project Type: Funded
Funders: 
Minor Research Grant (York Internal Grant)


Project Type: Funded
Funders: 
Minor Research Grant (York Internal Grant)


Project Type: Funded
Funders: 
ATK Fellowship


Project Type: Funded
Funders: 
ATK Fellowship

Selected Publications

Fuchun Peng and Xiangji Huang. "Machine Learning Approaches to Automatic Text Classification for Asian Languages", Journal of Documentation. Emerald Group Publishing Limited, U.K. ISSN: 0022-0418. Vol.63, No.3, 2007. pp.378-397.

Xiangji Huang, Qingsong Yao and Aijun An. "Applying Language Modeling to Session Identification from Database Trace Logs" (32 pages), Knowledge and Information Systems: An International Journal (KAIS). Springer-Verlag Publisher. ISSN (Printed): 0219-1377 and ISSN (Online): 0219-3116. Vol.10, No.4, 2006. pp.473-504.

Huang, X. “Comparison of Interestingness Measures for Web Usage Mining: An Empirical Study”, International Journal of Information Technology & Decision Making, 6(1):15-41, 2007. World Scientific Publishing Co., ISSN: 0219-6220. ISI Journal Impact Factor: 1.312

Bill Andreopoulos, Aijun An, Xiangji Huang and Xiaogang Wang. "Finding Molecular Complexes through Multiple Layer Clustering of Protein Interaction Networks" (22 pages), accepted by International Journal of Bioinformatics Research and Applications (IJBRA). Inderscience Publisher. ISSN (Printed): 1744-5485 and ISSN (Online): 1744-5493.

Aijun An, Shakil Khan and Xiangji Huang. "Hierarchical Grouping of Association Rules and Its Application to a Real-World Domain" (30 pages), accepted by International Journal of Systems Science (IJSS), Special Issue on Advances in Data Mining and Its Applications. Taylor & Francis Group Publisher. ISSN (Printed): 0020-7721 and ISSN (Online): 1464-5319.

All Publications

Book Chapters

Andreopoulos, B., Huang, X., An, A., Labudde, D. and Hu, Q. “Promoting Diversity in Top Hits for Biomedical Passage Retrieval” (22 pages). In Zbigniew W. Ras (Editor), Advances in Data Management, Springer-Verlag Publisher. May 2009.

Liu, Y., Yu, X., Huang, X. and An, A. “Blog Data Mining: The Predictive Power of Sentiments” (22 pages). In Longbing Cao, Philip S. Yu and Chengqi Zhang (Editors), Data Mining for Business Applications, Springer-Verlag Publisher. November 2008. ISBN: 978-0-38779-419-8.

Huang, X. “Clustering Analysis and Algorithms”. In Vijayan Sugumaran (Ed.), Intelligent Information Technologies: Concepts, Methodologies, Tools, and Applications (Reprinted), Information Science Publishing (an imprint of Idea Group Inc.), August 2008. ISBN: 978-1-59904-941-0.

Huang, X., An, A. and Liu, Y. “Web Usage Mining with Web Logs” (15 pages). In John Wang (Ed.), Encyclopedia of Data Warehousing and Mining, Second Edition, Information Science Publishing (an imprint of Idea Group Inc.), August 2008. ISBN: 978-1-60566-010-3.

Xiangji Huang, "Clustering Analysis and Algorithms". In John Wang (Ed.), Encyclopedia of Data Warehousing and Mining, Information Science Publishing (an imprint of Idea Group Inc.), ISBN: 2-59140-557-2. 2005.

Journal Articles

Yin, X and Huang, X. “Re-Ranking with Context for High-Performance Biomedical Information Retrieval” (14 pages), accepted by International Journal of Data Mining and Bioinformatics. March 2010 (Xiaoshi is my PhD student). ISI Journal Impact Factor: 0.933

He, B and Huang, X. “Mining Authoritative and Topical Evidence for Improving Opinion Retrieval from the Blogosphere” (15 pages), conditionally accepted and under the 2nd round review by VLDB Journal with minor revision. Springer-Verlag Publisher, February 2010 (Ben He is my postdoctoral fellow).

Yu, X, Liu, Y., Huang, X. and An, A. “Mining Online Reviews for Predicting Sales Performance: A Case Study in the Movie Domain” (14 pages), accepted undergo a minor revision by IEEE Transactions on Knowledge and Data Engineering (TKDE). IEEE Publisher, May 16, 2010 (TKDE is a top tier journal in data mining. Xiaoshi is my PhD student and the paper published is a part of her PhD thesis). ISI Journal Impact Factor: 2.285

Hu, Q. and Huang, X. “Passage Extraction and Result Combination for Genomics Information Retrieval” (23 pages), Journal of Intelligent Information Systems (JIIS). 34(3): 249-274, 2010. Springer-Verlag Publisher, June 2010 (JIIS is a leading journal on Intelligent Information Systems and Q.Hu is my graduate student). ISSN (Printed): 0925-9902 and ISSN (Online): 1573-7675. ISI Journal Impact Factor: 0.980

Zhu, J., Huang, X., Song, D. and Ruger, S. “Integrating Multiple Document Features in Language Models for Expert Finding”, Knowledge and Information Systems: An International Journal (KAIS). 23(1): 29-54, 2010. Springer-Verlag Publisher, January 2010 (KAIS is one of the most esteemed journals in knowledge systems, data mining and advanced information systems. My contribution to this paper is 35%.). ISSN (Printed): 0219-1377 and ISSN (Online): 0219-3116. ISI Journal Impact Factor: 2.211

Yin, X and Huang, X. “Mining and Modeling Linkage Information from Citation Context for Improving Biomedical Literature Retrieval” (32 pages), accepted by Information Processing & Management: An International Journal (IPM). ELSEVIER Publisher, March 26, 2010 (IPM is a top tier journal in information retrieval, information systems and information management. My contribution to this paper is at least 50%. Xiaoshi is my PhD student). ISSN: 0306-4573. ISI Journal Impact Factor: 1.783

Lupu, M., Huang, Jimmy X., Zhu, J. and Tait, J. “TREC-CHEM: Large Scale Chemical Information Retrieval Evaluation at TREC”. SIGIR Forum 42(1): 63-77. December 2009.

Fuchun Peng and Xiangji Huang. "Machine Learning Approaches to Automatic Text Classification for Asian Languages", Journal of Documentation. Emerald Group Publishing Limited, U.K. ISSN: 0022-0418. Vol.63, No.3, 2007. pp.378-397.

Huang, X. “Comparison of Interestingness Measures for Web Usage Mining: An Empirical Study”, International Journal of Information Technology & Decision Making, 6(1):15-41, 2007. World Scientific Publishing Co., ISSN: 0219-6220. ISI Journal Impact Factor: 1.312

Yang Liu, Xiangji Huang and Aijun An. "Personalized Recommendation with Adaptive Mixture of Markov Models," Journal of the American Society for Information Science and Technology (JASIST). Wiley InterScience Publisher. ISSN (Printed): 1532-2882 and ISSN (Online): 1532-2890. Vol.58, No.12, 2007. pp.1851-1870.

Liu, Y., Huang, X. and An, A. “Adaptive Personalized Recommendation with a Mixture of Markov Models”, Journal of the American Society for Information Science & Technology (JASIST), 58(12):1851-1870, 2007. John Wiley & Sons. (JASIST is the most prestigious journal in information science and technology. Liu is my PhD student). ISI Journal Impact Factor: 2.300

Xiangji Huang, Qingsong Yao and Aijun An. "Applying Language Modeling to Session Identification from Database Trace Logs" (32 pages), Knowledge and Information Systems: An International Journal (KAIS). Springer-Verlag Publisher. ISSN (Printed): 0219-1377 and ISSN (Online): 0219-3116. Vol.10, No.4, 2006. pp.473-504.

Bill Andreopoulos, Aijun An, Xiangji Huang and Xiaogang Wang. "Finding Molecular Complexes through Multiple Layer Clustering of Protein Interaction Networks" (22 pages), accepted by International Journal of Bioinformatics Research and Applications (IJBRA). Inderscience Publisher. ISSN (Printed): 1744-5485 and ISSN (Online): 1744-5493.

Aijun An, Shakil Khan and Xiangji Huang. "Hierarchical Grouping of Association Rules and Its Application to a Real-World Domain" (30 pages), accepted by International Journal of Systems Science (IJSS), Special Issue on Advances in Data Mining and Its Applications. Taylor & Francis Group Publisher. ISSN (Printed): 0020-7721 and ISSN (Online): 1464-5319.

Xiangji Huang, Fuchun Peng, Aijun An and Dale Schuurmans. "Dynamic Web Log Session Identification with Statistical Language Models", Journal of the American Society for Information Science and Technology (JASIST). Wiley InterScience Publisher. ISSN (Printed): 1532-2882 and ISSN (Online): 1532-2890. Vol.55, No.14, 2004. pp.1290-1303.

Aijun, An, Y. Huang, Xiangji Huang and Nick Cercone. "Feature Selection with Rough Sets for Web Page Classification", LNCS Transactions on Rough Sets, Springer Verlag, Vol.2, 2004. pp.1-13

Xiangji Huang, Fuchun Peng, Dale Schuurmans, Nick Cercone and Stephen Robertson. "Applying Machine Learning to Text Segmentation for Information Retrieval", Information Retrieval, Vol 6, Issue 4, pp.333-362, 2003.

Xiangji Huang, Stephen E. Robertson, Nick Cercone and Aijun An "Probability-based Chinese Text Processing and Retrieval", Computational Intelligence: An International Journal (CI), Vol.16, No.4, 2000. pp.552-569

Xiangji Huang and Stephen E. Robertson. "Application of Probabilistic Models to Chinese Text Retrieval", Journal of Documentation (JDoc), Vol.53, No.1, 1997. pp.74-79.

Conference Papers

Huang, X., An, A. and Hu, Q. “Medical Search and Classification Tools for Recommendation”, Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Geneva, Switzerland, July 19-23, 2010 (32% acceptance rate: 99 papers accepted out of 310 submissions. ACM SIGIR is the best conference in the field of Information Retrieval. Q. Hu is my PhD student).

Yin, X. and Huang, X. “A Survival Modeling Approach to Biomedical Search Result Diversification UsingWikipedia”, Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Geneva, Switzerland, July 19-23, 2010 (32% acceptance rate: 99 papers accepted out of 310 submissions. ACM SIGIR is the best conference in the field of Information Retrieval. X. Yin is my PhD student).

Yu, X., Liu, Y., Huang, X. and An, A. “S-PLSA+: Adaptive Sentiment Analysis with Application to Sales”, Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Geneva, Switzerland, July 19-23, 2010 (32% acceptance rate: 99 papers accepted out of 310 submissions. ACM SIGIR is the best conference in the field of Information Retrieval. Dr. Y. Liu is my PDF).

Yu, X., Liu, Y., Huang, X. and An, A. “A Quality-Aware Model for Sales Prediction Using Reviews”, Proceedings of the 19th International World Wide Web Conference (WWW’10), Raleigh, North Carolina, USA, April 26-30, 2010. (Dr. Y. Liu is my postdoctoral fellow).

Hu, Q. and Huang, X. “Genomics Information Retrieval Using a Bayesian Model for Learning and Re-ranking”, Proceedings of the 2010 ACM International Conference on Bioinformatics and Computational Biology (BCB), New York, USA, August 2-4, 2010. (Q. Hu is my PhD student starting from September 2010).

An, X., Huang, X. and Cercone, N. “The Optimal IR: How Far Away?” (full paper, 12 pages), Proceedings of the 11th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing’10), Iasi, Romania, March 21- 27, 2010. Lecture Notes in Computer Science (LNCS) 6008, 602-613. Springer-Verlag Publisher, ISBN 978-3-642-12115-9. (22.6% acceptance rate: 57 full papers accepted out of 252 submissions. Dr. X. An is my postdoctoral fellow starting from April 2010).

Yin, X. and Huang, X. “Promoting Ranking Diversity for Biomedical Information Retrieval Using Wikipedia” (full paper, 12 pages), Proceedings of the 32nd European Conference on Information Retrieval (ECIR2010), Milton Keynes, UK, 28-31 March 2010. Lecture Notes in Computer Science (LNCS) 5993, 107-118. Springer-Verlag Publisher, ISBN 978-3-642-12274-3. (22% acceptance rate. X. Yin is my PhD student. Jimmy Huang and Xiaoshi Yin received the best paper award at ECIR2010 for this paper1. As Xiaoshi is a student, this paper was also awarded the best student paper at ECIR this year.)

Ye, Z. and Huang, X. “Exploring Social Annotation Tags to Enhance Information Retrieval Performance” (full paper, 12 pages), Proceedings of the 2010 International Conference on Active Media Technology (AMT’10), Toronto, Canada, August 28-30, 2010. Lecture Notes in Computer Science (LNCS) 5993, 107-118. Springer-Verlag Publisher, ISBN 978-3-642-12274-3. (Z. Ye is my PhD student).

Hu, Vivian Q., Ye, Z. and Huang, X. “Enhancing Content-Based Image Retrieval Using Machine Learning Techniques” (full paper, 12 pages), Proceedings of the 2010 International Conference on Active Media Technology (AMT’10), Toronto, Canada, August 28-30, 2010. Lecture Notes in Computer Science (LNCS) 5993, 107-118. Springer-Verlag Publisher, ISBN 978-3-642-12274-3. (Vivian Hu and Z. Ye are my PhD student).

Hu, Vivian Q., Huang, X., William Melek and C. Joseph Kurian. “A Time Series Based Method for Analyzing and Predicting Personalized Medical Data” (full paper, 12 pages), Proceedings of the 2010 International Conference on Brain Informatics (BI’10), Toronto, Canada, August 28-30, 2010. Lecture Notes in Computer Science (LNCS) 5993, 107-118. Springer-Verlag Publisher, ISBN 978-3-642-12274-3. (Vivian Hu is my PhD student)

Yin, X., Huang, X. and Li, Z. “Towards A Better Ranking for Biomedical Information Retrieval Using Context” (full paper), Proceedings of the 2009 IEEE International Conference on Bioinformatics & Biomedicine , Washington D.C., USA, November 1- 4, 2009 (18.9% acceptance rate: 44 full papers accepted out of 233 submissions. X. Yin is my PhD student).

Yin, X., Huang, X. and Li, Z. “BioCLink: A Probabilistic Approach for Improving Genomics Search with Citation Links” (short paper), Proceedings of the 2009 IEEE International Conference on Bioinformatics & Biomedicine , Washington D.C., USA, November 1-4, 2009 (15.9% acceptance rate: 37 short papers accepted out of 233 submissions. X. Yin is my PhD student).

Rohian, H., An, A., Zhao, J. and Huang, X. “Discovering Temporal Associations among Significant Changes in Gene Expression” (short paper), Proceedings of the 2009 IEEE International Conference on Bioinformatics & Biomedicine , Washington D.C., USA, November 1-4, 2009 (15.9% acceptance rate: 37 short papers accepted out of 233 submissions. Rohian and Zhao are my Master and PhD students respectively. This paper was also awarded the best student poster paper at BIBM this year.).

Ye, Z., Huang, X. and Lin, H. “Towards A Better Performance for Medical Image Retrieval Using An Integrated Approach”, Proceedings of the 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Corfu, Greece, September 30 - October 2, 2009 (Z. Ye is my PhD student).

Huang, X. and Hu, Q. “A Bayesian Learning Approach to Promoting Diversity in Ranking for Biomedical Information Retrieval” (full paper), Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Boston, USA, July 19-23, 2009 (15.8% acceptance rate: 78 regular papers accepted out of 494 submissions. ACM SIGIR is the best conference in the field of Information Retrieval. Q. Hu is my PhD student).

Ye, Z., Huang, X. and Lin, H. “A Graph-based Approach to Mining Multilingual Word Associations from Wikipedia” (poster paper), Proceedings of the 32th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Boston, USA, July 19-23, 2009 (33.6% acceptance rate: 86 poster papers accepted out of 256 submissions. ACM SIGIR is the best conference in the field of Information Retrieval. Z. Ye is my PhD student).

An, A., Wan, Q., Zhao J. and Huang, X. “Diverging Patterns: Discovering Significant Frequency Change Dissimilarities in Large Databases”, Proceedings of the 18th International ACM Conference on Information and Knowledge Management (CIKM’09), Hong Kong, November 2-6, 2009. (20.2% acceptance rate: 171 short papers accepted out of 847 submissions)

Yin, X., Huang, X., Hu, Q and Li, Z. “Boosting Biomedical Information Retrieval Performance through Citation Graph: An Empirical Study”, Proceedings of the 13th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’09), Bangkok, Thailand, April 27-30, 2009. Lecture Notes in Computer Science (LNCS), 8 pages. Springer-Verlag Publisher. (21.30% acceptance rate from 338 submissions – PAKDD is a leading international conference in the areas of data mining and knowledge discovery. X. Yin and Q. Hu are my PhD and Master students respectively).

Huang, Jimmy X., An, A., Hu, Q. and Tu, K. “Medical Text Analytics Tools for Search and Classification”, Proceedings of the 2009 Annual International Conference on Information Technology and Communications in Health (ITCH’09), Laurel Point Inn, Victoria, BC, Canada, February 19-22, 2009. (Q. Hu is my MSc students).

Yang Liu, Xiangji Huang, Aijun An, and Xiaohui Yu “Modeling and Predicting the Helpfulness of Online Reviews”, Proceedings of the 2008 IEEE International Conference on Data Mining, Pisa, Italy, December 15-19, 2008. (9.7% acceptance rate: 70 regular papers accepted out of 724 submissions. IEEE ICDM is the best conference in the field of Data Mining. Y.Liu is my PhD student).

Hu, Q. and Huang, X. “A Reranking Model for Genomics Aspect Search”, Proceedings of the 31th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Singapore, July 20-24, 2008. (ACM SIGIR is the best conference in the field of Information Retrieval. Q. Hu is my Master student).

Liu, Y., Huang, X., An, A. and Yu, X. “HelpMeter: A Nonlinear Model for Predicting the Helpfulness of Online Reviews”, Proceedings of the 2008 IEEE/ACM International Conference on Web Intelligence, Sydney, Australia, December 9-12, 2008. (IEEE/ACM WI is the best conference in the field of Web Intelligencel. Y. Liu is my PhD student and the acceptance rate is 20% out of 430 submissions).

Hu, Q. and Huang, X. “A Dynamic Window Based Passage Extraction Algorithm for Genomic Information Retrieval”, Proceedings of the 17th International Symposium on Methodologies for Intelligent Systems (ISMIS’08), Toronto, Canada. May 20-23, 2008. Lecture Notes in Computer Science (LNCS). Springer-Verlag Publisher. (Q.Hu is my Master student)

Zhu, J., Song, D., Ruger S. and Huang, X. “Modeling Document Features for Expert Finding”, Proceedings of the 17th International ACM Conference on Information and Knowledge Management (CIKM’08), Napa Valley, CA, USA, October 26-30, 2008. (16% acceptance rate: 122 short papers accepted out of 772 submissions)

Andreopoulos, B., An, A, Huang, X. and Labudde, Dirk. “Integration of Genomic, Proteomic and Biomedical Information on the Semantic Web”, Proceedings of 2nd International Workshop on Conceptual Modelling for Life Sciences Applications (CMLSA 2008), Barcelona, Spain. October 20-23, 2008. Lecture Notes in Computer Science (LNCS). Springer-Verlag Publisher. (33% acceptance rate. See http://cmlsa2008.mucoms.org/index.shtml?papers for more information).

Y. Liu, X. Huang, A. An and X. Yu. "ARSA: A Sentiment-Aware Model for Predicting Sales Performance Using Blogs", to appear in the Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'07), Amsterdam, July 23-27, 2007.

Yao, Y., Zeng, Y., Zhong, N. and Huang, X. “Knowledge Retrieval”, Proceedings of the 2007 IEEE/WIC/ACM International Conference on Web Intelligence (WI’07), Silicon Valley, November 2-5, 2007. (17% acceptance rate: 58 regular papers accepted out of 343 submissions)

Xiangji Huang, YanRui Huang, Miao Wen, Aijun An, Yang Liu and Josiah Poon. "Applying Data Mining to Pseudo-Relevance Feedback for High Performance Text Retrieval", Proceedings of the 2006 IEEE International Conference on Data Mining (ICDM'06), Hong Kong, December 18-22, 2006.

Xiangji Huang, Miao Wen, Aijun An and YanRui Huang. "A Platform for Okapi-Based Contextual Information Retrieval", Proceedings of ACM SIGIR 2006, Seattle, Washington, August 6-11, 2006.

Ming Zhong and Xiangji Huang. "Concept-Based Biomedical Text Retrieval", Proceedings of ACM SIGIR 2006, Seattle, Washington, August 6-11, 2006.

Yang Liu, Aijun An and Xiangji Huang. "Boosting Prediction Accuracy on Imbalanced Datasets with SVM Ensembles", Proceedings of 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2006), Singapore, April 9-12, 2006. Lecture Notes in Computer Science (LNCS), Springer-Verlag Publisher.

Miao Wen and Xiangji Huang. "A Multi-level Searching and Re-ranking Framework for Information Retrieval", Proceedings of 2006 IEEE International Conference on Granular Computing, Atlanta, USA, May 10-12, 2006.

Xiangji Huang, YanRui Huang and Miao Wen. "A Dual Index Model for Contextual Information Retrieval", Proceedings of ACM SIGIR 2005, Salvador, Brazil, August 15-19, 2005.

Xiangji Huang and YanRui Huang. "Using Contextual Information to Improve Retrieval Performance", Proceedings of 2005 IEEE International Conference on Granular Computing, Beijing, China, July 25-27, 2005. ISBN: 0-7803-9017-2 and IEEE Catalog Number: 05EX1036.

Luo Si, T. Kanungo and Xiangji Huang "Boosting Performance of Bio-Entity Recongition by Combining Results from Multiple Systems", Proceedings of the 5th ACM SIGKDD Workshop on Data Mining in Bioinformatics, Chicago, USA, August 21, 2005.

Xiangji Huang. "Incorporating Contextual Retrieval into Okapi", Proceedings of ACM SIGIR Workshop on IR in Context (IRiX'05), Salvador, Brazil, August 19, 2005. ISBN: 87-7415-290-4

Ying Zou, Aijun An and Xiangji Huang. "Evaluation and Automatic Selection of Methods for Handling Missing Data", Proceedings of 2005 IEEE International Conference on Granular Computing, Beijing, China, July 25-27, 2005. ISBN: 0-7803-9017-2 and IEEE Catalog Number: 05EX1036.

Qingsong Yao, Q. Xiangji Huang and Aijun An. "A Machine Learning Approach to Identify Database Sessions Using Unlabeled Data", Proceedings of the 7th International Conference on Data Warehousing and Knowledge Discovery, Copenhagen, Denmark, August 22-26, 2005. Lecture Notes in Computer Science (LNCS) 3589: 254-264. Springer-Verlag Publisher.

Qingsong Yao, Q. Aijun An and Xiangji Huang. " Finding and Analyzing Database User Sessions", Proceedings of the 10th International Conference on Database Systems for Advanced Applications (DASFAA 2005), Beijing, China, April 18-20, 2005. Lecture Notes in Computer Science (LNCS) 3453: 851-862. Springer-Verlag Publisher.

Qingsong Yao, Q. Aijun An and Xiangji Huang. " A Distance-based Algorithm for Clustering Database User Sessions", Proceedings of the 15th International Symposium on Methodologies for Intelligent Systems (ISMIS 2005), Saratoga Springs, New York, May 25-28, 2005. Lecture Notes in Computer Science (LNCS) 3488: 562-572. Springer-Verlag Publisher.

Yang Liu, Xiangji Huang and Aijun An. " Clustering Web Surfers with Probabilistic Models in a Real Application", Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence (WI'04), Beijing, China, September 20-24, 2004. 851-862.

Yang Liu, Aijun An and Xiangji Huang. " Web Surfing Recommendations in a Real Application", Proceedings of the ECML/PKDD'04 workshop on Statistical Approaches for Web Mining, Pisa, Italy, September 20-24, 2004. 2-13.

Aijun An, Shakil Khan and Xiangji Huang. " Objective and Subjective Algorithms for Grouping Association Rules", Proceedings of the Third IEEE International Conference on Data Mining (ICDM 2003), Florida, USA, November 19-22, 2003.

Fuchun Peng, Xiangji Huang, Dale Schuurmans and Shaojun Wang. "Text Classification in Asian Languages without Word Segmentation", Proceedings of the Sixth International Workshop on Information Retrieval with Asian Languages (IRAL 2003), Sapporo, Japan, July 2003.

Xiangji Huang, Fuchun Peng, Aijun An and Dale Schuurmans. "Session Boundary Detection for Association Rule Learning Using N-Gram Language Models", Proceedings of the Sixteenth Canadian Conference on Artificial Intelligence (CAI-03), Halifax, Canada, June 11-13, 2003. Lecture Notes in Computer Science (LNCS) 2671: 237-251. Springer-Verlag Publisher.

Xiangji Huang, Aijun An, Nick Cercone and Gary Promhouse. "Discovery of Interesting Association Rules from Livelink Web Log data", Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM'02), Maebashi TERRSA, Maebashi City, Japan, December 9-12, 2002.

Xiangji Huang, Aijun An, Nick Cercone and Gary Promhouse. "Comparison of Interestingness Functions for Learning Web Usage Patterns", Proceedings of the 11th International ACM Conference on Information and Knowledge Management (CIKM'02), McLean, VA, USA, November 4-9, 2002.

Xiangji Huang, Fuchun Peng, Dale Schuurmans and Nick Cercone. "Waterloo at NTCIR-3: Comparing and Analysing Different Text Extraction Methods''", Proceedings of the Third NTCIR Workshop on Evaluation of Information Retrieval, Q&A, and Summarization (NTCIR'02), ISBN:4-86049-016-9, National Institute of Informatics, Tokyo Japan, October 8-10, 2002.

Fuchun Peng, Xiangji Huang, Dale Schuurmans and Nick Cercone. "Investigating the Relationship of Word Segmentation Performance and Retrieval Performance in Chinese IR", Proceedings of the 19th Biennial International Conference on Computational Linguistics (COLING'02), Taipei, Taiwan, August 24-September 1, 2002.

Fuchun Peng, Xiangji Huang, Dale Schuurmans, Nick Cercone and Stephen Robertson. "Using Self-Supervised Word Segmentation in Chinese Information Retrieval", Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'02), Tampere, Finland, August 11-15, 2002.

Xiangji Huang and Stephen Robertson. "Comparisons of Probabilistic Compound Unit Weighting Methods''", Proceedings of the 2001 IEEE ICDM Workshop on Text Mining, San Jose, USA, November 29-December 2 2001. 1-15.

Aijun An, Nick Cercone and Xiangji Huang. "A Case Study for Learning from Imbalanced Data Sets", Proceedings of the 14th Canadian Conference on Artificial Intelligence (CAI-03), Ottawa, Canada, June 7-9 2001. Lecture Notes in Computer Science (LNCS) 2056: 1-15. Springer-Verlag Publisher.

Xiangji Huang and Stephen E. Robertson. "A Probabilistic Approach to Chinese Information Retrieval: Theory and Experiments", Proceedings of the 22nd Annual BCS-IRSG Colloquium on Information Retrieval Research, Cambridge, England, April 2000. 178-193.

Upcoming Courses

TermCourse NumberSectionTitleType 
Fall 2017 AP/ITEC4020 3.0  Internet Client-Server Systems LECT  


Jimmy Huang is now a Professor at the School of Information Technology and the founding director of York’s Information Retrieval & Knowledge Management Research Lab at the York University, where he is also cross-appointed as a graduate faculty member in the programs of Computer Science and Engineering, Mathematics and Statistics, Health Informatics, Information Systems & Technology and Emergency Management. Dr. Huang has extensive industry working experience where he was awarded a CIO Achievement Award in Manulife before joining York University. Previously, he was a Post Doctoral Fellow working with Professor Nick Cercone at the School of Computer Science, University of Waterloo. He did his PhD in Information Science at City University in London, England, with Professor Stephen Robertson.


Since joining York University in July 2003, he has published 81 refereed papers in top ranking journals, book chapters and international conference proceedings (such as ACM SIGIR, ACM CIKM, COLING and IEEE ICDM). In total, he has published more than 100 refereed papers. In the past three years, Dr. Huang has been Sole PIs of 19 external grants and Co- PIs of 6 external grants. He has received $1,576,796 external grants since 2007, primarily funded by Ontario Ministry of Research & Innovation, Natural Sciences and Engineering Research Council of Canada, NSERC Research Tools and Instruments Grant, Tri-Agency (SSHRC, CIHR and NSERC) Syntheses Grant, AlphaGlobal iT, CRD Grant, IBM, CIHR and Institute for Clinical Evaluative Sciences (ICES), SSHRC, Petro Canada, Mathematics of Information Technology and Complex Systems (MITACS), CGA-CanadaCAAA, Shared Hierarchical Academic Research Computing Network (SHARCNet), Information Retrieval Facility (IRF) and Ontario Research Fund - Research Excellence (ORF/RE). Since July 2003, he has supervised 9 Masters students, 7 PhD and 3 Postdoctoral Fellows. Currently, 3 Postdoctoral Fellows, 10 Ph.D. and Master students are under his supervision. He has also served on the supervisory committees of 16 Ph.D. and M.Sc. students. Jimmy Huang received the Dean’s Award for Outstanding Research in 2006, an Early Researcher

Award, formerly the Premier’s Research Excellence Awards, in 2007 for a healthcare project entitled “Analyzing and Searching Medical Data for Cost Effective Health Care”, the Petro Canada Young Innovators Award in 2008, the SHARCNET Research Fellowship Award in 2009, the MITACS Networking Award in 2010 and both the Best Paper Award and the Best Student Paper Award with his PhD student at the 32nd European Conference on Information Retrieval (ECIR 2010) in UK. He is the General Conference Chair for the 19th International ACM CIKM Conference in 2010. ACM CIKM conference is one of the most prestigious and competitive conferences in Information Retrieval, Database and Knowledge Management. He is also the General Program Chair for IEEE/ACM International Joint Conferences on Web Intelligence & Intelligent Agent Technology in 2010. He also serves as steering committee members of review and adjundication panels for Natural Sciences and Engineering Research Council of Canada (NSERC), MRI Early Researcher Awards (ERA) Program and National Science Foundation (NSF) of USA.

Area of Specialization

Information Technology

Degrees

PhD Information Science, City University in London, England
M.Eng. Computer Organization and Architecture,
B.Eng. Computer Engineering,

Research Interests:

Information Technologies , Digital Library and Bioinformatics, Computational Linguistics, Data/Web/Data Mining

Current Research Projects

Context-aware information retrieval and semantic text analysis for very large unstructured data


Project Type: Funded
Funders: 
Research Tools and Instruments (NSERC)

Personalizing and Searching Medical Data for Cost Effective Health Care


Project Type: Funded
Funders: 
Discovery Grants (NSERC)

Enter the Proposal Title in Non-Technical Language


Project Type: Funded
Funders: 
Ontario Early Researcher Award


Project Type: Funded
Funders: 
Ministry of Research and Innovation


Project Type: Funded
Funders: 
Minor Research Grant (York Internal Grant)


Project Type: Funded
Funders: 
Junior Faculty Fund (York Internal Grant)


Project Type: Funded
Funders: 
Minor Research Grant (York Internal Grant)


Project Type: Funded
Funders: 
Minor Research Grant (York Internal Grant)


Project Type: Funded
Funders: 
ATK Fellowship


Project Type: Funded
Funders: 
ATK Fellowship

All Publications

Book Chapters

Andreopoulos, B., Huang, X., An, A., Labudde, D. and Hu, Q. “Promoting Diversity in Top Hits for Biomedical Passage Retrieval” (22 pages). In Zbigniew W. Ras (Editor), Advances in Data Management, Springer-Verlag Publisher. May 2009.

Liu, Y., Yu, X., Huang, X. and An, A. “Blog Data Mining: The Predictive Power of Sentiments” (22 pages). In Longbing Cao, Philip S. Yu and Chengqi Zhang (Editors), Data Mining for Business Applications, Springer-Verlag Publisher. November 2008. ISBN: 978-0-38779-419-8.

Huang, X. “Clustering Analysis and Algorithms”. In Vijayan Sugumaran (Ed.), Intelligent Information Technologies: Concepts, Methodologies, Tools, and Applications (Reprinted), Information Science Publishing (an imprint of Idea Group Inc.), August 2008. ISBN: 978-1-59904-941-0.

Huang, X., An, A. and Liu, Y. “Web Usage Mining with Web Logs” (15 pages). In John Wang (Ed.), Encyclopedia of Data Warehousing and Mining, Second Edition, Information Science Publishing (an imprint of Idea Group Inc.), August 2008. ISBN: 978-1-60566-010-3.

Xiangji Huang, "Clustering Analysis and Algorithms". In John Wang (Ed.), Encyclopedia of Data Warehousing and Mining, Information Science Publishing (an imprint of Idea Group Inc.), ISBN: 2-59140-557-2. 2005.

Journal Articles

Yin, X and Huang, X. “Re-Ranking with Context for High-Performance Biomedical Information Retrieval” (14 pages), accepted by International Journal of Data Mining and Bioinformatics. March 2010 (Xiaoshi is my PhD student). ISI Journal Impact Factor: 0.933

He, B and Huang, X. “Mining Authoritative and Topical Evidence for Improving Opinion Retrieval from the Blogosphere” (15 pages), conditionally accepted and under the 2nd round review by VLDB Journal with minor revision. Springer-Verlag Publisher, February 2010 (Ben He is my postdoctoral fellow).

Yu, X, Liu, Y., Huang, X. and An, A. “Mining Online Reviews for Predicting Sales Performance: A Case Study in the Movie Domain” (14 pages), accepted undergo a minor revision by IEEE Transactions on Knowledge and Data Engineering (TKDE). IEEE Publisher, May 16, 2010 (TKDE is a top tier journal in data mining. Xiaoshi is my PhD student and the paper published is a part of her PhD thesis). ISI Journal Impact Factor: 2.285

Hu, Q. and Huang, X. “Passage Extraction and Result Combination for Genomics Information Retrieval” (23 pages), Journal of Intelligent Information Systems (JIIS). 34(3): 249-274, 2010. Springer-Verlag Publisher, June 2010 (JIIS is a leading journal on Intelligent Information Systems and Q.Hu is my graduate student). ISSN (Printed): 0925-9902 and ISSN (Online): 1573-7675. ISI Journal Impact Factor: 0.980

Zhu, J., Huang, X., Song, D. and Ruger, S. “Integrating Multiple Document Features in Language Models for Expert Finding”, Knowledge and Information Systems: An International Journal (KAIS). 23(1): 29-54, 2010. Springer-Verlag Publisher, January 2010 (KAIS is one of the most esteemed journals in knowledge systems, data mining and advanced information systems. My contribution to this paper is 35%.). ISSN (Printed): 0219-1377 and ISSN (Online): 0219-3116. ISI Journal Impact Factor: 2.211

Yin, X and Huang, X. “Mining and Modeling Linkage Information from Citation Context for Improving Biomedical Literature Retrieval” (32 pages), accepted by Information Processing & Management: An International Journal (IPM). ELSEVIER Publisher, March 26, 2010 (IPM is a top tier journal in information retrieval, information systems and information management. My contribution to this paper is at least 50%. Xiaoshi is my PhD student). ISSN: 0306-4573. ISI Journal Impact Factor: 1.783

Lupu, M., Huang, Jimmy X., Zhu, J. and Tait, J. “TREC-CHEM: Large Scale Chemical Information Retrieval Evaluation at TREC”. SIGIR Forum 42(1): 63-77. December 2009.

Fuchun Peng and Xiangji Huang. "Machine Learning Approaches to Automatic Text Classification for Asian Languages", Journal of Documentation. Emerald Group Publishing Limited, U.K. ISSN: 0022-0418. Vol.63, No.3, 2007. pp.378-397.

Huang, X. “Comparison of Interestingness Measures for Web Usage Mining: An Empirical Study”, International Journal of Information Technology & Decision Making, 6(1):15-41, 2007. World Scientific Publishing Co., ISSN: 0219-6220. ISI Journal Impact Factor: 1.312

Yang Liu, Xiangji Huang and Aijun An. "Personalized Recommendation with Adaptive Mixture of Markov Models," Journal of the American Society for Information Science and Technology (JASIST). Wiley InterScience Publisher. ISSN (Printed): 1532-2882 and ISSN (Online): 1532-2890. Vol.58, No.12, 2007. pp.1851-1870.

Liu, Y., Huang, X. and An, A. “Adaptive Personalized Recommendation with a Mixture of Markov Models”, Journal of the American Society for Information Science & Technology (JASIST), 58(12):1851-1870, 2007. John Wiley & Sons. (JASIST is the most prestigious journal in information science and technology. Liu is my PhD student). ISI Journal Impact Factor: 2.300

Xiangji Huang, Qingsong Yao and Aijun An. "Applying Language Modeling to Session Identification from Database Trace Logs" (32 pages), Knowledge and Information Systems: An International Journal (KAIS). Springer-Verlag Publisher. ISSN (Printed): 0219-1377 and ISSN (Online): 0219-3116. Vol.10, No.4, 2006. pp.473-504.

Bill Andreopoulos, Aijun An, Xiangji Huang and Xiaogang Wang. "Finding Molecular Complexes through Multiple Layer Clustering of Protein Interaction Networks" (22 pages), accepted by International Journal of Bioinformatics Research and Applications (IJBRA). Inderscience Publisher. ISSN (Printed): 1744-5485 and ISSN (Online): 1744-5493.

Aijun An, Shakil Khan and Xiangji Huang. "Hierarchical Grouping of Association Rules and Its Application to a Real-World Domain" (30 pages), accepted by International Journal of Systems Science (IJSS), Special Issue on Advances in Data Mining and Its Applications. Taylor & Francis Group Publisher. ISSN (Printed): 0020-7721 and ISSN (Online): 1464-5319.

Xiangji Huang, Fuchun Peng, Aijun An and Dale Schuurmans. "Dynamic Web Log Session Identification with Statistical Language Models", Journal of the American Society for Information Science and Technology (JASIST). Wiley InterScience Publisher. ISSN (Printed): 1532-2882 and ISSN (Online): 1532-2890. Vol.55, No.14, 2004. pp.1290-1303.

Aijun, An, Y. Huang, Xiangji Huang and Nick Cercone. "Feature Selection with Rough Sets for Web Page Classification", LNCS Transactions on Rough Sets, Springer Verlag, Vol.2, 2004. pp.1-13

Xiangji Huang, Fuchun Peng, Dale Schuurmans, Nick Cercone and Stephen Robertson. "Applying Machine Learning to Text Segmentation for Information Retrieval", Information Retrieval, Vol 6, Issue 4, pp.333-362, 2003.

Xiangji Huang, Stephen E. Robertson, Nick Cercone and Aijun An "Probability-based Chinese Text Processing and Retrieval", Computational Intelligence: An International Journal (CI), Vol.16, No.4, 2000. pp.552-569

Xiangji Huang and Stephen E. Robertson. "Application of Probabilistic Models to Chinese Text Retrieval", Journal of Documentation (JDoc), Vol.53, No.1, 1997. pp.74-79.

Conference Papers

Huang, X., An, A. and Hu, Q. “Medical Search and Classification Tools for Recommendation”, Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Geneva, Switzerland, July 19-23, 2010 (32% acceptance rate: 99 papers accepted out of 310 submissions. ACM SIGIR is the best conference in the field of Information Retrieval. Q. Hu is my PhD student).

Yin, X. and Huang, X. “A Survival Modeling Approach to Biomedical Search Result Diversification UsingWikipedia”, Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Geneva, Switzerland, July 19-23, 2010 (32% acceptance rate: 99 papers accepted out of 310 submissions. ACM SIGIR is the best conference in the field of Information Retrieval. X. Yin is my PhD student).

Yu, X., Liu, Y., Huang, X. and An, A. “S-PLSA+: Adaptive Sentiment Analysis with Application to Sales”, Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Geneva, Switzerland, July 19-23, 2010 (32% acceptance rate: 99 papers accepted out of 310 submissions. ACM SIGIR is the best conference in the field of Information Retrieval. Dr. Y. Liu is my PDF).

Yu, X., Liu, Y., Huang, X. and An, A. “A Quality-Aware Model for Sales Prediction Using Reviews”, Proceedings of the 19th International World Wide Web Conference (WWW’10), Raleigh, North Carolina, USA, April 26-30, 2010. (Dr. Y. Liu is my postdoctoral fellow).

Hu, Q. and Huang, X. “Genomics Information Retrieval Using a Bayesian Model for Learning and Re-ranking”, Proceedings of the 2010 ACM International Conference on Bioinformatics and Computational Biology (BCB), New York, USA, August 2-4, 2010. (Q. Hu is my PhD student starting from September 2010).

An, X., Huang, X. and Cercone, N. “The Optimal IR: How Far Away?” (full paper, 12 pages), Proceedings of the 11th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing’10), Iasi, Romania, March 21- 27, 2010. Lecture Notes in Computer Science (LNCS) 6008, 602-613. Springer-Verlag Publisher, ISBN 978-3-642-12115-9. (22.6% acceptance rate: 57 full papers accepted out of 252 submissions. Dr. X. An is my postdoctoral fellow starting from April 2010).

Yin, X. and Huang, X. “Promoting Ranking Diversity for Biomedical Information Retrieval Using Wikipedia” (full paper, 12 pages), Proceedings of the 32nd European Conference on Information Retrieval (ECIR2010), Milton Keynes, UK, 28-31 March 2010. Lecture Notes in Computer Science (LNCS) 5993, 107-118. Springer-Verlag Publisher, ISBN 978-3-642-12274-3. (22% acceptance rate. X. Yin is my PhD student. Jimmy Huang and Xiaoshi Yin received the best paper award at ECIR2010 for this paper1. As Xiaoshi is a student, this paper was also awarded the best student paper at ECIR this year.)

Ye, Z. and Huang, X. “Exploring Social Annotation Tags to Enhance Information Retrieval Performance” (full paper, 12 pages), Proceedings of the 2010 International Conference on Active Media Technology (AMT’10), Toronto, Canada, August 28-30, 2010. Lecture Notes in Computer Science (LNCS) 5993, 107-118. Springer-Verlag Publisher, ISBN 978-3-642-12274-3. (Z. Ye is my PhD student).

Hu, Vivian Q., Ye, Z. and Huang, X. “Enhancing Content-Based Image Retrieval Using Machine Learning Techniques” (full paper, 12 pages), Proceedings of the 2010 International Conference on Active Media Technology (AMT’10), Toronto, Canada, August 28-30, 2010. Lecture Notes in Computer Science (LNCS) 5993, 107-118. Springer-Verlag Publisher, ISBN 978-3-642-12274-3. (Vivian Hu and Z. Ye are my PhD student).

Hu, Vivian Q., Huang, X., William Melek and C. Joseph Kurian. “A Time Series Based Method for Analyzing and Predicting Personalized Medical Data” (full paper, 12 pages), Proceedings of the 2010 International Conference on Brain Informatics (BI’10), Toronto, Canada, August 28-30, 2010. Lecture Notes in Computer Science (LNCS) 5993, 107-118. Springer-Verlag Publisher, ISBN 978-3-642-12274-3. (Vivian Hu is my PhD student)

Yin, X., Huang, X. and Li, Z. “Towards A Better Ranking for Biomedical Information Retrieval Using Context” (full paper), Proceedings of the 2009 IEEE International Conference on Bioinformatics & Biomedicine , Washington D.C., USA, November 1- 4, 2009 (18.9% acceptance rate: 44 full papers accepted out of 233 submissions. X. Yin is my PhD student).

Yin, X., Huang, X. and Li, Z. “BioCLink: A Probabilistic Approach for Improving Genomics Search with Citation Links” (short paper), Proceedings of the 2009 IEEE International Conference on Bioinformatics & Biomedicine , Washington D.C., USA, November 1-4, 2009 (15.9% acceptance rate: 37 short papers accepted out of 233 submissions. X. Yin is my PhD student).

Rohian, H., An, A., Zhao, J. and Huang, X. “Discovering Temporal Associations among Significant Changes in Gene Expression” (short paper), Proceedings of the 2009 IEEE International Conference on Bioinformatics & Biomedicine , Washington D.C., USA, November 1-4, 2009 (15.9% acceptance rate: 37 short papers accepted out of 233 submissions. Rohian and Zhao are my Master and PhD students respectively. This paper was also awarded the best student poster paper at BIBM this year.).

Ye, Z., Huang, X. and Lin, H. “Towards A Better Performance for Medical Image Retrieval Using An Integrated Approach”, Proceedings of the 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Corfu, Greece, September 30 - October 2, 2009 (Z. Ye is my PhD student).

Huang, X. and Hu, Q. “A Bayesian Learning Approach to Promoting Diversity in Ranking for Biomedical Information Retrieval” (full paper), Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Boston, USA, July 19-23, 2009 (15.8% acceptance rate: 78 regular papers accepted out of 494 submissions. ACM SIGIR is the best conference in the field of Information Retrieval. Q. Hu is my PhD student).

Ye, Z., Huang, X. and Lin, H. “A Graph-based Approach to Mining Multilingual Word Associations from Wikipedia” (poster paper), Proceedings of the 32th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Boston, USA, July 19-23, 2009 (33.6% acceptance rate: 86 poster papers accepted out of 256 submissions. ACM SIGIR is the best conference in the field of Information Retrieval. Z. Ye is my PhD student).

An, A., Wan, Q., Zhao J. and Huang, X. “Diverging Patterns: Discovering Significant Frequency Change Dissimilarities in Large Databases”, Proceedings of the 18th International ACM Conference on Information and Knowledge Management (CIKM’09), Hong Kong, November 2-6, 2009. (20.2% acceptance rate: 171 short papers accepted out of 847 submissions)

Yin, X., Huang, X., Hu, Q and Li, Z. “Boosting Biomedical Information Retrieval Performance through Citation Graph: An Empirical Study”, Proceedings of the 13th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’09), Bangkok, Thailand, April 27-30, 2009. Lecture Notes in Computer Science (LNCS), 8 pages. Springer-Verlag Publisher. (21.30% acceptance rate from 338 submissions – PAKDD is a leading international conference in the areas of data mining and knowledge discovery. X. Yin and Q. Hu are my PhD and Master students respectively).

Huang, Jimmy X., An, A., Hu, Q. and Tu, K. “Medical Text Analytics Tools for Search and Classification”, Proceedings of the 2009 Annual International Conference on Information Technology and Communications in Health (ITCH’09), Laurel Point Inn, Victoria, BC, Canada, February 19-22, 2009. (Q. Hu is my MSc students).

Yang Liu, Xiangji Huang, Aijun An, and Xiaohui Yu “Modeling and Predicting the Helpfulness of Online Reviews”, Proceedings of the 2008 IEEE International Conference on Data Mining, Pisa, Italy, December 15-19, 2008. (9.7% acceptance rate: 70 regular papers accepted out of 724 submissions. IEEE ICDM is the best conference in the field of Data Mining. Y.Liu is my PhD student).

Hu, Q. and Huang, X. “A Reranking Model for Genomics Aspect Search”, Proceedings of the 31th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Singapore, July 20-24, 2008. (ACM SIGIR is the best conference in the field of Information Retrieval. Q. Hu is my Master student).

Liu, Y., Huang, X., An, A. and Yu, X. “HelpMeter: A Nonlinear Model for Predicting the Helpfulness of Online Reviews”, Proceedings of the 2008 IEEE/ACM International Conference on Web Intelligence, Sydney, Australia, December 9-12, 2008. (IEEE/ACM WI is the best conference in the field of Web Intelligencel. Y. Liu is my PhD student and the acceptance rate is 20% out of 430 submissions).

Hu, Q. and Huang, X. “A Dynamic Window Based Passage Extraction Algorithm for Genomic Information Retrieval”, Proceedings of the 17th International Symposium on Methodologies for Intelligent Systems (ISMIS’08), Toronto, Canada. May 20-23, 2008. Lecture Notes in Computer Science (LNCS). Springer-Verlag Publisher. (Q.Hu is my Master student)

Zhu, J., Song, D., Ruger S. and Huang, X. “Modeling Document Features for Expert Finding”, Proceedings of the 17th International ACM Conference on Information and Knowledge Management (CIKM’08), Napa Valley, CA, USA, October 26-30, 2008. (16% acceptance rate: 122 short papers accepted out of 772 submissions)

Andreopoulos, B., An, A, Huang, X. and Labudde, Dirk. “Integration of Genomic, Proteomic and Biomedical Information on the Semantic Web”, Proceedings of 2nd International Workshop on Conceptual Modelling for Life Sciences Applications (CMLSA 2008), Barcelona, Spain. October 20-23, 2008. Lecture Notes in Computer Science (LNCS). Springer-Verlag Publisher. (33% acceptance rate. See http://cmlsa2008.mucoms.org/index.shtml?papers for more information).

Y. Liu, X. Huang, A. An and X. Yu. "ARSA: A Sentiment-Aware Model for Predicting Sales Performance Using Blogs", to appear in the Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'07), Amsterdam, July 23-27, 2007.

Yao, Y., Zeng, Y., Zhong, N. and Huang, X. “Knowledge Retrieval”, Proceedings of the 2007 IEEE/WIC/ACM International Conference on Web Intelligence (WI’07), Silicon Valley, November 2-5, 2007. (17% acceptance rate: 58 regular papers accepted out of 343 submissions)

Xiangji Huang, YanRui Huang, Miao Wen, Aijun An, Yang Liu and Josiah Poon. "Applying Data Mining to Pseudo-Relevance Feedback for High Performance Text Retrieval", Proceedings of the 2006 IEEE International Conference on Data Mining (ICDM'06), Hong Kong, December 18-22, 2006.

Xiangji Huang, Miao Wen, Aijun An and YanRui Huang. "A Platform for Okapi-Based Contextual Information Retrieval", Proceedings of ACM SIGIR 2006, Seattle, Washington, August 6-11, 2006.

Ming Zhong and Xiangji Huang. "Concept-Based Biomedical Text Retrieval", Proceedings of ACM SIGIR 2006, Seattle, Washington, August 6-11, 2006.

Yang Liu, Aijun An and Xiangji Huang. "Boosting Prediction Accuracy on Imbalanced Datasets with SVM Ensembles", Proceedings of 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2006), Singapore, April 9-12, 2006. Lecture Notes in Computer Science (LNCS), Springer-Verlag Publisher.

Miao Wen and Xiangji Huang. "A Multi-level Searching and Re-ranking Framework for Information Retrieval", Proceedings of 2006 IEEE International Conference on Granular Computing, Atlanta, USA, May 10-12, 2006.

Xiangji Huang, YanRui Huang and Miao Wen. "A Dual Index Model for Contextual Information Retrieval", Proceedings of ACM SIGIR 2005, Salvador, Brazil, August 15-19, 2005.

Xiangji Huang and YanRui Huang. "Using Contextual Information to Improve Retrieval Performance", Proceedings of 2005 IEEE International Conference on Granular Computing, Beijing, China, July 25-27, 2005. ISBN: 0-7803-9017-2 and IEEE Catalog Number: 05EX1036.

Luo Si, T. Kanungo and Xiangji Huang "Boosting Performance of Bio-Entity Recongition by Combining Results from Multiple Systems", Proceedings of the 5th ACM SIGKDD Workshop on Data Mining in Bioinformatics, Chicago, USA, August 21, 2005.

Xiangji Huang. "Incorporating Contextual Retrieval into Okapi", Proceedings of ACM SIGIR Workshop on IR in Context (IRiX'05), Salvador, Brazil, August 19, 2005. ISBN: 87-7415-290-4

Ying Zou, Aijun An and Xiangji Huang. "Evaluation and Automatic Selection of Methods for Handling Missing Data", Proceedings of 2005 IEEE International Conference on Granular Computing, Beijing, China, July 25-27, 2005. ISBN: 0-7803-9017-2 and IEEE Catalog Number: 05EX1036.

Qingsong Yao, Q. Xiangji Huang and Aijun An. "A Machine Learning Approach to Identify Database Sessions Using Unlabeled Data", Proceedings of the 7th International Conference on Data Warehousing and Knowledge Discovery, Copenhagen, Denmark, August 22-26, 2005. Lecture Notes in Computer Science (LNCS) 3589: 254-264. Springer-Verlag Publisher.

Qingsong Yao, Q. Aijun An and Xiangji Huang. " Finding and Analyzing Database User Sessions", Proceedings of the 10th International Conference on Database Systems for Advanced Applications (DASFAA 2005), Beijing, China, April 18-20, 2005. Lecture Notes in Computer Science (LNCS) 3453: 851-862. Springer-Verlag Publisher.

Qingsong Yao, Q. Aijun An and Xiangji Huang. " A Distance-based Algorithm for Clustering Database User Sessions", Proceedings of the 15th International Symposium on Methodologies for Intelligent Systems (ISMIS 2005), Saratoga Springs, New York, May 25-28, 2005. Lecture Notes in Computer Science (LNCS) 3488: 562-572. Springer-Verlag Publisher.

Yang Liu, Xiangji Huang and Aijun An. " Clustering Web Surfers with Probabilistic Models in a Real Application", Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence (WI'04), Beijing, China, September 20-24, 2004. 851-862.

Yang Liu, Aijun An and Xiangji Huang. " Web Surfing Recommendations in a Real Application", Proceedings of the ECML/PKDD'04 workshop on Statistical Approaches for Web Mining, Pisa, Italy, September 20-24, 2004. 2-13.

Aijun An, Shakil Khan and Xiangji Huang. " Objective and Subjective Algorithms for Grouping Association Rules", Proceedings of the Third IEEE International Conference on Data Mining (ICDM 2003), Florida, USA, November 19-22, 2003.

Fuchun Peng, Xiangji Huang, Dale Schuurmans and Shaojun Wang. "Text Classification in Asian Languages without Word Segmentation", Proceedings of the Sixth International Workshop on Information Retrieval with Asian Languages (IRAL 2003), Sapporo, Japan, July 2003.

Xiangji Huang, Fuchun Peng, Aijun An and Dale Schuurmans. "Session Boundary Detection for Association Rule Learning Using N-Gram Language Models", Proceedings of the Sixteenth Canadian Conference on Artificial Intelligence (CAI-03), Halifax, Canada, June 11-13, 2003. Lecture Notes in Computer Science (LNCS) 2671: 237-251. Springer-Verlag Publisher.

Xiangji Huang, Aijun An, Nick Cercone and Gary Promhouse. "Discovery of Interesting Association Rules from Livelink Web Log data", Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM'02), Maebashi TERRSA, Maebashi City, Japan, December 9-12, 2002.

Xiangji Huang, Aijun An, Nick Cercone and Gary Promhouse. "Comparison of Interestingness Functions for Learning Web Usage Patterns", Proceedings of the 11th International ACM Conference on Information and Knowledge Management (CIKM'02), McLean, VA, USA, November 4-9, 2002.

Xiangji Huang, Fuchun Peng, Dale Schuurmans and Nick Cercone. "Waterloo at NTCIR-3: Comparing and Analysing Different Text Extraction Methods''", Proceedings of the Third NTCIR Workshop on Evaluation of Information Retrieval, Q&A, and Summarization (NTCIR'02), ISBN:4-86049-016-9, National Institute of Informatics, Tokyo Japan, October 8-10, 2002.

Fuchun Peng, Xiangji Huang, Dale Schuurmans and Nick Cercone. "Investigating the Relationship of Word Segmentation Performance and Retrieval Performance in Chinese IR", Proceedings of the 19th Biennial International Conference on Computational Linguistics (COLING'02), Taipei, Taiwan, August 24-September 1, 2002.

Fuchun Peng, Xiangji Huang, Dale Schuurmans, Nick Cercone and Stephen Robertson. "Using Self-Supervised Word Segmentation in Chinese Information Retrieval", Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'02), Tampere, Finland, August 11-15, 2002.

Xiangji Huang and Stephen Robertson. "Comparisons of Probabilistic Compound Unit Weighting Methods''", Proceedings of the 2001 IEEE ICDM Workshop on Text Mining, San Jose, USA, November 29-December 2 2001. 1-15.

Aijun An, Nick Cercone and Xiangji Huang. "A Case Study for Learning from Imbalanced Data Sets", Proceedings of the 14th Canadian Conference on Artificial Intelligence (CAI-03), Ottawa, Canada, June 7-9 2001. Lecture Notes in Computer Science (LNCS) 2056: 1-15. Springer-Verlag Publisher.

Xiangji Huang and Stephen E. Robertson. "A Probabilistic Approach to Chinese Information Retrieval: Theory and Experiments", Proceedings of the 22nd Annual BCS-IRSG Colloquium on Information Retrieval Research, Cambridge, England, April 2000. 178-193.


Teaching:

Upcoming Courses

TermCourse NumberSectionTitleType 
Fall 2017 AP/ITEC4020 3.0  Internet Client-Server Systems LECT