Selected Publication in Data/Web/Text Mining, Information Retrieval

·         Zhou G., Zhiyuan Zhu, Tingting He, Hu X:Cross-lingual sentiment classification with stacked autoencoders. Knowl. Inf. Syst. 47(1): 27-44 (2016)

·         Zhou G., Zhiwen Xie, Tingting He, Jun Zhao, Hu X. Learning the Multilingual Translation Representations for Question Retrieval in Community Question Answering via Non-Negative Matrix Factorization. IEEE/ACM Trans. Audio, Speech & Language Processing 24(7): 1305-1314 (2016)

·         Hu X., Lin T., Raghavan R., Wah B., Baeza-Yates R., Fox G., Shahal C., Smith M., Yang Q., Ghani R., Fan H., Lempel R., Nambir R.(Eds) , Proceedings of the 2013 IEEE International Conference on Big Data. (IEEE Big Data 2013),  ISBN: 978-1-4799-1292-6, Oct 6-9, 2013 Sana Clara, CA, USA

·         Wang Y. Hu X., AOBA: Recognizing Object Behavior in Pervasive Urban management, IEEE Transactions on Knowledge and Data Engineering, 26(11):2625-2638(2014)

·         Wang Y. Hu X., Fuzzy Reasoning of Accident Provenance in Pervasive Healthcare Monitoring Systems.  IEEE Journal of Biomedical and Health Informatics, 17(6):1015-1022(2013)

·         Zhang X., Hu X., He T., Park, E.K., Zhou X., Utilizing Different Link Types to Enhance Document Clustering based on Markov Random Field Model with Relaxation Labeling, accepted to be published in IEEE Transactions on Systems, Men and Cybernetics, Part A, 2012

·         Xin Chen, Hu X, He T, An Y. Wu X., Inferring functional groups from microbial gene catalogue with probabilistic topic models,  accepted to be published in IEEE Transactions on NanoBioscience, 2012

·         Chen Y., Yin X., Li Z. Hu X., Promoting Ranking Diversity for Biomedical Information Retrieval based on LDA, accepted to be published in BMC Genomics, 2012

·         Lu C., Hu X., J. Park, Exploiting the Social Tagging Network for Web Clustering, IEEE Transactions on Systems, Men and Cybernetics, Part A, Vol 41 (5), Sept., 2011, pp840-852

·         Hu X., Park, E.K., Zhang X., Microarray Gene Cluster Identification and Annotation through Cluster Ensemble and EM based Informative Textual Summarization, IEEE Transactions on Information Technology in Biomedicine, Sept., 2009, Vol. 13, No. 5, pp832-840

·         Hu X., Shen X., Mining Biomedical Literature for Identification of Potential Virus/Bacteria, in IEEE Intelligent System, Nov/Dec 2009, Vol 24 No. 6, pp73-77

·         Hu X., Zhang X., Yoo I., Wang X., Feng J.., Mining Hidden Connections among Biomedical Concepts from Disjoint Biomedical Literature Sets through Semantic-based Association Rule,  International Journal of Intelligent  System, 25(2): 207-223 (2010)

·         Zhou X., Hu X., Zhang X., Topic Signature Language Models for Ad-hoc Retrieval, in IEEE Transactions on Knowledge and Data Engineering (IEEE TKDE), Sept 2007, pp 276-287

·         Li Y., Hu X., Lin H., Yang Z. , A Framework for Semi-supervised Feature Generation and its Applications in Biomedical Literature Mining,  IEEE/ACM Transactions on Computational Biology and Bioinformatics, March-April 2011, pp294-307

·         Lu C., Park. J., Hu X.,  User Tags versue Expert-created metadata: A Comparison between LibraryThing tags and Library Congress Subject Headings, accepted to be published in Journal of Information Science

·         Yan, R., Li C., Heish H., Hu P.  Hu Xiaohua, He T.. Socialized Language Model Smoothing via Bi-Directional Influence Propogation on Social Network, WWW 2016, Montreal, Canada,  April 10-14, 2016  (full paper, acceptance rate : 16%)

·       Liu M., Fang Y., park D., Hu Xiaohua, Yu Z., Retrieving Non-Redundant Questions to Summarize a Product Review, SIGIR 2016, Pisa, Italy, July 17-21, 2016 (full paper, acceptance rate: 18%)

·         Yan R., Cheng-Te Li, Hu Xiaohua, Ming Zhang:Chinese Couplet Generation with Neural Network Structures. ACL (1) 2016

·         Wanying Ding, Yue Shang, Lifan Guo, Xiaohua Hu, Rui Yan, Tingting He. Video Popularity Prediction by Sentiment Propagation via Implicit Network,  regular paper, CIKM 2015

·         Rui Yan, , , Hu, Tackling Sparsity, the Achilles Heel of Social Networks: Language Model Smoothing via Social Regularization. ACL (2) : 623-629

·         Cercone N., Hou L., Keselj V., An A.,. Naruedomkul N.,, Hu X., From Computational Intelligence to Web Intelligence , in IEEE Computer, November 2002, pp 72-76

·         Chen X., Hu X., Zhou Z., An Y., He T., Park E.K., Modeling Semantic Relations between Visual Attributes and Object Categories via Dirichlet Forest Prior, in ACM CIKM 2012 (full paper, acceptance rate: 13.4)

·         An Y, Hu X., Song Y., Learning to Discover Complex Mappings from Web Forms to Ontologies,  in ACM CIKM 2012 ( full paper, acceptance rate: 13.4%)

·         Chen X., Hu X., An Y., Xiong Z., He T., E.K. Park, Perspective Hierarchical Dirichlet Process for User-Tagged Image Modeling, accepted in the 20th ACM Conference on Information and Knowledge Management (ACM CIKM 2011 (acceptance rate: 20%)

·         Lu C. Hu X. Chen X. J. Park, He T., Li Z., The Topic-Perspective Model for Social Tagging Systems, in the 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining  (ACM SIGKDD 10) full paper, 77/578, acceptance rate: 13.3%)

·         Chen X., Hu X., Zhou Z., Lu C., Rosen G.  He T., Park E.K, A Probabilistic Topic-Connection Model for Automatic Image Annotation, the 19th ACM Conference on Information and Knowledge Management (ACM CIKM 2010), (full paper, 127/945, acceptance rate  13.4%)

·         Hu X., Zhang X, Lu C., Park E.K., Zhou X.: Exploiting Wikipedia as external knowledge for document clustering, ACM SIGKDD 09: 389-396 (full paper, acceptance rate: 12%)

·         Achanauparkp, P., Hu X, He T., Guo L An Y., Li Z. Improving Diversity of Focused Summaries Through Negative Endorsements of Redundant Facts, in the 2010 IEEE/WIC/ACM International Conference on Web Intelligence (acceptance rate: 16.6%, 52/313)

·         Zhou X., Achananuparp P., Park E.K, Hu X., Zhang X: AskDragon: a redundancy-based factoid question answering system with lightweight local context analysis. ACM/IEEE JCDL 09: 483-484

·         Zhang X., Hu X., Zhou X., A Comparative Evaluation of Different Link Types on Enhancing Document Clustering, accepted in 31th  Annual International ACM SIGIR Conference on Research & Development on Information Retrieval (SIGIR 2008) (acceptance rate: 17%, 85/496)

·         Zhou X., Zhang X., Hu X., Semantic Smoothing for Bayesian Text Classification with Small Training Data, SIAM SDM 08

·         Hu X., Wu F.X. Ng M., Sokhansanj B., Mining and Dynamic Simulation of Sub-Networks from Large Biomolecular Networks, in 2007 International Conference on Artificial Intelligence, June 25-28, Las Vegas, USA (Best Paper Award, out of 500 submissions)

·         Zhou X., Hu X., Zhang X., A Segment-based Hidden Markov Model for Real-Setting Pinyin-to-Chinese Conversion,  in the Proceedings of the ACM CIKM 2007, pp 1027-1030 (acceptance rate: 26%, 512 submission)

·         Zhou X., Zhang X., Hu X., Semantic Smoothing of Document Models for Agglomerative Clustering,  in the Proceeding of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI 07), Hyderabad, India, Jan 6-12, 2007, pp2928-2933(acceptance rate: 15.7%, 212/1353)

·         Zhou X., Hu X., Zhang X., Lin X., Song I-Y., Context-Sensitive Semantic Smoothing for the Language Modeling Approach to Genomic IR, in the Proceedings of the 29th  Annual International ACM SIGIR Conference on Research & Development on Information Retrieval (SIGIR 2006), pp 170-177 (acceptance rate: 18.5%, 74/399)

·         Hu X., Zhang X., Yoo I., Zhang Y-Q. A Semantic Approach for Mining Hidden Links from Complementary and Non-Interactive Biomedical Literature, Proceedings of the 6th SIAM International Conference on Data Mining (SIAM SDM 06), April 20-22, 2006, Bethesda, MD, USA, pp 200-209 , (acceptance rate: 16%, 40/244)

·         Hu X., Zhang X., Zhou X., Integration of Cluster Ensemble and EM based Text Mining for Microarray Gene Cluster Identification and Annotation, in the Proceedings of ACM 15th Conference on Information and Knowledge Management (ACM CIKM 2006), (537 submissions, 15% acceptance rate for full papers, 10% acceptance rate for post papers)

·         Zhang X., Zhou X., Hu X., Semantic Smoothing for Model-based Document Clustering, accepted in the 2006 IEEE International Conference on Data Mining (IEEE ICDM06), Dec. 18-22, 2006, HongKong (800 submissions, acceptance rate : 20%)

·         Yoo I., Hu X., Song I-Y., Integration of Semantic-based Bipartite Graph Representation and Mutual Refinement Strategy for Biomedical Literature Clustering, in the Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD 2006) acceptance rate for full paper: 11%, acceptance rate for short paper: 12%, 50 full papers, 55 short papers out of 457 submission)

·         Yoo I., Hu X., A Comprehensive Comparison Study of Document Clustering for A Biomedical Digital Library MEDLINE, in the Proceedings of the 2006 ACM/IEEE Joint Conference on Digital Library (ACM/IEEE JCDL 2006), June 11-15, 2006, Chapel Hill, NC, USA, pp 220-229 (acceptance rate: 15%, 28/188)

·         Hu X., Yoo I., Song M., Zhang Y., Song I-Y., Mining Undiscovered Public Knowledge from Complementary and Non-interactive Biomedical Literature through Semantic Pruning, in ACM Fourteen Conference on Information and Knowledge Management (ACM CIKM 2005) regular papers: 77/425, poster paper: 86/425)

·         Hu X., Lin T.Y, Song I-Y., Lin X., Yoo I., Lechner M., Song M., Ontology-based Scalable and Portable Information Extraction System to Extract Biological Knowledge from Huge Collection of Biomedical Web Documents, in the Proceedings of the 2004 IEEE/ACM Web Intelligence Conference, Sept, 2004 (nominated for Best Paper Award), pp77-83

·         Hu X., Using Rough Set Theory and Database Operations to Construct a Good Ensemble of Classifier for Data Mining Application, in the Proceedings of the IEEE 2001 International conference in Data Mining (IEEE ICDM 2001), 233-240, Nov 29-Dec 2, 2001, San Jose, CA 2001, pp 233-240 (acceptance rate: 19.7%, 72/365)

·         Hu X., Cercone N., Mining Knowledge Rules from Databases: An Attribute-Oriented Rough Set Approach,  in the Proceedings of 12th IEEE International Conf. on Data Engineering, (ICDE96), New Orleans, LA, USA , Feb 27-March 1, 1996, pp 96-105 (acceptance rate: 22% , 60/264)

·         Hu X., Cercone N., Rough Sets Similarity-Based Learning From Databases, in the Proceedings of the 1st International Conference on Knowledge Discovery and Data Mining, Montreal (KDD 95), Canada, August 1995, pp 162-167

·         Hu X., Cercone N., Discovery of Decision Rules from Databases: A Rough Set Approach, in the Proceedings of the ACM 1st Third International Conference on Information and Knowledge Management (ACM CIKM'94), 392-400, Gaithersburg, Maryland, USA, Nov., 1994, pp 392-340

 

             Selected Publication in Bioinformatics

·         Hu X., Pan Y. (Eds), Knowledge Discovery in Bioinformatics: Techniques, Methods and Applications, John Wiley and Sons, ISBN 047177796X, John Wiley & Sons, 361 pages

·         Jiang X., Hu Xiaohua, Tingting He, Identification of the clustering structure in microbiome data by density clustering on the Manhattan distance. SCIENCE CHINA Information Sciences 59(7): 070104:1-070104:7 (2016)

·         Zhang Y ; Xiaohua Hu ; Xingpeng Jiang, Multi-view Clustering of Microbiome Samples by Robust Similarity Network Fusion and Spectral Clustering, IEEE/ACM Trans. Comput. Biology Bioinform. 12(2)

·         Jiang X., Hu X., Xu W., Park E.K., Predicting Microbial Interactions using Vector Autoregressive Model with Graph Regularization, accepted in IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2015 

·         Jiang X., Hu X., Microbiome data Integration by robust similarity network fusion, 2014 IEEE International Conference on Bioinformatics and Biomedicine

·         Jiang X. Hu X., Xu W., Comparison of dimensional reduction methods for detecting and visualizing novel patterns in human and marine microbiome, IEEE Transactions on NanoBioscience, 2013

·         Wang X,  Li G-Z, Jia-Ming Liu J-M,  Hu X., and  Zhao R-W, Multi-Label Learning for Protein Subcellular Location Prediction,  in IEEE Transactions on NanoBioscience, 2012

·         Kong W., Mou X., Hu X., Exploring Matrix Factorization Techniques for Significant Genes Identification of Alzheimer’s Disease Microarray Gene Expression Data, accepted to be published in BMC Bioinformatics, 2011

·         Chen X., Hu X., Lim T; Shen X., Park E., Rosen G., Exploit the Functional and Taxonomic Structure of Genomic Data by Probabilistic Topic Modeling, in  IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2011, Vol 8(2), March 2011, pp294-307

·         Liu J., Li Z., Hu X., Chen Y., Park E., Dynamics Biclustering of Microarray Data by Multi-Objective Immune Optimization, in BMC Genomics, 2011

·         Yin S., Li. Z., Huang X., Hu X.,  A relevance-novelty combined model for genomics search result diversification, accepted to be published in BMC Bioinformatics, 2011

·         Hu X., Ng M., Wu F.X , Sokhansanj B, Mining, Modeling and Evaluation of Sub-Networks from Large Biomolecular Networks and its Comparison Study,  in the IEEE Transactions on Information Technology in Biomedicine, March 2009, Vol 13, No 2., pp184-194

·         Hu X., Wu D., Data Mining and Predictive Modeling of Biomolecular Network from Biomedical Literature Databases, in IEEE/ACM Transactions on Computational Biology and Bioinformatics, (April-June  2007), p251-263

·         Tang Y.C., Zhang Y-Q,  Huang Z., Hu X.,, and Zhao Y. Recursive Fuzzy Granulation for Gene Subsets Extraction and Cancer Classification, IEEE Transactions on Information Technology in Biomedicine . Vol 12, No. 6, Nov. 2008, pp 723-730

·         Hu X., Sokhansanj B, Wu D., Tang Y., A Novel Approach for Mining and Dynamic Fuzzy Simulation of Biomolecular Network, in IEEE Transactions on Fuzzy Systems (Dec., 2007) pp1219-1229

·         Jiang X., Hu X., Shen H. He T., Manifold Learning reveals nonlinear structure in metagenomics profiles, in IEEE BIBM20112 (full paper, acceptance rate: 19.7%)

·         Chen X., He T., Hu X., An Y. Wu X., Inferring functional groups from microbial gene catalogue with probabilistic topic models. in the 2011 IEEE International Conference on Bioinformatics and Biomedicine, (IEEE BIBM2011). Atlanta, GA, USA. (acceptance rate: 20%,58/299)

·         Wu D., Hu X. Hu T.,, Exploratory Analysis of Protein Translation Regulatory Networks Using Hierarchical Random Graphs, accepted in the 2009 IEEE International Conference on Bioinformatics and Biomedicine, (IEEE BIBM09) (acceptance rate: 18.8%, 44/233)

·         Li Y., Hu X., Lin H., Yang Z., Learning an Enriched Representation from Unlabeled Data for Protein-Protein Interaction Extraction , BMC Bioinformatics, 11(Suppl 2):S7

·         Chen Y., Li Z., Wang X., Feng J., Hu X., Predicting Gene Function using Few position Examples and Unlabeled Ones, accepted to be published in BMC Genomics

·         Wu D., Hu X., He T., Park E. Wu X., Wang X., Feng J., Protein Translation Regulatory Networks Analysis based on Hierarchical Random Graphs,  BMC Bioinformatics (in press).

·         Liu J., Li Z., Hu X., Chen Y., Biclustering of Microarray Data with Multi-Objective Particle Swarm Optimization based on Crowding Distance, BMC Bioinformatics 10(S-4): (2009)

·         Yoo I., Hu X., Song I-Y, A Coherent Document Clustering and Text Summarization Approach through a Scale-free Ontology-enriched Graphical Representation, BMC Bioinformatics, 8(Suppl 9):S4

·         Hu X., Yoo I., Song I-Y., Song M., Han J., Lechner M., Extracting and Mining Protein-Protein Interaction Network from Biomedical Literature, in the Proceedings of the 2004 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (IEEE CIBCB 2004), Oct. 7-8, 2004, San Diego, USA, (Best Paper Award), pp 244-251