Publications

2009

 

Suleyman Cetintas, Luo Si, YanPing Xin, Dake Zhang and Joo Young Park.  (2009). “Detecting Students’ Off-Task Behavior in Intelligent Tutoring Systems with Machine Learning Techniques”.  IEEE Transactions on Learning Technologies (TLT). (In Press)

Jeongwoo Ko, Luo Si, Eric Nyberg. (2009). “Combining Evidence with a Probabilistic Framework for Answer Ranking and Answer Merging in Question Answering”. Information Processing & Management. (IPM). (In Press).

Mummoorthy Murugesan, Wei Jiang, Chris Clifton, Luo Si and Jaideep Vaidya. (2009). “Efficient Privacy-Preserving Similar Document Detection”. The International Journal on Very Large Databases (VLDBJ).  (In Press)

Dan Zhang, Luo Si,, Wei Fang, and Tao Li. (2009). “Maximum Margin Multiple Instance Clustering”.  International Joint Conference on Artificial Intelligence (IJCAI).(PDF)

Wei Jiang, Mummoorthy Murugesan, Chris Clifton and Luo Si. "t-Plausibility: Semantic Preserving Text Sanitization". 2009 IEEE International Conference on Privacy, Security, Risk and Trust (PASSAT). (PDF)

Suleyman Scetintas Luo Si, and Hao Yaun. (2009). “Learning from Past Queries for Resource Selection”. In Proceedings of the 18th International Conference on Information and Knowledge Management (CIKM). (Short Paper) (PDF)

Suleyman Scetintas, Luo Si, YanPing Xin, Dake Zhang and Joo Young Park.  (2009). “Learning to Identify Students’ Off-Task Behavior in Intelligent Tutoring Systems”.  International Conference on Artificial Intelligence in Education. (AIEducation).

Suleyman Scetintas, Luo Si, YanPing Xin, Dake Zhang and Joo Young Park.  (2009). “Text Categorization of Mathematical Word Problems”. International Florida Artificial Intelligence Research Society Conference (FLAIRS).  (PDF)

Suleyman Cetintas, Luo Si, Yan Ping Xin and Casey Hord. (2009). “Predicting Correctness of Problem Solving from Low-level Log Data in Intelligent Tutoring Systems.” The 2nd International Conference on Educational Data Mining (EDM). (PDF)

Dan Zhang and Luo Si. (2009). “Modeling search response time”. Thirtieth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR) (Poster). (PDF)

Yi Fang, Luo Si, Aditya Mathur. (2009). “Learning to Rank Expertise Information in Heterogeneous Information Sources”. In SIGIR 2009 Workshop on Learning to Rank for Information Retrieval (SIGIR Workshop).  (PDF)


2008

Mengqiu Wang and Luo Si. (2008). “Discriminative Probabilistic Models for Passage Based Retrieval”, Answering",   In Proceedings of the Thirtieth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM. (SIGIR) (PDF)

Rong Jin, Luo Si and Christina Chan. (2008). “A Bayesian Framework for Knowledge Driven Regression Model in Micro-array Data Analysis”, International Journal of Data Mining and Bioinformatics (IJDMB), 2(3): 250-267, (PDF)

Luo Si, Danni Yu, Daisuke Kihara and Yi Fang (2008). "Combining Gene Sequence Similarity and Textual Information for Gene Function Annotation in the Literature". (Journal of Information Retrieval), 11(5): 283-404, (PDF

Luo Si, Jamie Callan, Suleyman Cetintas and Hao Yuan (2008). "An effective and efficient results merging strategy for multilingual information retrieval in federated search environments". (Journal of Information Retrieval), 11(1): 1-24, (PDF)

Wei Jiang, Mummoorthy Murugensan, Chris Clifton, and Luo Si. (2008). "Similar Document Detection with Limited Information Disclosure", In Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE) (PDF).

Yi Fang, Luo Si and Aditya Mathur, "FacFinder: Search for Expertise in Academic Institutions", Technical Report, SERC-TR-294 and Department of Computer Science, Purdue University 2008. (PDF)

Yi Fang, Luo Si and Aditya Mathur, Dung Trung Hong, and William Pfeifer "Where to find brians in Indiana?", Poster, IT Suumit, Purdue University 2008. (PDF) (Graduate Student Research Award).

 

2007

Jeongwoo Ko, Luo Si and Eric Nyberg. (2007). "A Probabilistic Graphical Model for Joint Answer Ranking in Question Answering" In Proceedings of the Twenty Nineth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM. (SIGIR) (PDF) (Acceptance Rate: 17%)

Suleyman Cetintas and Luo Si.  (2007). "Exploration of the Tradeoff between Effectiveness and Efficiency for Results Merging in Federated Search" In Proceedings of the Twenty Nineth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM. (SIGIR) (PDF) (Poster)

Wei Jiang, Luo Si  and Jing Li (2007). "Protecting Source Privacy in Federated Search" In Proceedings of the Twenty Nineth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM. (SIGIR) (PDF) (Poster)

Jeongwoo Ko, Luo Si and Eric Nyberg. (2007). "A Probabilistic Framework for Answer Selection in Question Answering" Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT) (Acceptance Rate: 24%) (PDF)

 

2006

Rong Jin, Luo Si, ChengXiang Zhai. (2006). "A study of Mixture Models for Collaborative Filtering" Journal of Information Retrieval. (Journal of Information Retrieval) (PDF).

Luo Si, Rong Jin, Steven C.H. Hoi (2006). "Collaborative Image Retrieval via Regularized Metric Learning"  ACM Multimedia Systems Journal, Special issue on Machine Learning Approaches to Multimedia Information Retrieval. (ACM Multimedia Systems Journal) (PDF).

Thi T. Avrahami, Lawrence Yao, Luo Si and Jamie Callan. (2006). "The FedLemur project: Federated search in the real world" In Journal of the American Society for Information Science and Technology. (JASIST) 57(3) (pp. 347-358).

Jin, R.; Si, L.; Srivastava, S.; Li, Z.; Chan, C. (2006). “A Knowledge Driven Regression Model for Gene Expression and Microarray Analysis”, EMBC Conference Proceedings (IEEE International Conference of the Engineering in Medicine and Biology Society).

Luo Si, Jie Lu and Jamie Callan. (2006). "Combining multiple resources, evidences and criteria for genomic information retrieval." In Proceedings of Text Retrieval Conference (TREC). (PDF)

(No.3 Group in passage retrieval of 2006 TREC Genomics Triage Tasks by NIST)

Hui Yang , Luo Si, Jamie Callan. (2006). "Knowledge Transfer and Opinion Detection in the TREC2006 Blog Track". In Proceedings of Text Retrieval Conference (TREC). (PDF)

 

2005

Luo Si and Jamie Callan. (2005). "Modeling Search Engine Effectiveness for Federated Search" In Proceedings of the Twenty Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Toronto, Canada: ACM. (SIGIR) (PDF)

Rong Jin, Joyce Y. Chai and Luo Si. (2005). "Learn to Weight Terms in Information Retrieval Using Category Information" In Proceedings of the 22th International Conference on Machine Learning. Bonn, Germany.  (ICML). (PDF)
 
Luo Si and Jamie Callan. (2005). “CLEF2005: Multilingual Retrieval by Combining Multiple Multilingual Ranked Lists” In C. Peters(Ed.), Results of the CLEF2005 cross-language evaluation forum (CLEF). (PDF)
(No.1 in 2005 Cross-lingual Evaluation Formul (CLEF) Multilingual Retrieval Task by European Comission
(No.1 in 2005 Cross-lingual Evaluation Formul (CLEF) Results Merging Task by European Commision)

Luo Si, Tapas Kanungo and Xiangji Huang. (2005). "Boosting Performance of Bio-Entity Recognition by Combining Results from Multiple Systems" WorkShop on Data Mining in Bioinformatics (BioKDD). (PDF)
Jimmy Huang, Ming Zhuo and Luo Si. (2005). “York University at TREC 2005: Genomic Track” In Proceedings of the 2005 Text REtrieval Conference (TREC). (PDF)
(No.1 in 2005 TREC Genomics Adhoc Retrieval Task by NIST
Luo Si and Tapas Kanungo. (2005). “Thresholding Strategies for Text Classifiers: TREC-2005 Biomedical Triage Task Experiments” In Proceedings of the 2005 Text REtrieval Conference (TREC) (PDF)
(2 No.2 in 2005 TREC Genomics Triage Tasks by NIST)
 
Luo Si and Rong Jin. (2005). "Adjusting Mixture Weights of Gaussian Mixture Model via Regularized Probabilistic Latent Semantic Analysis" In Proceedings of the Ninth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) . (PDF)

2004

Luo Si and Jamie Callan. (2004). "Unified Utility Maximization Framework for Resource Selection" In Proceedings of the 13th International Conference on Information and Knowledge Management. (CIKM) ACM. (PDF)

Luo Si and Rong Jin. (2004). "Unified Filtering by Combining Collaborative Filtering and Content-Based Filtering via Mixture Model and Exponential Model" In Proceedings of the 13th International Conference on Information and Knowledge Management. (CIKM) ACM. (PDF)

Rong Jin, Joyce Chai and Luo Si. (2004). "Effective Automatic Image Annotation via a Coherent Language Model and Active Learning" In Proceedings of the Twentieth Annual ACM International Conference on Multimedia. (ACM Multimedia) (PDF)

Rong Jin and Luo Si. (2004). "A Bayesian Approach toward Active Learning for Collaborative Filtering" In Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence. Banff, Alberta. (UAI) (PDF)

Luo Si and Jamie Callan. (2004). Chapter: "The Effect of Database Size Distribution on Resource Selection Algorithms" In Distributed Multimedia Information Retrieval , LNCS 2924, Springer (An extension of the SIGIR 2003 workshop paper) (LNCS)

Rong Jin, Joyce Zhai and Luo Si (2004)  "An Automated Weighting Scheme for Collaborative Filtering" In Proceedings of the Twenty Sixth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Sheffield, UK: ACM. (SIGIR) (PDF)

Jesse Montgomery, Luo Si, Jamie Callan and David A. Evans. (2004). "Effect of Varying Number of Documents in Blind Feedback" In Proceedings of the Twenty Sixth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Sheffield, UK: ACM. (SIGIR) (PDF)

Rong Jin and Luo Si (2004)  "A Study of Methods for Normalizing User Ratings in Collaborative Filtering" In Proceedings of the Twenty Sixth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Sheffield, UK: ACM. (SIGIR) (PDF)

 

2003

Luo Si and Jamie Callan. (2003). "A Semi-Supervised Learning Method to Merge Search Engine Results" In ACM Transactions on Information Systems , 24(4) (pp. 457-491). ACM (TOIS) (PDF)

Rong Jin, Luo Si, ChengXiang Zhai and Jamie Callan. (2003). "Collaborative Filtering with Decoupled Models for Preferences and Ratings" In Proceedings of the 12th International Conference on Information and Knowledge Management. (CIKM) ACM. (PDF)

Luo Si and Rong Jin. (2003). "Flexible Mixture Model for Collaborative Filtering" In Proceedings of the Twentieth International Conference on Machine Learning. Washington, DC USA. (ICML) (PDF)

Rong Jin, Yan Liu, Luo Si, Jamie Carbonell and Alex Hauptmann. (2003). "A New Boosting Algorithm Using Input-Dependent Regularizer" In Proceedings of the Twentieth International Conference on Machine Learning. Washington, DC USA. (ICML)  (PDF)

Rong Jin, Luo Si and ChengXiang Chai. (2003). "Preference-based Graphic Models for Collaborative Filtering" In Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence. Acapulco, Mexico. (UAI) (PS) (PDF)

Luo Si and Jamie Callan. (2003). "Relevant Document Distribution Estimation Method for Resource Selection" In Proceedings of the Twenty Fifth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Toronto, Canada: ACM. (SIGIR) (PDF)

Luo Si and Jamie Callan. (2003). "The effect of database size distribution on resource selection algorithms." In SIGIR 2003 Workshop on Distributed Information Retrieval . Toronto, Canada: ACM. (SIGIR Workshop on Distributed Information Retrieval) (PS) (PDF)

Luo Si, Jie Lu and Jamie Callan. (2003). "Distributed Information Retrieval With Skewed Database Size Distributions" In Proceedings of the8:36 PM 10/8/2003 NSF's National Conference on Digital Government Research (dg.o2003) (Dg.O) (PS) (PDF)

 

2002

Luo Si, Rong Jin, Jamie Callan and Paul Ogilvie. (2002). "A Language Model Framework for Resource Selection and Results Merging" In Proceedings of the 11th International Conference on Information and Knowledge Management. (CIKM) ACM. (PS) (PDF)

Luo Si and Jamie Callan. (2002). "Using Sampled Data and Regression to Merger Search Engine Results" In Proceedings of the Twenty Fourth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Tampere. Finland: ACM. (SIGIR) (PS)

Rong Jin, Luo Si, Alex G. Hauptmann, Jamie Callan. (2002). "Language Model for IR Using Collection Information" In Proceedings of the Twenty Fourth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Tampere. Finland: ACM. (SIGIR) (PS)

 

2001

Luo Si and Jamie Callan. (2001).  "A statistical model for scientific readability." In Proceedings of the 10th International Conference on Information and Knowledge Management. (CIKM) ACM. (PS)

Luo Si and Jamie Callan. (2001).  "Named Entity Recognition"  Technical Report  Language Technology Institute, School of Computer Science, Carnegie Mellon University.

 

2000 and Before:

Luo Si and Qixiu Hu. (2000).  "Two-Stage Speaker Identification System Based On VQ and NBDGMM" ICSLP 2000. (6th International Conference on Spoken Language Processing)  (ICSLP)

Luo Si, Qixiu Hu, Qin Jin. (1999)  "Speaker Identification via FVC" ISSPIS’99 (1999 International Symposium on Signal Processing and Intelligent System)(ISSPIS)

Luo Si, Qixiu Hu, Qin Jin. (1999)  "A Desirable Features Combination Method"  ISSPIS’99 (1999 International Symposium on Signal Processing and Intelligent System)(ISSPIS)

Jin Qin, Si Luo, Hu Qixiu. (1998)  "A High-Performance Text-Independent Speaker Identification System Based On BCDM" ICSLP’1998. (5th International Conference on Spoken Language Processing) (ICSLP)

Luo Si, Qixiu Hu, Qin Jin. (1999)  "An Unsupervised Speaker Segmentation Method During Dialogue" Signal Processing, Vol.15, pp238-241. (Chinese)

Luo Si, Qixiu Hu, Qin Jin. (1999) "A New Distance Estimation and Codebook Training Algorithm for Speaker Recognition" NCMT’1999. (8th National Conference on Multimedia Technology) (Chinese)

Jin Qin, Hu Qixiu Si Luo. (1999) "A New Set of Feathers Based on Non-All-Pole Model" Ncmmsc’1998 (5th National Conference on Man-Machine Speech Communication) (Chinese)

Luo Si, Qixiu Hu, Qin Jin. (1999) "Different Algorithms of BCDM for Speaker Identification" Ncmmsc’1998 (5th National Conference on Man-Machine Speech Communication) (Chinese)

Hu Qixiu and Si Luo. (1997) "A New Noise Reduction Algorithm" NCCIIIA’1997 (3rd National Conference of Intelligent Interface and Intelligent Application). (Chinese)