Professional Record

Home

Education

       Work Experience   Publications    Systems   Teaching


Research Interests
I have great interest in Information Management and Security/Privacy. It all started when I designed and developed a VB/SQL based software for maintaining my home library. At Syracuse University, I got into Computer Security and Applied Cryptography, which have been my other interests ever since. My current interests include Search Privacy, Query log Anonymization, Document Clustering, Efficient Cryptographic Protocols, Privacy Preserving Data Mining and Automatic Text Anonymization. I am mostly working on any of these at the moment.

Education

   Ph.D. in Computer Science,   Aug 2004 - Present
    
Purdue University, USA
    Advisor: 
Prof. Chris Clifton

   M.S. in Computer Science,    Aug 2002 - June 2004
     Syracuse University, USA
     Advisor:
Prof. Wenliang Du

  B.E. in Computer Science and Engineering   Aug 1998 - May 2002
    
University of Madras, India
     (First class with distinction, Top 1% )


Work Experience

  TRAC,  Database Programmer,  Aug 2002 - June 2004
     
Syracuse University, USA
     Mentor:  Dr. Susan Long

   IBM Research Labs,  Research Intern,  June 2006 - Aug 2006
     New Delhi, India
     Mentor: Dr. Prasan Roy & Dr. Mukesh Mohania


Honors and Awards
  • President of Upsilon Pi Epsilon (CS honor society), Aug. 2007 - Aug. 2008.
  • Best Project Presentation in Ph.D. category, Summer 2006, IBM India Research Labs.
  • 2nd prize in the 2005 Burton Morgan Business Plan Competition, Purdue University, $15K cash award.
  • Member of Phi Delta Beta, Syracuse University, Spring 2003.
  • Ranked 17th (Top 1%) overall in Computer Science and Engineering, University of Madras, 2002.

Publications

Book Chapter:

1.  Is Privacy Still an Issue for Data Mining?,
               Chris Clifton, Wei Jiang, Mummoorthy Murugesan and M. Ercan Nergiz, Chapter 18 in Next Generation of Data Mining, Hillol Kargupta, Jiawei Han, Philip Yu, Rajeev Motwani, and Vipin Kumar (Eds.), CRC Press, 2008.

Journal:

1. Efficient Privacy-Preserving Similar Document Detection,
               Mummoorthy Murugesan, Wei Jiang, Chris Clifton, Luo Si and Jaideep Vaidya, accepted for publication in International Journal of Very Large Data Bases (VLDB Journal), VLDB Endowment.

Conference and Workshop:

1.  t-Plausibility: Semantic Preserving Text Sanitization
           
         Wei Jiang, Mummoorthy Murugesan, Chris Clifton and Luo Si
      IEEE International Conference on Privacy, Security, Risk and  Trust (PASSAT- 09), Vancouver, Canada, August 29-31, 2009. (Acceptance Rate: 13%).

        Anonymizing text documents such as Medical reports is usually handled by removing sensitive words. However, this defeats the purpose of releasing such documents for public use. While these documents may contain identifying information, we propose techniques based on generalizing terms so that identification is minimized, yet the information content in documents is preserved.  We use ontology (eg., wordnet) based generalization to deal with general text documents.


2.  Providing Privacy through Plausibly Deniable Search
                 Mummoorthy Murugesan and Chris Clifton
      SIAM International Conference on Data Mining (SDM09), Sparks, Nevada, USA, April 30 - May 2, 2009.

     This is an extended version of SKM 2008 paper, with more efficient schemes and a better formalization of Plausibly Deniable Search. In this scheme, a user query is issued as a standardized query, along with masking queries that are on different topics. This provides the user privacy through deniability - he could deny issuing any particular query as any other query could have been the actual user query. We use Latent Semantic Indexing, Query Topic Clustering, DMOZ dataset, Lemur Toolkit for the experiments.


3.  Plausibly Deniable Search
               Mummoorthy Murugesan and Chris Clifton
     Workshop on Secure Knowledge Management 2008,  Dallas, Texas, USA, Nov. 3-4, 2008.

     This paper introduces the idea of Plausibly Deniable Technique as a privacy-protection mechanism in a search environment, mainly web search. Noting that Private Information Retrieval is not practical with web search as it affects the advertisement business model, we offer a solution based on the deniable search - the user issues k queries and any one of them is plausibly original user query. The experiments used TREC-4 dataset that shows some very preliminary results - more comprehensive experiments are presented in the SDM 2009 paper.


4. Is Privacy Still an Issue for Data Mining?,
               Chris Clifton, Wei Jiang, Mummoorthy Murugesan and M. Ercan Nergiz,
     National Science Foundation Symposium on Next Generation of Data Mining and Cyber-Enabled Discovery for Innovation (NGDM'07), USA, Oct. 10-12, 2007.

5.  Similar Document Detection with Limited Information Disclosure
                Wei Jiang, Mummoorthy Murugesan, Chris Clifton and Luo Si
      In proceedings of the 24th International Conference on Data Engineering (ICDE 2008), Cancun, Mexico, April 2008.

    Suppose two parties want to find similar documents in their collection without releasing the contents of their own documents to the other party. This paper proposes a way to construct cosine similarity securely, which is then used for finding similar documents across different collections. Experiments are performed using Lemur toolkit and Paillier encryption scheme.


6. Secure Content Validation
                Mummoorthy Murugesan and Wei Jiang
  
   In Privacy Data Management Workshop,  IEEE 23rd International Conference on Data Engineering (ICDE), Istanbul, Turkey, April 2007.

     Data provenance in the presence of confidential sources make it impossible to declare and tag documents with their plaintext sources. This paper proposes a framework that preserves the confidentiality of sources allowing all data provenance operations.


7. Balancing Data quality against time and money constraints
               Susan Long, Linda Roberge, Jeff Lamicela, Mummoorthy Murugesan
  
   In SASŪ Users Group International (SUGI 29), Montreal, Canada, March 2004.

 

8. Uncheatable Grid Computing
                Wenliang Du, Jing J, Manish M and Mummoorthy Murugesan.
     The 24th International Conference on Distributed Computing Systems (ICDCS'04), Tokyo, Japan, March 2004.
    The 7th Most cited paper in Computer Science, published in 2004. Citeseer stats.

 


Systems

I like to see systems built out of my research works. I have built the following systems in recent times. These are available for any research purposes. Don't hesitate to send me a note if you would like to use any of the following systems.

1. Document Detection with Limited Information Disclosure (Lemur, Paillier)

2. Plausibly Deniable Search (Lemur, Lucene, DMOZ, SVD)

3. Text Anonymization (Wordnet, Lemur)


Teaching
    I enjoy teaching! At Purdue I was a teaching assistant for the course CS 352  (an undergraduate compilers course) in Fall 2004 and Spring 2005. These TA assignments were really exciting and also a great learning experience. You can get some slides that I prepared for the PSOs at this link.