|
Book Chapter:
1. Is Privacy Still an Issue for Data
Mining?,
Chris Clifton,
Wei Jiang,
Mummoorthy Murugesan and M. Ercan Nergiz,
Chapter 18 in Next Generation of Data Mining,
Hillol Kargupta, Jiawei Han, Philip Yu, Rajeev Motwani, and Vipin Kumar (Eds.), CRC Press, 2008.
|
Journal:
1. Efficient Privacy-Preserving Similar Document Detection,
Mummoorthy Murugesan, Wei Jiang,
Chris Clifton,
Luo Si and Jaideep Vaidya,
accepted for publication in International
Journal of Very Large Data Bases (VLDB Journal), VLDB Endowment.
|
Conference and Workshop:
1. t-Plausibility: Semantic
Preserving Text Sanitization
Wei Jiang, Mummoorthy Murugesan, Chris Clifton and Luo Si
IEEE
International Conference on Privacy, Security, Risk and Trust (PASSAT-
09), Vancouver, Canada, August 29-31, 2009. (Acceptance
Rate: 13%).
Anonymizing text documents
such as Medical reports is usually handled by removing sensitive words.
However, this defeats the purpose of releasing such documents for public
use. While these documents may contain identifying information, we
propose techniques based on generalizing terms so that identification is
minimized, yet the information content in documents is preserved.
We use ontology (eg., wordnet) based generalization to deal with general
text documents.
|
|
2. Providing Privacy through
Plausibly Deniable Search
Mummoorthy Murugesan and Chris Clifton
SIAM International Conference on Data Mining (SDM09),
Sparks, Nevada, USA, April 30 - May 2, 2009.
This is an extended version of SKM 2008
paper, with more efficient schemes and a better formalization of
Plausibly Deniable Search. In this scheme, a user query is issued as a
standardized query, along with masking queries that are on different
topics. This provides the user privacy through deniability - he could
deny issuing any particular query as any other query could have been the
actual user query. We use Latent Semantic Indexing, Query Topic
Clustering, DMOZ dataset, Lemur Toolkit for the experiments.
|
|
3. Plausibly Deniable Search
Mummoorthy Murugesan and Chris Clifton
Workshop on Secure Knowledge Management 2008, Dallas, Texas,
USA, Nov. 3-4, 2008.
This paper introduces the idea of Plausibly Deniable
Technique as a privacy-protection mechanism in a search environment,
mainly web search. Noting that Private Information Retrieval is not
practical with web search as it affects the advertisement business
model, we offer a solution based on the deniable search - the user
issues k queries and any one of them is plausibly original user query.
The experiments used TREC-4 dataset that shows some very preliminary
results - more comprehensive experiments are presented in the SDM 2009
paper.
|
|
4. Is Privacy Still an Issue for Data
Mining?,
Chris Clifton,
Wei Jiang, Mummoorthy
Murugesan and M. Ercan
Nergiz,
National Science Foundation Symposium on Next
Generation of Data Mining and Cyber-Enabled Discovery for Innovation
(NGDM'07), USA, Oct. 10-12, 2007.
|
|
5. Similar Document
Detection with Limited Information Disclosure
Wei Jiang, Mummoorthy Murugesan, Chris Clifton and Luo Si
In proceedings of the 24th International Conference on Data Engineering (ICDE 2008),
Cancun, Mexico, April 2008.
Suppose two parties want to find similar
documents in their collection without releasing the contents of their
own documents to the other party. This paper proposes a way to construct
cosine similarity securely, which is then used for finding similar
documents across different collections. Experiments are performed using
Lemur toolkit and Paillier encryption scheme.
|
|
6. Secure Content Validation
Mummoorthy Murugesan and Wei Jiang
In Privacy Data Management Workshop, IEEE 23rd International Conference on Data Engineering (ICDE), Istanbul, Turkey,
April 2007. Data provenance in the presence
of confidential sources make it impossible to declare and tag documents
with their plaintext sources. This paper proposes a framework that
preserves the confidentiality of sources allowing all data provenance
operations.
|
|
7. Balancing Data quality
against time and money constraints
Susan Long, Linda Roberge, Jeff Lamicela, Mummoorthy Murugesan
In SASŪ Users Group International (SUGI 29), Montreal, Canada,
March 2004. |
8. Uncheatable Grid Computing
Wenliang Du,
Jing J, Manish M and Mummoorthy Murugesan.
The 24th International Conference on Distributed Computing Systems (ICDCS'04), Tokyo, Japan,
March 2004.
The 7th Most cited paper in Computer Science, published in 2004. Citeseer stats.
|
|