Work place: Information Science and Technology Institute, Zhengzhou 450002, China
E-mail: raindot_ymj@163.com
Website:
Research Interests: Information-Theoretic Security, Data Mining, Network Security, Information Security
Biography
Meijuan Yin was born in Anhui Province, China at November, 1977. She was conferred a M.Sc. in computer science by the university of Zhengzhou Information Science and Technology Institute at Zhengzhou, China, in 2003. She is working on the Ph.D. in computer software and academic of the same University. After graduating from the University of Zhengzhou Information Science and Technology Institute at Zhengzhou, China, she became an assistant of the University in 2003 and turned to a lecturer in 2005. Her current research interests include data mining, social network analysis, and information security. Ms. Yin joined China Computer Federation (CCF) as a common member in 2007 and received the IEEE membership in 2010.
By Meijuan Yin Xiao Li Junyong Luo Xiaonan Liu Yongxing Tan
DOI: https://doi.org/10.5815/ijem.2011.06.03, Pub. Date: 5 Dec. 2011
Mining user identity information from emails is an important research topic in email mining. Most approaches extract an email user's name only from the header of an email, but there are often many name information in the body of emails, which are usually more suitable for representing the sender's or recipient's identity. This paper focuses on the problem of extracting email users' name aliases in the body of plain-text emails. After locating and extracting salutation and signature blocks from email bodies, we can identify the potential aliases in the salutation and signature lines, which can be directly related with the email addresses in email headers, by using named entity recognition(NER) tools. To verify and amend the potential aliases that were identified by NER tools, we propose a novel approach to extract aliases in the salutation and signature lines based on name boundary word template built on the characteristics of alias neighboring words. Results on the public subset of the Enron corpus indicate that the approaches presented in this paper can efficiently extract user's aliases from email bodies.
[...] Read more.By Meijuan Yin Junyong Luo Ding Cao Xiaonan Liu Yongxing Tan
DOI: https://doi.org/10.5815/ijigsp.2011.03.01, Pub. Date: 8 Apr. 2011
Finding out user identity information from emails is one of the important research topics in email mining. Most approaches extract an email user’s name only from the header of an email, but there are often many name information appearing in the body of emails, and those names are usually more suitable for representing the sender’s or recipient’s identity. This paper focuses on the problem of extracting email users’ name aliases in the body of plain-text emails. After locating and extracting salutation and signature blocks from email bodies, we can identify the potential aliases in the salutation and signature lines, which can be directly associated with the corresponding email address in email headers, by using named entity recognition(NER) tools. However the identified aliases may be half-baked or there are still some potential aliases that can’t be correctly identified. So we propose a novel approach to efficiently and accurately extract aliases in the salutation and signature lines based on name boundary word template built on the characteristics of alias neighboring words. Results on the public subset of the Enron corpus indicate that the approaches presented in this paper can efficiently extract user’s aliases from email bodies.
[...] Read more.Subscribe to receive issue release notifications and newsletters from MECS Press journals