Intern recruiting in Information Extraction and Web Data Mining Area
Working time: Full time or part time
Location: Beijing, Haidian District, Tsinghua Science Park, HP Labs China
If you are interested in Information Extraction and Web Mining area,
please contact: shicong.feng@hp.com
Information Extraction focuses on extracting structured data from unstructured data
(web page, text, email..), including entity extraction, entity relation extraction,
web content/structure analysis and data visualization. The research results will be
integrated in a real-world enterprise search engine.
Applicants must have:
1. Motivated Master’s or Ph.D. candidate in Computer Science or related technical discipline
2. Outstanding problem-solving and very strong programming skills (Java/C/C++ programming)
3. In-depth knowledge of computer algorithms and data structure.
4. Good mathematical background.
5. Excellent written and verbal communication skills
6. Good team working and research passion
Experience and knowledge at least in one of the following areas:
7. Search engine and web mining
8. Information extraction / information retrieval
9. Machine learning
10. Natural language processing
Familiar with the followings is a plus:
11. Web search technology and Web programming
12. Web content/structure analysis
13. Nutch, Lucene and MapReduce
--
FROM 123.120.134.*