MADlib is an open-source library for scalable in-database analytics. It provides data-parallel implementations of mathematical, statistical and machine learning methods for structured and unstructured data. We aim to foster widespread development of scalable analytic skills, by harnessing efforts from commercial practice, academic research, and open-source development.
We are looking for excellent students with a background in machine learning, statistics and data mining. People like you, who are passionate about developing quality scalable software and make a real impact on the diverse and rapidly expanding 'big data' market.
You will be responsible for helping to design and develop machine learning algorithms that will be deployed against large clusters of hardware at extreme scales of data to solve a variety of real world problems.
Requirements:
* Strong background in machine learning /data mining/kdd.
* Excellent programming skills in C/C++, Python.
* Experience working with databases.
Desired Skills:
* Experience with database internals, Postgres or other mpp systems.
Please send your chinese or english resume to myang@pivotal.io
--
FROM 106.39.20.*