Data Mining Engineer - Big Data/HPC
Bosch Research and Technology Center, Robert Bosch, LLC - Palo Alto, CA

This job posting is no longer available on Find similar jobs:Data Mining Engineer jobs

Tasks *
  • Develop and implement algorithms for distributed and parallel predictive analytics. *
  • Stay up-to-date with research & innovative 3rd party products addressing storage & analysis of large datasets from real-world problems. *
  • Develop distributed/parallel solutions for predictive analytics and visualization of structured & unstructured data sets. *
  • Design test cases to evaluate run-time & predictive performance of parallel/distributed algorithms. *
  • Improve scalability performance of existing storage and analytics solutions.
Requirements *
  • Practical experience in developing and parallelizing algorithms & appl. using MapReduce, MPI, or similar frameworks. *
  • Experience with distributed file systems & working knowledge of NoSQL or other distributed DTB systems. *
  • Demonstrated experience with relational database systems and familiarity with SQL. *
  • Proven expertise in applying descriptive and inferential statistics in Big Data. *
  • Competence in theory & application of standard machine learning or data mining algorithms. *
  • Need Linux OS system internals, storage concepts, & networking topologies & protocols. *
  • Experience identifying performance bottlenecks with network, I/O, OS, DBMS configuration. *
  • Experience with 2+ of the following: Java, C++ (STL), Python, Perl, MATLAB, R, SPSS, SAS. *
  • Propensity to work with stakeholders from a variety of business units & educational backgrounds. *
  • HBase, Hive, Pig, Cassandra, or similar technologies - Mahout (a plus)