This engineering role works on developing algorithms to analyze and organize web-scale data – processing millions of web pages and extracting interesting pieces of data, including: crawled data from client websites and the web at large. The engineer will both be working to improve the depth of BloomReach’s existing algorithms to improve its efficacy and also to innovate and build new products that will extend the current suite of services offered by BloomReach.
Develop machine learning and data mining algorithms to extract meaningful information from tens to hundreds of millions of crawled web-pages
Improve web search relevance for proprietary search service
Develop models to predict web search query patterns
Develop algorithms to cluster and organize web pages by similar content
BS/MS/PhD in Computer Science or related fields
Experience with analyzing large data sets
Comfortable with machine learning, data mining, information retrieval, discrete algorithms
Experience with implementing algorithms in applied, real-world settings
Experience with Bash, Java, Hadoop, Lucene, Python, Amazon Web Services a plus
Experience with Search, SEO a plus
If this is you and you can prove it – we’re interested in talking with you to be one of BloomReach’s Software Engineers - Data. If you can also send at least one cool piece of code, or a link to something you’ve built, or a hack that you’re proud of, we’d love to see it.
BloomReach helps its customers manage potential duplicate content by creating clusters of equivalent pages and nominating a...