This engineering role focuses on the core systems challenges of scaling BloomReach’s web-scale crawling, data processing, and serving systems and infrastructure.
Scale BloomReach’s Hadoop-based data processing pipelines -- including crawling, parsing, indexing, semantic analysis and language modeling, and analytics -- to handle and maintain complex processes in an efficient and reliable way.
Core architecture and development of new features and improved performance for high-traffic, high-availability web services.
Build infrastructure and tools to increase automation, improve efficiency of the engineering team, and maintain technical excellence in the code base.
BS/MS degree in Computer Science or related field.
Extensive background in algorithms and strong software architecture skills.
Expert proficiency in at least two common languages, such as Java, C++, Python, Ruby.
Experience with maintaining distributed systems at significant scale in a production environment.
Strong knowledge of web technologies, including details of HTTP, common web frameworks such as Tomcat or Django, networking, and web performance engineering.
Experience with map-reduce or large-scale data processing (e.g Hadoop), Linux serving systems, and MySQL a plus.
If this is you and you can prove it – we’re interested in talking with you to be one of BloomReach’s Software Engineers - Backend. If you can also send at least one cool piece of code, or a link to something you’ve built, or a hack that you’re proud of, we’d love to see it.
BloomReach helps its customers manage potential duplicate content by creating clusters of equivalent pages and nominating a "canonical" page...