This post is over 30 days old. The position may no longer be available

Big Data Architect

Number Theory , Gurgaon · numbertheory.ai · Full-time employment · Programming

What we are looking for :

1. A successful candidate with 8+ years of experience in the role of implementation of a high-end software product.

2. Provides technical leadership in BigData space (Spark and Hadoop Stack like Map/Reduc, HDFS, Hive, HBase, Flume, Sqoop etc..NoSQL stores like Cassandra, HBase, etc) across Engagements and contributes to open source BigData technologies.

3. Rich hands-on in Spark and worked on Spark at a larger scale.

4. Visualize and evangelize next-generation infrastructure in BigData space (Batch, Near Real-time, Real-time technologies).

5. Passionate for continuous learning, experimenting, applying and contributing towards cutting edge open source technologies and software paradigms

6. Expert-level proficiency in Java and Scala.

7. Strong understanding and experience in distributed computing frameworks, particularly Apache Hadoop2.0 (YARN; MR & HDFS) and associated technologies one or more of Hive, Sqoop, Avro, Flume, Oozie, Zookeeper, etc.Hands-on experience with Apache Spark and its components (Streaming, SQL, MLLib)

8. Operating knowledge of cloud computing platforms (AWS, Azure) - Good to have

9. Operating knowledge of different enterprise Hadoop distribution (C) - Good to have

10. Good Knowledge of Design Patterns

11. Experience working within a Linux computing environment, and use of command-line tools including knowledge of shell/Python scripting for automating common tasks.

What you will do :

- Evaluate and recommend BigData technology stack best suited for NT AI at scale Platform and other products

- Lead the team for defining proper Big Data Architecture Design.

- Design and implement features on NT AI at scale platform using Spark and other Hadoop Stack components.

- Drive significant technology initiatives end to end and across multiple layers of architecture

- Provides strong technical leadership in adopting and contributing to open source technologies related to Big Data across multiple engagements 

- Designing /architecting complex, highly available, distributed, failsafe compute systems dealing with considerable scalable amount of data

- Identify and work upon incorporating Nonfunctional requirements into the solution (Performance, scalability, monitoring, etc.)

Apply for this position

Login with Google or GitHub to see instructions on how to apply. Your identity will not be revealed to the employer.

It is NOT OK for recruiters, HR consultants, and other intermediaries to contact this employer