This role will work closely with Hadoop Developers, Data Scientists, and IT and the key responsibility is to
support the Hadoop platform & associated devops operations
1.Diagnose, assess, troubleshoot and fix issues within the Open Source environment.
2.Performance tuning, troubleshooting, resource and capacity management of users and scheduled jobs.
3.Enforce security compliance and BI governance rules.
4.Documentation of all environmental settings and configurations.
5.Planning and upgrading of the environment, both hardware and software (where applicable).
6.Provide front line support to teams using the Hadoop environments.
7.Ensure availability and stability of the systems in production.
8.Along with the rest of the team, actively research and share learning/advancements in the Hadoop space,
especially related to administration.
1.Candidates with over 5 to 8 years of relevant IT experience will be considered .
2.Having production-grade HDFS admin experience on either Cloudera (CDH 5.x),
Hortonworks (HDP 2.x), Apache open-source or comparable Hadoop distribution
3.Having hands-on experience working with HDFS shell, YARN, Cloudera (or equivalent),
sqoop,NIFI, oozie hive , impala, pig, kafka, ELK ,spark ,storm and any other big data
stacks available in market
4.Having experience with any of NOSQL databases like HBASE, MangoDB, Cassandra or
any other db
5.Good to have real time experience with Vertica Analytics Platform
6.Having hands-on experience working with UNIX, RHEL Linux, or similar filesystem OS
7.Proven ability with AWS administration activities on key services like EC2,S3,SQS and so on
8.Proven ability to setup, expand ,upgrade & configure a cluster for big data solutions on
private cloud infrastructure and AWS Cloud Infrastructure
9.Proven ability to secure (Kerberos or comparable) a cluster and integration with enterprise
10.Proven ability on arriving backup strategies for big data systems
11.Proven ability in setting up real time streaming application with big data eco system
12.Expert knowledge of Hadoop hardware and network infrastructure;
13.Experience working with RDBMS and DW products in sysadmin role will be an added advantage
14.Good to have person worked in ETL projects;
15.Proven ability to install and configure software binaries for key BI/stats products (e.g.
Qlik, SAS, Tableau, Cognos, Excel)
16.Experience working in a DEVOPS model and familiarity with its toolsets (git, jenkins , ansible
17.Proven experience in defining, developing administration standards, policies & procedures
18.Strong communication, influencing and collaboration experience with all levels of the organization