Early applicant
On-site
Full-time
Requirements
- **Platform Engineering:
- Cluster Management:
- Expertise in design, implement, and maintain Hadoop clusters in large volume, including components such as HDFS, YARN, and MapReduce.
- Collaborate with data engineers and data scientists to understand data requirements and optimize data pipelines.
- Administration and Monitoring:
- Experience in administering and monitoring Hadoop clusters to ensure high availability, reliability, and...
- Cluster Management:
Skills
Hadoop
Platform engineering
Cluster management
HDFS
YARN
MapReduce
Data pipelines
Administration
Monitoring
Troubleshooting
Security implementation
Authentication
Authorization
Encryption
Backup and disaster recovery
Performance optimization
Capacity planning
Automation
DevOps
Technology adoption
Documentation
Technical support
User interface design
Role-based access control (RBAC)
Resource management
Self-service provisioning
Automated scaling
Job scheduling
Data ingestion
Query optimization
ETL
Apache NiFi
Data modeling
Database design
SQL
Streaming data processing
Apache Kafka
Spark Streaming
Data quality
Data governance
Workflow orchestration
Apache Airflow
Data warehousing
Version control
Git
Data scientists
Data security
Compliance
Data catalog
Metadata management
Apache Flink
Apache Beam
Data transformation
Data serialization
Avro
Parquet
Problem-solving
Analytical thinking
Collaboration
Teamwork
Adaptability
Continuous learning
Performance monitoring
Security best practices
Ansible
Observability
Networking
Cloudera CDH/CDP
Data Bricks
HD Insights
Spark
Hive
Impala
HBase
Kudu
Sqoop
Oozie
RHEL Linux
Linux system administration
Apache Ambari
Cloudera Manager
Bash
KSH
Kerberos
LDAP
AD
Scalability
Hadoop Platform Engineer | Onsite | Dallas, TX
Photon
Dallas
Early applicant
On-site
Full-time
Requirements
- **Platform Engineering:
- Cluster Management:
- Expertise in design, implement, and maintain Hadoop clusters in large volume, including components such as HDFS, YARN, and MapReduce.
- Collaborate with data engineers and data scientists to understand data requirements and optimize data pipelines.
- Administration and Monitoring:
- Experience in administering and monitoring Hadoop clusters to ensure high availability, reliability, and...
- Cluster Management:
Skills
Hadoop
Platform engineering
Cluster management
HDFS
YARN
MapReduce
Data pipelines
Administration
Monitoring
Troubleshooting
Security implementation
Authentication
Authorization
Encryption
Backup and disaster recovery
Performance optimization
Capacity planning
Automation
DevOps
Technology adoption
Documentation
Technical support
User interface design
Role-based access control (RBAC)
Resource management
Self-service provisioning
Automated scaling
Job scheduling
Data ingestion
Query optimization
ETL
Apache NiFi
Data modeling
Database design
SQL
Streaming data processing
Apache Kafka
Spark Streaming
Data quality
Data governance
Workflow orchestration
Apache Airflow
Data warehousing
Version control
Git
Data scientists
Data security
Compliance
Data catalog
Metadata management
Apache Flink
Apache Beam
Data transformation
Data serialization
Avro
Parquet
Problem-solving
Analytical thinking
Collaboration
Teamwork
Adaptability
Continuous learning
Performance monitoring
Security best practices
Ansible
Observability
Networking
Cloudera CDH/CDP
Data Bricks
HD Insights
Spark
Hive
Impala
HBase
Kudu
Sqoop
Oozie
RHEL Linux
Linux system administration
Apache Ambari
Cloudera Manager
Bash
KSH
Kerberos
LDAP
AD
Scalability