DevOps
Join us as a visionary DevOps Engineer, adept in blockchain & Web3, CI/CD optimization, cross-cloud management, and system reliability
Who we are
Imperator.co is a leading proof-of-stake node operator, securing over 45 blockchains, including Cosmos, Ethereum, and Sui, with more than $400M in assets staked. Serving a global customer base of 200,000+, we specialize in Data Engineering, supporting Cosmos infrastructure. Trusted by dYdX, Osmosis, Axelar, Coingecko, and others, we contribute to major protocols' operations.
Imperator.co proudly collaborates with dYdX, contributing as a key team running the indexer for dYdX v4. Our role is critical in ensuring traders have access to accurate, real-time asset pricing and trading data. Our goal is to empower and educate, making the Cosmos ecosystem more accessible. We provide people with extensive research papers, founders interview, weekly newsletter.
Our commitment extends to diverse professional services, including expert consultation in areas such as tokenomics, marketing, strategy, and infrastructure. We offer technical support, tailored staking services for institutional clients, and white-labeling solutions.
Who we're looking for
We are seeking a visionary and proactive **Lead DevOps Engineer** who is passionate about innovation and has a deep understanding of blockchain technologies and Web3. The ideal candidate will take charge in leading our DevOps initiatives, managing and optimizing CI/CD pipelines, enhancing on-call alerting systems, and ensuring robust IT incident management. Your role is crucial in maintaining our infrastructure's operational integrity, facilitating seamless deployments, and contributing to our overall mission.
Leadership and Initiative: Lead and drive DevOps initiatives, demonstrating proactive problem-solving and a forward-thinking approach.
Optimizing CI/CD Pipelines: Manage and enhance continuous integration and continuous deployment processes to ensure smooth and efficient deployment of software.
Enhancing On-Call Alerting Systems: Improve alerting systems to ensure timely responses to incidents, maintaining system reliability.
Robust IT Incident Management: Establish and follow thorough procedures for managing IT incidents to minimize downtime and impact on operations.
Infrastructure Operational Integrity: Confidently manage and maintain the operational integrity of our infrastructure across multiple cloud providers, ensuring high availability and resilience.
Seamless Deployments: Facilitate the smooth and efficient deployment of software, minimizing disruptions and ensuring continuity of service.
Responsibilities
Security and Compliance: Maintain security and compliance standards, ensuring all systems adhere to company policies and industry regulations.
Infrastructure as Code (IaC): Design, provision, and manage infrastructure through IaC using Terraform, Ansible, and GitOps, adhering to established company best practices.
Automation and Efficiency: Develop tools, scripts, and playbooks to speed up processes and enhance efficiency across our organization.
Containerization and Orchestration: Experience in containerization with Docker, and familiarity with orchestration.
Cloud Expertise: Hands-on experience with on-premise and public clouds (GCP preferred), and a solid understanding of service offerings and management tools from major cloud providers such as AWS and GCP.
Proactive Security Management: Plan and implement necessary security updates and patches to ensure infrastructure reliability and safety.
Collaboration and Communication: Actively participate in architectural discussions, technical presentations, and the review process to enhance system designs and practices. Strong communication skills are essential for this fully remote role that involves working with multiple stakeholders across all levels of Engineering.
On-Call Responsibility: Willingness to be on-call, including weekends, for critical alerts to ensure system reliability. We consider on-call as a crucial component of a reliable system.
Technical Proficiencies
Proficient in Linux, Docker, AWS, Ansible, Terraform, and familiar with cloud-native architectures.
Expertise in systems administration, site reliability, and DevOps practices, with a focus on enhancing reliability and performance through proactive monitoring and collaboration.
Creating scalable, secure infrastructure with robust monitoring, logging, and metrics to support dynamic scaling and reliability.
Establishing automated pipelines for comprehensive testing, validation, and deployment, including code approval processes and release management.
Managing test and production nodes across different ecosystems like Cosmos, ETH, Sui, etc.
Embracing automation with Python/Bash scripting to streamline tasks and enhance efficiency across our organization.
Nice to have
Familiarity with blockchain nodes is advantageous
Benefits
Work from anywhere in the world
Flexible working schedule
Financial Support for education/courses
Annual bonus
Stipends to help get your work equipment without compromises
And more
Let us know how we can make this the best place to work for you.
How to Apply
Does this role sound like a good fit? Fill this form
Imperator.co welcomes all qualified people to apply regardless of race, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. Compensation will be competitive and commensurate with experience.
Process
Shortlisted candidates will then proceed through the following steps:
Screening interview - an initial discussion to assess general qualifications, motivation, and alignment with our team’s values and culture
Technical interview - a comprehensive evaluation to assess technical abilities
Take home assignment - a practical take-home assignment where candidates demonstrate their skills through hands-on tasks related to their day-to-day activities, showing how they would contribute to our team’s dynamics
Contact
Contact Imperator.co if you have any questions regarding the application: