Overview
The DevOps/DDI Engineer is responsible for maintaining the enterprise DDI (DNS, DHCP and IPAM) infrastructure and developing automation scripts and tools to improve network reliability and visibility.
Key Responsibilities
- Manage the product lifecycle of operational system infrastructure (DDI and automation tools) to ensure operational availability; design and specify changes for implementation.
- Receive performance data and analyze the performance of installed technologies. Propose and implement any required changes, including identifying and planning for any resulting impacts on other technologies.
- Continue to develop expertise in their domain and seek to expand their knowledge into other areas of the network infrastructure.
- Resolve recurring problems and document appropriate procedures.
- Prepare reports on a monthly/quarterly basis for project metrics, SLAs, and achievements.
- Document new procedures in a clear and concise manner. May participate in the development of new or modified procedures.
- Provide direction, technical guidance, and mentorship to team members, as appropriate.
- May lead moderate to large sized projects.
- Oversee implementation efforts related to their area of responsibility.
- Possess experience in implementing procedural solutions to support and monitor the effective operations of an element(s) of the network infrastructure.
- Understand the business impact of different solutions, and assess the tradeoffs between business needs, technology requirements, and costs.
- Conduct complete diagnostics of most business problems, factoring in a strong understanding of the technical architecture.
- Find solutions to and develop documentation for advanced operations problems.
- Follow and execute standardized procedures, under guidance. Supervisor is readily available to address non-standard situations.
- Maintain the World Bank Group’s DDI infrastructure based on Infoblox.
- Manage cloud-based DNS components in AWS and Azure clouds.
- Automate the environment using various scripts and tools like Python, Perl, Ansible and Terraform.
- Enhance the team’s proactive monitoring capability by creating automated reports/alerts for network events.
- Ensure compliance with various security standards for the environment.
- Handle escalation calls from the NOC, perform triage, and resolve problems.
- Provide continuous monitoring, troubleshooting, and maintenance of the various network equipment and associated peripherals, including adequate follow up on Problem Resolution tracking using the ServiceNow tool.
- Monitor and review use of systems for World Bank access control policy violations in line with World Bank’s guidelines.
- Ensure that the defined decommissioning and disposal procedures are followed for all hardware systems and media.
- Report and respond to critical security events and take corrective measures per defined security incident management processes.
- Follow and comply with the World Bank Group policies, processes, and procedures.
- Participate in audits, as needed, producing necessary documentation, reports, and explanations, and implement corrective and preventive action plans approved by unit managers.
- Take personal responsibility and accountability for timely response to client queries, requests or needs, while working to remove obstacles that may impede execution or overall success.
- Able to present and explain technical information to diverse types of audience (management, users, vendor, and technical staff).
Required Experience
- Bachelor’s degree with 7 years of experience or master’s degree with 5 years relevant experience OR equivalent combination of education and experience.
- Hands-on experience with Infoblox DDI in a large enterprise network.
- Extensive experience using APIs with Infoblox, Cloud providers, ServiceNow and Splunk.
- Experience in automating tasks using scripting languages and tools like Python, Perl, Ansible and Terraform.
- Hands-on experience with Azure DevOps.