Responsibilities:
- Help shape and operate infrastructure and deployment systems
- Contribute to the design, build, and maintenance of the cloud infrastructure, CI/CD pipelines, and deployment tooling that engineering teams rely on every day
- Drive reliability and observability
- Implement monitoring, alerting, and incident response practices that keep systems healthy and surface issues before customers feel them
- Champion security and compliance
- Partner with security and compliance teammates to ensure infrastructure aligns with VIA's security obligations (e.g., CJIS, CMMC, and SOC 2), embedding security controls into automated pipelines rather than bolting them on
- Partner across the engineering organization
- Work closely with engineering leadership, Client Delivery Leads, and product teams to ensure infrastructure work is properly prioritized and accounted for during project planning and resource allocation
- Partner with the team to identify and implement AI tools and agents
- Eliminate manual bottlenecks across the DevOps lifecycle by applying AI to areas such as automated remediation, log anomaly detection, infrastructure-as-code generation and review, and compliance evidence collection
Qualifications:
What you will bring to this role:
- Bachelor’s or master’s degree in computer science, engineering, information systems, or a related field
- 5+ years of experience in a DevOps, SRE, or infrastructure engineering role, with demonstrated success in a high-growth technical environment
- Deep technical understanding of IT infrastructure integration (e.g., DNS, storage systems, firewalls), Unix/Linux, TCP/UDP/IP behavior, and application data flows between systems (a must-have)
- High proficiency in application deployment (e.g., Kubernetes, Docker, and IaC tools such as Terraform), configuration, and monitoring (e.g., system logs, Kibana, Grafana)
- In-depth experience with AWS; experience with Azure or GCP is a bonus
- Experience managing network infrastructure, protocols, troubleshooting, and security (e.g., firewalls, security groups, OS/container hardening, penetration testing)
- Experience adhering to the standards of compliance frameworks (e.g., CJIS, CMMC, and SOC 2)
- Proficiency in Python and shell scripting
- Proven ability to independently manage priorities and plan effectively across multiple upcoming development cycles (sprints), anticipating future dependencies and resource needs
- Proven track record of building strong stakeholder relationships, with excellent interpersonal and presentation skills to communicate and influence effectively
- Experience or interest in the following skills is also a plus:
- Deployment of blockchain nodes (e.g., experience with configuration of Ethereum miners)
- Application orchestration
- Service mesh technology (e.g., Istio)
- Managing IoT devices as part of an Operational Technology system