Principal Platform Software Engineer
Job description
Join Ora***'s Health Data Intelligence (HDI) team as a Software Engineer 4, where you will design and build the next generation of cloud-native platforms, distributed systems, and intelligent automation solutions that power large-scale healthcare analytics. This role is ideal for engineers who enjoy solving complex software engineering challenges at scale. You will develop highly available services, reliability platforms, observability systems, automation frameworks, and AI-powered operational tooling that enable mission-critical analytics workloads across Ora*** Cloud Infrastructure and multi-cloud environments. You will partner with product, platform, data, and reliability teams to build scalable software systems that process massive datasets, improve developer productivity, automate operational workflows, and enhance platform resilience. As Ora*** continues investing in AI-native infrastructure, you will help drive the adoption of Generative AI and agent-based technologies to build intelligent operational platforms, self-service infrastructure solutions, and autonomous reliability capabilities. U.S. citizenship is required for this position, as the successful candidate will be required to obtain and maintain a U.S. government security clearance after hire. Software Engineering Strong software development experience in Java, Python, Go, or similar languages Experience designing, building, and maintaining large-scale distributed systems Strong understanding of software architecture, design patterns, and system design principles Experience developing cloud-native applications and microservices Ability to write high-quality, maintainable, and testable production code Distributed Systems & Platform Engineering Experience building highly available, fault-tolerant systems Strong understanding of scalability, performance optimization, concurrency, and resiliency patterns Experience developing platform services, shared frameworks, or developer tooling Knowledge of event-driven architectures and service-oriented systems AI-Native Engineering Experience building software solutions leveraging Generative AI, Agentic AI, or LLM technologies Experience integrating frameworks such as LangChain, AutoGen, CrewAI, or custom AI agents Ability to develop AI-powered automation for: Incident analysis Operational workflows Infrastructure lifecycle management Intelligent observability Experience designing APIs and services that incorporate AI capabilities Cloud & Platform Technologies Strong experience with OCI, AWS, Azure, or multi-cloud environments Experience building cloud-native services using Kubernetes and container technologies Knowledge of cloud architecture, networking, security, and resource optimization Experience with Infrastructure as Code using Terraform or similar technologies Data Engineering Experience building data-intensive applications and analytics platforms Knowledge of ETL pipelines and large-scale data processing frameworks Familiarity with data warehouse technologies such as Snowflake, Vertica, or equivalent platforms Understanding of distributed storage and columnar database architectures Reliability Engineering Understanding of SRE principles and operational excellence practices Experience implementing observability solutions using Prometheus, Grafana, OpenTelemetry, or similar tools Experience analyzing production issues and implementing durable software fixes Knowledge of reliability patterns, monitoring, alerting, and performance engineering Responsibilities Design & Develop Software Platforms Design, develop, and maintain cloud-native software services supporting Ora*** HDI analytics platforms Build scalable distributed systems capable of processing and analyzing large healthcare datasets Develop reusable frameworks, APIs, libraries, and platform capabilities that improve developer productivity and system reliability Contribute across the full software development lifecycle from architecture through production deployment Build Intelligent Automation Systems Develop AI-powered tools and platforms that automate operational and engineering workflows Build agent-based systems for: Incident investigation Root cause analysis Infrastructure automation Operational intelligence Design software that enables self-service infrastructure and autonomous platform operations Platform Reliability & Scalability Design software solutions that improve availability, scalability, performance, and resilience Build observability platforms and reliability tooling used across engineering teams Develop automated remediation and self-healing capabilities Drive engineering improvements through metrics, automation, and performance optimization Cloud & Infrastructure Engineering Develop software solutions that support OCI and multi-cloud deployments Build infrastructure automation frameworks and deployment tooling Improve platform efficiency, security, compliance, and operational scalability Data Platform Development Design and optimize large-scale data processing and analytics systems Develop and enhance data pipelines, processing frameworks, and warehouse integrations Improve throughput, reliability, and efficiency of data-intensive workloads Engineering Excellence Participate in architecture reviews, design discussions, and code reviews Mentor engineers and contribute to engineering best practices Collaborate across software, platform, data, and infrastructure teams Champion software quality, automation, testing, and operational excellence What You Bring 10+ years of software engineering experience building large-scale distributed systems Strong track record of delivering production software in cloud environments Experience developing cloud-native services, platforms, or infrastructure software Proven ability to lead complex technical initiatives from design through deployment Core Technical Expertise Distributed systems design Cloud-native application development Platform engineering and developer tooling Large-scale data processing systems Performance optimization and scalability AI-Native Experience Experience building AI-powered software products or developer platforms Experience integrating LLMs, agent frameworks, or intelligent automation systems into production environments Understanding of emerging AI engineering patterns and operational applications Technical Skills Java, Go, Python, or similar languages Kubernetes, Docker Terraform CI/CD platforms Prometheus, Grafana, OpenTelemetry Cloud platforms (OCI preferred)