About the role
<h2>Who we are</h2> <h3>About Stripe</h3> <p>Stripe is a financial infrastructure platform for businesses. Millions of companies—from the world's largest enterprises to the most ambitious startups—use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of work ahead. That means you have an unprecedented opportunity to put the global economy within everyone's reach while doing the most important work of your career.</p> <h3>About the team</h3> <p>The Big Data Infrastructure operates the critical infrastructure that powers the batch data processing at Stripe. The team supports a variety of use cases, including Payment, Ledger, ML, Fraud Detection, Product Analytics, Regulatory Reporting, Financial Data Reconciliation, and externally facing products like Radar and Sigma. As an example of the scale, the team's systems serve hundreds of teams, thousands of workflows, 100,000+ task executions, O(billion) transformations, and moving terabytes of data processing over 1 GB/second every day. Our users inside Stripe include other engineering teams, Data Scientists, Sales and Operations, Finance, etc.</p> <p>Data Orchestration builds and operates the time-based and event-based orchestration infrastructure that powers and accelerates batch data pipelines. The team operates on a wide range of tech stacks including Airflow, Spark, SQL, Kafka, Flink, Hive MetaStore, Trino, Pinot, Python, Java, Scala, S3, and Iceberg.</p> <h2>What you'll do</h2> <p>As a Software Engineer on this team, you'll design and build infrastructure that powers batch data processing at Stripe.</p> <h3>Responsibilities</h3> <ul> <li>Design, build, and maintain next-generation and first-generation versions of key Data Platform products, with an emphasis on usability, reliability, security, and efficiency.</li> <li>Design ergonomic APIs and abstractions that build a great customer experience for internal Stripes, that will in turn enhance the experience of millions of Stripe users.</li> <li>Ensure operational excellence and enable a highly available and reliable Data Orchestration platform across batch workloads.</li> <li>Collaborate with high-visibility teams and their stakeholders to support their key initiatives—while building a robust platform that benefits all of Stripe in the long term.</li> <li>Plan for the growth of Stripe infrastructure by unblocking, supporting, and communicating with internal partners to achieve results.</li> <li>Connect your work with improvements in the usability and reliability of open source software (OSS) like Apache Airflow, Iceberg, and Spark, and contribute back to the OSS community.</li> </ul> <h2>Who you are</h2> &l