Job Information
Microsoft Corporation Principal Site Reliability Engineer in Burlington, Massachusetts
There has never been a more exciting time to be working in healthcare at Microsoft. Our Health & Life Sciences Solutions organization is an interdisciplinary team of product managers, designers, engineers, and clinicians who are designing, developing and deploying next-generation healthcare solutions powered by the Microsoft Cloud for healthcare organizations around the world.
As a Principal Site Reliability Engineer, you will be an integral member of a team within HLS Solutions that is working to empower clinicians to achieve more with groundbreaking healthcare-oriented copilots. The candidate will be excited about waking up every morning to apply their skills in data storage, performance tuning, and optimization to develop and deploy new technologies and experiences centered around driving positive healthcare outcomes.
The candidate will be excited about waking up every morning to apply their skills in developing and deploying new technologies and experiences centered around driving positive healthcare outcomes.
We are seeking a highly skilled and experienced Principal Site Reliability Engineer to join our team. In this role, you will be responsible for ensuring the availability, reliability, and performance of our database systems, with a focus on Azure SQL and CosmosDB. You will also help shape and improve our overall data acquisition and data handling strategy. As a Principal Site REliability Engineer within our Engineering organization, you will drive our strategy and adoption of database technologies. You will work closely with cross-functional teams, including software engineers, platform engineers, and product architects, to design, implement, and maintain robust and scalable database solutions. The candidate has a background in database administration, migrations, troubleshooting, and optimization, particularly in high-volume transactional environments.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
What You'll Do:
Design, deploy, and manage highly available, reliable, and scalable database architectures for Azure SQL and other database technologies.
Monitor and optimize database performance, identifying and resolving performance bottlenecks and other issues to ensure optimal system performance.
Develop and implement database backup and disaster recovery strategies to ensure data integrity and minimize downtime.
Perform database capacity planning and resource utilization analysis to ensure adequate scalability and efficient resource allocation.
Collaborate with development teams to provide guidance and support in optimizing database queries, data models, and application integration.
Troubleshoot and resolve database-related incidents and issues, including data corruption, connectivity problems, and performance degradation.
Implement and enforce security and access control measures to protect sensitive data.
Proactively identify potential issues, risks, and areas for improvement in the database infrastructure, and propose and implement solutions.
Find efficiencies in our data handling processes.
Coordinate with the analytics teams on reporting and data warehousing.
Stay up-to-date with the latest database technologies, tools, and practices, and recommend their adoption when appropriate.
Create and implement monitoring and alerting solutions to detect and prevent issues before they impact our customers.
Troubleshoot and resolve complex issues in production environments.
Participate in on-call rotation to ensure 24/7 availability of our systems and services.
Lead and mentor other members of the Site Reliability Engineering team.
Continuously identify and implement process improvements to increase efficiency and reduce risk.
Embody our culture (https://careers.microsoft.com/v2/global/en/culture) and values. (https://www.microsoft.com/en-us/about/corporate-values)
Qualifications
Minimum Qualifications:
8+ years technical experience in software engineering, network engineering, or systems administration
OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 5+ years technical experience in software engineering, network engineering, or systems administration
OR Master's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration
OR Doctorate Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration.
8+ Years experience with database technologies.
8+ years experience with performance optimization.
Other Requirements:
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to, the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
9+ years technical experience in software engineering, network engineering, or systems administration
OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 6+ years technical experience in software engineering, network engineering, or systems administration
OR Master's Degree in Computer Science, Information Technology, or related field AND 4+ years technical experience in software engineering, network engineering, or systems administration
OR Doctorate Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration.
The ability to collaborate across many teams and applications.
Site Reliability Engineering IC5 - The typical base pay range for this role across the U.S. is USD $137,600 - $267,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $180,400 - $294,000 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: US corporate pay information | Microsoft Careers (https://careers.microsoft.com/v2/global/en/us-corporate-pay.html)
Microsoft will accept applications for the role until December 3, 2024.
#Health&LifeScience
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations (https://careers.microsoft.com/v2/global/en/accessibility.html) .