MSK is seeking a Data Engineer to join the Computational Oncology Program in the Department of Epidemiology and Biostatistics as part of the new Program in Computational Immuno-Oncology. Working in close collaboration with researchers and software engineers, you will be responsible for managing data from cutting-edge, large-scale research efforts at the intersection of computational oncology and immunology, including bulk and single-cell genomics, imaging, and clinical data analysis and interpretation. We are seeking a hardworking, highly skilled, responsible individual who is motivated to contribute meaningfully to modern progress in cancer research driven by advances in computing and data, and who has experience handling and visualizing data using robust, enterprise-level modern software systems. The right candidate will be the main liaison for all data requests from researchers and collaborators, a close collaborator with the software engineering team in building data management solutions, a mentor to those who wish to use these solutions, and ultimately responsible for the correctness and completeness of all data within the group. Additionally, you will oversee and ensure the delivery of data results, taking action to keep analysis projects on track.
The Data Engineer will also drive assigned projects and ensure the delivery of results; identify, develop, and gather resources to complete each project; plan, coordinate, and lead meetings with stakeholders throughout the project life cycle; design and implement a plan for completing projects and delivering results; manage priorities and resource allocation; and monitor progress and performance against the project plan, taking action to ensure projects stay on track.
Manage data from high-throughput next-generation sequencing and imaging
Contribute to the design of databases as part of bioinformatics data processing and analysis systems
Contribute to front end solutions for visualization of data and analyses
Maintain and monitor streaming and batch ETLs operating on structured and unstructured sources
Maintain a data lake with hundreds of terabytes of data
Develop workflows and integrate systems with REST APIs
Compile datasets and verify data consistency
Communicate with data stakeholders and, upon request, conduct data query tracking and resolution
Identify inefficiencies and work with software engineers to simplify processes, debug systems and automate routine tasks
Able to hold yourself and others accountable in order to achieve goals and live up to commitments
A good decision-maker, with proven success at making timely decisions that keep the organization moving forward
Able to work effectively in an environment notable for complex, sometimes contradictory information
Consistently achieving results, even under tough circumstances
Adept at planning and prioritizing work to meet commitments aligned with organizational goals
Adept at building partnerships and working collaboratively with others to meet shared objectives and goals
An effective communicator, capable of determining how best to reach different audiences and executing communications based on that understanding
Resilient in recovering from setbacks and skilled at finding detours around obstacles
Able to operate effectively, even when things are not clear or the way forward is not obvious
Adept at learning quickly, applying insights from past efforts to new situations
At least 3 years of proven experience, preferably with bioinformatics lab information management systems
Bachelor's degree in Computer Science, Information Systems, or Database Management (or equivalent experience)
Experience designing databases and defining system requirements for data collection
Experience with Python and with SQL and NoSQL data
Experience with Linux systems and shell scripting
Experience with the software development life cycle (requirements, design, deployment, testing, etc.)
Competitive compensation packages | Sick Time | Generous Vacation + 12 holidays to recharge & refuel | Internal Career Mobility & Performance Consulting | Medical, Dental, Vision, FSA & Dependent Care | 403b Retirement Savings Plan Match | Tuition Reimbursement | Parental Leave & Adoption Assistance | Commuter Spending Account | Fitness Discounts & Wellness Program | Resource Networks | Life Insurance & Disability | Remote Flexibility
We believe in communication, openness, and thinking beyond your 8-hour day at MSK. It's important to us that you have a sense of impact, community, and work/life balance to be and feel your best.
Internal Number: 2020-43382
About Memorial Sloan-Kettering Cancer Center
As one of the world's premier cancer centers, Memorial Sloan-Kettering Cancer Center is committed to exceptional patient care, leading-edge research, and superb educational programs. The close collaboration between our physicians and scientists is one of our unique strengths, enabling us to provide patients with the best care available today as we work to discover more effective strategies to prevent, control, and ultimately cure cancer in the future. Our education programs train future physicians and scientists, and the knowledge and experience they gain at Memorial Sloan-Kettering have an impact on cancer treatment and the biomedical research agenda around the world.