The Computer Scientist will further develop SEQSpark which was developed to analyze large scale genetic epidemiological data.
The Computer Scientist will lead the development of a Python API for extending SEQSpark with various association testing methods written in Python and R. S/he will also implement through the API linear mixed models (LMM), generalized LMM, and methods for detecting pleiotropy and interactions, etc.
Bachelor's Degree in computer science, software engineering, or related field or equivalent in education and experience plus three years of experience.
Experience in distributed computing with Spark, Python software design, and scientific computing in Python (TensorFlow).
Equal Opportunity Employer / Disability / Veteran
Columbia University is committed to the hiring of qualified local residents.
Internal Number: 508079
About Columbia University
Columbia University is one of the world's most important centers of research and at the same time a distinctive and distinguished learning environment for undergraduates and graduate students in many scholarly and professional fields. The University recognizes the importance of its location in New York City and seeks to link its research and teaching to the vast resources of a great metropolis. It seeks to attract a diverse and international faculty and student body, to support research and teaching on global issues, and to create academic relationships with many countries and regions. It expects all areas of the university to advance knowledge and learning at the highest level and to convey the products of its efforts to the world.