We are looking for an enthusiastic and energetic individual to join our research effort investigating genetic, molecular, and behavioral variation in neurodegenerative disease and cognitive decline. The person in this position will be the main contact for primary bioinformatics analysis of next-generation sequencing and proteomics data and will work with a variety of investigators in the department as part of a centralized bioinformatics core. The candidate will be part of an integrated team of neurologists, data scientists, cellular biologists, and human immunologists who collaborate on team-based projects. There will be many opportunities to contribute to multiple ongoing national and international collaborative projects.
Responsible for assembling, executing, and developing the latest bioinformatics pipelines for primary analysis of omics, transcriptomic (RNA-seq), epigenetic, and proteomic data from human and mouse tissue.
Work with database administrators to develop a data management system for large-scale omics data.
Contribute to QA/QC of pilot and production data sets.
Display initiative and independence in providing rapid results to various investigators generating experimental data.
Prepare summary reports of data and results for dissemination to colleagues and collaborators.
Directly respond to inquiries regarding projects being managed. Produce subsets of data for distribution to collaborators as approved by the principal investigators.
Requires a bachelor's degree or equivalent in education and experience, plus four years of related experience.
Quantitative background (Computer Science, Statistics, Biostatistics, Computational Biology, Applied Mathematics, or Bioinformatics) and extensive experience in building and maintaining computational pipelines for processing sequencing and other types of data.
Good knowledge of programming languages for implementing computational algorithms on large-scale data (R, Python, Perl), as well as experience working in Linux/Unix on high-performance computing/cluster platforms.
In-depth knowledge of at least one flavor of data (genetics, transcriptomics, epigenetics, or proteomics) is required.
Will receive on-the-job training for other data modalities under the supervision of computational investigators in the department.
Demonstrated programming skills.
Demonstrated experience with building pipelines for at least one "omics" data modality (genetics, transcriptomics, epigenetics, proteomics).
Strong organizational skills in managing large datasets.
Programming experience in R, Python, Perl, or C/C++.
Ability to work independently, display initiative within a team environment, and respond rapidly to requests.
Equal Opportunity Employer / Disability / Veteran
Columbia University is committed to the hiring of qualified local residents.
Internal Number: 499676
About Columbia University
Columbia University is one of the world's most important centers of research and at the same time a distinctive and distinguished learning environment for undergraduates and graduate students in many scholarly and professional fields. The University recognizes the importance of its location in New York City and seeks to link its research and teaching to the vast resources of a great metropolis. It seeks to attract a diverse and international faculty and student body, to support research and teaching on global issues, and to create academic relationships with many countries and regions. It expects all areas of the university to advance knowledge and learning at the highest level and to convey the products of its efforts to the world.