Summary Description: The Data Science Institute of Columbia University's mission involves advancing the state of the art in data science; transforming all fields, professions, and sectors through the application of data science, while ensuring the responsible use of data for the benefit of society.We are looking for a research programmer to serve as a system engineer on a large research project involving search and summarization over multilingual text and speech documents. Columbia University is the lead organization and there are four other universities on the team. The candidate will be responsible for maintaining and modifying the system architecture, developing new system components as required, creating and maintaining web resources in the languages we are required to work in building, overseeing evaluation of the system, and managing a group of GPU machines for the group. The programmer would work with a team from multiple universities on a project called SCRIPTS. SCRIPTS (System for CRoss language Information Processing, Translation and Summarization) is a planned end to end system for retrieval of speech and text documents in low to medium resource languages (e.g., Uhyghur or Arabic). In order for the system to determine relevance, speech documents must be automatically transcribed and both text and speech documents must be translated. There will also be a summarization component that will produce a short paragraph summary of the relevant portion of each document that an end user can use to determine if the document is what s/he was looking for. The project is funded by IARPA and the team will participate in multiple evaluations to demonstrate how well the system works.
Minimum Qualifications: Associate Research Scientists at Columbia SEAS are junior officerswho have demonstrated innovation and excellence in research.Candidates are evaluated on research ability and required expertise.Applicants must have a PhD degree in computer, communication andinformation sciences. At least 2 years of postdoctoral experience arerequired, as well as a strong publication record. Familiarity withphotonics is necessary, along with experience in electrical engineeringand communication systems.
Preferred Qualifications: The following experience and skills are preferred for the successful candidate:?Experience managing large software projects.?Strong Java, C++, Python and Perl programming skills and a close familiarity with core and 3rd party libraries.?Familiarity with software engineering tools such as github, Docker and one-click build and test.?Knowledge of deep learning and GPU machines preferable.?Skill with processing, analysis and storage of big data.?Experience with Linux system administration.?Knowledge about Human Language Technologies (NLP, IR, ASR)?Experience with Network Analysis preferable.?Good at switching between tasks and assisting project members.?Skilled at interacting with internal and external partners.?Devoted user of industry best practices and test-driven development.
As a member of the National Collegiate Athletic Association (NCAA) and the Council of Ivy Group Presidents (Ivy League), it is imperative that members of the Columbia University community, in all matters related to the intercollegiate athletics program, exhibit the highest professional standards and ethical behavior with regard to adherence to NCAA, Conference, University, and Department of Intercollegiate Athletics and Physical Education rules and regulations.
Columbia University is an Equal Opportunity/Affirmative Action employer.
Internal Number: 334_65305
About Columbia University
Columbia University is one of the world's most important centers of research and at the same time a distinctive and distinguished learning environment for undergraduates and graduate students in many scholarly and professional fields. The University recognizes the importance of its location in New York City and seeks to link its research and teaching to the vast resources of a great metropolis. It seeks to attract a diverse and international faculty and student body, to support research and teaching on global issues, and to create academic relationships with many countries and regions. It expects all areas of the university to advance knowledge and learning at the highest level and to convey the products of its efforts to the world.