
Artificial Intelligence Research Data Science Specialist

Dartmouth College
remote work
United States, New Hampshire, Hanover
7 Lebanon Street
Dec 18, 2024
Position Information


Posting date 12/18/2024
Closing date
Open Until Filled Yes
Position Number 1129150
Position Title Artificial Intelligence Research Data Science Specialist
Department this Position Reports to Research Data Services
Hiring Range Minimum $108,700
Hiring Range Maximum $125,000
Union Type DCLWU
SEIU Level Not an SEIU Position
FLSA Status Exempt
Employment Category Regular Full Time
Scheduled Months per Year 12
Scheduled Hours per Week 40
Schedule
Location of Position
Hanover, NH
Remote Work Eligibility? Hybrid
Is this a term position? No
If yes, length of term in months. NA
Is this a grant funded position? No
Position Purpose
This position works as part of the Dartmouth Libraries Research Data Services team to support research, curricular, and applied artificial intelligence work on campus. The person in this role will bring data science skills together with necessary expertise in information curation and knowledge management to support a variety of generative artificial intelligence applications, such as semantic search, retrieval augmented generation, and information/data retrieval application development. Working alongside campus partners engaged in data science and generative artificial intelligence work, this role will focus on database creation, data ingestion, information preprocessing and embedding, vector database management, and system optimization.
This position is hybrid work location eligible.
Description
Required Qualifications - Education and Yrs Exp Bachelor's degree plus 3-5 years' experience, or an equivalent combination of education and experience
Required Qualifications - Skills, Knowledge and Abilities

  • BA in a quantitative or related field + 3-5 years' experience; or MA in a quantitative or related field + 1-3 years; or PhD in a quantitative or related field; or MLIS + 1-3 years
  • 1-3 years of relevant education or work experience in research or applied AI environments
  • Demonstrated knowledge of programming/scripting languages and analysis applications (e.g., R, Python, SAS, SPSS)
  • Experience using GenAI, deep learning frameworks, and natural language processing (NLP) in projects; or experience with database design and development
  • Experience with preparing data for analysis, visualization, and other procedures
  • Demonstrated ability to work independently and as a team member to solve problems
  • Excellent oral and written communication skills
  • Strong interpersonal and organizational skills
  • Excellent analytical skills
  • Willingness to learn new programming languages, statistical analysis tools, or other relevant tools as needed

Preferred Qualifications

  • Experience with data tools and services, including HPC, in a research library or academic/research setting
  • Demonstrated ability to initiate, plan, coordinate, implement, and assess complex programs, projects, and services
  • Professional experience working with research data and/or in an academic library
  • Demonstrated knowledge of data management, curation, and preservation principles and practices
  • Demonstrated knowledge of open data, data repositories, and the data life cycle

Department Contact for Recruitment Inquiries Lora Leligdon, Head of Research Data Services
Department Contact Phone Number 603-646-3845
Department Contact for Cover Letter and Title Lora Leligdon, Head of Research Data Services
Department Contact's Phone Number 603-646-3845
Equal Opportunity Employer
Dartmouth College is an equal opportunity/affirmative action employer with a strong commitment to diversity and inclusion. We prohibit discrimination on the basis of race, color, religion, sex, age, national origin, sexual orientation, gender identity or expression, disability, veteran status, marital status, or any other legally protected status. Applications by members of all underrepresented groups are encouraged.
Background Check
Employment in this position is contingent upon consent to and successful completion of a pre-employment background check, which may include a criminal background check, reference checks, verification of work history, conduct review, and verification of any required academic credentials, licenses, and/or certifications, with results acceptable to Dartmouth College. A criminal conviction will not automatically disqualify an applicant from employment. Background check information will be used in a confidential, non-discriminatory manner consistent with state and federal law.
Is driving a vehicle (e.g. Dartmouth vehicle or off road vehicle, rental car, personal car) an essential function of this job? Not an essential function
Special Instructions to Applicants
Dartmouth College has a Tobacco-Free Policy. Smoking and the use of tobacco-based products (including smokeless tobacco) are prohibited in all facilities, grounds, vehicles or other areas owned, operated or occupied by Dartmouth College with no exceptions. For details, please see our policy. https://policies.dartmouth.edu/policy/tobacco-free-policy
Additional Instructions
Quick Link https://searchjobs.dartmouth.edu/postings/77026
Key Accountabilities


Description
Works with researchers, staff, and students to refine the collection and curation of corpus documents to ensure datasets are suitable for artificial intelligence and related computational techniques. Designs database architectures for storing documents and the vector databases that will hold document embeddings. While ensuring database scalability, reliability, and performance optimization, monitors the system's performance and optimizes queries to ensure quick retrieval times and high relevance of retrieved documents. Regularly updates the database with new entries and re-indexes as needed.
Percentage Of Time 30%


Description
Assists researchers, staff and students in the development and application of document preprocessing pipelines to clean and prepare text data for embedding. Automates transcription processing where necessary, including language detection, segmentation, and annotation.
Collaborates with librarians to handle metadata properly and maintain data integrity.
Percentage Of Time 20%


Description
Utilizes machine learning models to generate embeddings from preprocessed text data. Indexes embeddings efficiently within the vector database for fast retrieval. Analyzes retrieval accuracy and optimizes the system by applying query transformations and result reranking techniques.
Percentage Of Time 20%


Description
Provides instruction, outreach, and consultations on advanced computing concepts for faculty, students, and staff to expand computational research skills (including data discovery, curation, management, storage, analysis, visualization, and preservation) as needed for curricular or research projects.
Percentage Of Time 10%


Description
Collaborates with Library Research Data colleagues and Information Technology & Consulting colleagues to integrate databases effectively with campus AI infrastructure and large language models, and to fine-tune the models based on the data structure and requirements.
Percentage Of Time 10%


Description
Engages in focused professional development activities and serves on applicable Dartmouth committees and task forces, with an emphasis on data science techniques, generative artificial intelligence, and ethical applications of novel technologies. Recommends and facilitates improvements to existing programs and services, and participates in internal training and professional development for Dartmouth Library and related staff.
Percentage Of Time 10%


-- Demonstrates a commitment to diversity, inclusion, and cultural awareness through actions, interactions, and communications with others.
-- Performs other duties as assigned.