Generative AI Data Scientist

Company:  Global Commerce & Information, Inc.
Location: Gwynn Oak
Closing Date: 30/11/2024
Hours: Full Time
Type: Permanent
Job Requirements / Description
Your Success is Our Success. Global CI is an award-winning 30-year IT Services company founded on the principles of providing high-quality, value-added technology consulting services. Our vision is to create a better future by improving the lives of the people we serve through emerging technologies. Join us and together we will advance the future of technology services.

Global CI offers competitive compensation and non-salary benefits to all eligible employees.

Job Description

Role Description:

Formulate, design, and deliver AI/ML-based decision-making frameworks and models for business outcomes. Measure and justify the value of AI/ML-based solutions.

We are seeking a Python GenAI Engineer with expertise in developing, fine-tuning, and integrating AI models, particularly in natural language processing (NLP). This role focuses on building generative AI solutions for summarization and other NLP applications, using prompt engineering and human-in-the-loop feedback to optimize AI outputs. The ideal candidate will have demonstrated experience analyzing unstructured medical records and developing AI models that extract insights from them. You will collaborate closely with data scientists, software engineers, and other stakeholders to integrate AI models into production environments within cloud infrastructure.

Required Qualifications & Experience:

• 5+ years of experience in AI/ML development with a strong focus on NLP and generative models, using frameworks such as TensorFlow, PyTorch, and Hugging Face

• Expertise in Python, with experience in libraries like Transformers, NLTK, spaCy, and Gensim, and data manipulation tools such as Pandas and NumPy

• Implement dynamic prompt engineering strategies to optimize model outputs (1-2 years preferred)

• Familiarity with generative AI models such as OpenAI's GPT and Llama, and supporting libraries like vLLM

• Strong analytical skills and experience with statistical modeling and data analysis

• Ability to effectively articulate technical challenges and solutions

• Strong communicator with excellent written and verbal communication skills

• Identify and analyze user requirements to generate stories and tasks for the team backlog

• Prioritize and execute tasks throughout the software development life cycle

• Create custom NLP algorithms and annotators to evaluate medical record data

• Create custom tools to enable analysts to perform data research

• Solid understanding of statistical modeling, data analysis, and performance evaluation metrics

• Demonstrated experience analyzing and processing unstructured clinical data (e.g., electronic health records, physician notes, imaging reports), using techniques such as tokenization, lemmatization, and text representations (e.g., TF-IDF, BERT embeddings)

• Familiarity with healthcare data formats and standards such as HL7, FHIR, ICD codes, and SNOMED

• Experience with cloud platforms (AWS, Azure), containerization (Docker), and CI/CD pipelines for machine learning model deployment

• Knowledge of SQL (PostgreSQL, MySQL) and NoSQL (MongoDB, Elasticsearch) databases, and of how to structure data pipelines for efficient data processing. Experience optimizing these databases to support efficient data storage and retrieval for AI models

• Develop and fine-tune AI models for natural language processing (NLP) tasks, including Named Entity Recognition (NER), text classification, summarization, and sentiment analysis, particularly with unstructured clinical records

• Conduct experiments to evaluate model performance, using metrics such as precision, recall, and F1-score to iteratively improve models through hyperparameter tuning and training optimizations

• Experience implementing prompt engineering strategies and applying transformer models to enhance generative AI outputs using frameworks like Hugging Face and PyTorch

• Experience integrating AI models into production environments, collaborating with software engineers and using cloud platforms like AWS to ensure scalability and performance

• Analyze and preprocess large datasets, particularly unstructured medical records (e.g., physician notes, discharge summaries), using tools like Pandas, NLTK, and spaCy

• Stay updated with the latest research and advancements in AI and NLP, applying state-of-the-art techniques such as transfer learning, attention mechanisms, and fine-tuning pre-trained models to healthcare-specific challenges

• Master's degree (Data Science, AI, Computer Science, or a related field) plus 10 years of experience, or a PhD plus 4 years of experience

Preferred Qualifications:

• Experience in healthcare, particularly working with unstructured medical records in clinical settings, leveraging NLP models for insight extraction

• Experience working with human-in-the-loop systems, incorporating clinician/end-user feedback and leveraging tools like SciPy and NumPy to improve AI model accuracy

• Educational background or practical training in a clinical setting, with exposure to clinical workflows and medical terminologies

• Familiarity with deep learning techniques, attention mechanisms, and transformers applied to healthcare data

Benefits include:

Comprehensive medical, dental, vision, life, and short- and long-term disability insurance, plus a health savings account

Matching 401(k) retirement plan, plus IRAs and Roth IRAs

Generous paid time off and paid holidays

Employee recruitment/referral bonus

Paid community service hours

Tuition reimbursement

Employee discounts

At Global Commerce & Information, Inc. we celebrate, support, and are committed to creating a diverse and inclusive environment. We're proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, veteran status, or any other legally protected characteristics.

Global Commerce & Information, Inc. maintains a drug-free workplace.
