Innodata Inc. is a global data engineering company focused on the responsible advancement of artificial intelligence. They are seeking a Generative AI Associate who will contribute to the development of large language models by evaluating AI performance and generating training data, all while working flexibly on a part-time basis.
Rating/assessing the performance of AI models or algorithms based on their output or behavior through a set of evaluative questions
Labeling elements of a piece of content rather than the content as a whole
Assigning predefined categories or labels to items
Evaluating the perceived quality and/or appropriateness of content
Generating labels to advance understanding of a concept, trend etc
Creation of additional training data for machine learning models by applying transformations to the original data, such as modifying images (rotation, flipping, cropping), generating new text (paraphrasing, summarization), or altering audio/video signals (speed modification, pitch shifting) to reduce overfitting and increase dataset diversity
Reviewing data and identifying whether or not a product feature works as intended based on the project's guidelines
Labeling model outputs to identify if a piece of content is or isn't something. Examples: identify clickbait; identifying gaming videos; identifying branded content
Ordering or ranking items based on a set of preferences or criteria
Creating prompts or questions that will be used to generate responses from a language model or other AI system
Projects that evaluate the relevance of content based on a relevancy scale (1-3, 1-5, etc.)
Generating responses to prompts or questions using a language model or other AI system
Rewriting existing text while preserving the original meaning, often to improve clarity or style and adherence to guidelines
Producing concise summaries of longer pieces of text or data
Converting spoken language or audio content into written text
Converting text or spoken language from one language to another
Gathering and compiling various forms of data to be used for training, evaluating, or fine-tuning the AI models. This may include text, images, videos, audio files, or other types of digital content
Qualification
Required
A Bachelor's degree or higher in a humanities specialization is required
Professional or Expert level proficiency (C1/C2) in English and French
Preferred
Advanced degrees are strongly preferred (Master's or PhD)
Benefits
(NASDAQ: INOD) Innodata is a global data engineering company. We believe that data and AI are inextricably linked.