Cyient-logo
Cyient
ยท
April 29, 2025
Apply Now
This job has closed.

Trainee Engineer-Data science

Allen, TX
Full-time
Onsite
New Grad, Entry Level
Cyient is a digital engineering and technology company specializing in management consulting and engineering services. They are seeking a Trainee Engineer in Data Science to assist with data collection, preparation, and analysis, as well as model development and deployment.
Apply Now

Responsibilities

  • Data Gathering: Collecting data from various sources, including databases, APIs, and web scraping.
  • Data Cleaning: Identifying and correcting errors or inconsistencies in data to ensure quality.
  • Data Transformation: Converting raw data into a usable format, including normalization and scaling.
  • Descriptive Statistics: Calculating measures such as mean, median, mode, variance, and standard deviation.
  • Visualization: Creating charts, graphs, and plots to understand data distributions and relationships.
  • Pattern Identification: Detecting trends, anomalies, and patterns in data.
  • Algorithm Selection: Choosing appropriate machine learning algorithms based on the problem at hand.
  • Model Training: Training models using supervised, unsupervised, or reinforcement learning techniques.
  • Model Evaluation: Assessing model performance using metrics like accuracy, precision, recall, F1 score, and ROC-AUC.
  • Model Deployment: Implementing models in production environments.
  • Monitoring: Tracking model performance and making adjustments as needed.
  • Documentation: Writing clear documentation for data processes, models, and code.
  • Team Collaboration: Working with other data scientists, engineers, and stakeholders to understand requirements and deliver solutions.
  • Reporting: Presenting findings and insights to non-technical stakeholders through reports and presentations.
  • Continuous Learning: Staying updated with the latest trends and technologies in data science.
  • Programming: Writing code in languages such as Python, R, SQL, and sometimes Java or Scala.
  • Tools and Libraries: Using data science tools and libraries like Pandas, NumPy, Scikit-learn, TensorFlow, and PyTorch.
  • Database Management: Working with SQL and NoSQL databases to store and retrieve data.
  • Data Privacy: Ensuring compliance with data privacy laws and regulations.
  • Bias Mitigation: Identifying and mitigating biases in data and models.

Qualification

Required

  • Data Gathering: Collecting data from various sources, including databases, APIs, and web scraping.
  • Data Cleaning: Identifying and correcting errors or inconsistencies in data to ensure quality.
  • Data Transformation: Converting raw data into a usable format, including normalization and scaling.
  • Descriptive Statistics: Calculating measures such as mean, median, mode, variance, and standard deviation.
  • Visualization: Creating charts, graphs, and plots to understand data distributions and relationships.
  • Pattern Identification: Detecting trends, anomalies, and patterns in data.
  • Algorithm Selection: Choosing appropriate machine learning algorithms based on the problem at hand.
  • Model Training: Training models using supervised, unsupervised, or reinforcement learning techniques.
  • Model Evaluation: Assessing model performance using metrics like accuracy, precision, recall, F1 score, and ROC-AUC.
  • Model Deployment: Implementing models in production environments.
  • Monitoring: Tracking model performance and making adjustments as needed.
  • Documentation: Writing clear documentation for data processes, models, and code.
  • Team Collaboration: Working with other data scientists, engineers, and stakeholders to understand requirements and deliver solutions.
  • Reporting: Presenting findings and insights to non-technical stakeholders through reports and presentations.
  • Continuous Learning: Staying updated with the latest trends and technologies in data science.
  • Programming: Writing code in languages such as Python, R, SQL, and sometimes Java or Scala.
  • Tools and Libraries: Using data science tools and libraries like Pandas, NumPy, Scikit-learn, TensorFlow, and PyTorch.
  • Database Management: Working with SQL and NoSQL databases to store and retrieve data.
  • Data Privacy: Ensuring compliance with data privacy laws and regulations.
  • Bias Mitigation: Identifying and mitigating biases in data and models.

Preferred

Benefit

Cyient is a digital engineering and technology company specializing in management consulting and engineering services.
Glassdoor
This is some text inside of a div block.
Founded in 1991
Hyderabad, Andhra Pradesh, IND
10001+ employees
https://www.cyient.com