Skip to main content

Data Scientist / Machine Learning Engineer

Job Description Roles and Responsibilities: Internal R&D to Develop and Deploy AI Systems:   - Design, build, and deploy AI and generative AI systems tailored to client needs   - Utilize and fine-tune open-source models to create generative AI applications   - Actively engage with our client's AI COE communities across APAC and beyond, and the open-source community to stay updated on the latest advancements and incorporate them into projects   - Have an understanding of, and be confident enough to (attempt to) develop AI systems capable of handling multi-modal data and performing agentic tasks Client Work:   - Work closely with clients to understand their requirements and deliver customized AI solutions   - Collect requirements from internal or external departments and provide the analysis   - Participate in the High-level design and data collection and cleaning. The data can be structured, unstructured Data science:   - Data Management: Collect, pre-process, and analyze large datasets to train and evaluate models   - Performance Optimization: Continuously monitor and improve the performance of AI models   - Documentation and Reporting: Maintain comprehensive documentation of models, processes, and project progress. Prepare and present reports to stakeholders Requirements Knowledge of / Proficiency in  Python, TensorFlow, PyTorch, Flash Attention, Other relevant AI/ML frameworks Experience with Open-Source Models with demonstrated ability to leverage and fine-tune open-source models for specific applications Community engagement skills with a track record of active participation in the open-source community contributing to and utilizing community resources Multi-modal data handling experience in working with multi-modal data (text, image, audio, etc.) and developing models that integrate these data types Agentic AI development knowledge of developing AI systems capable of autonomous decision-making and task execution Problem-solving skills with strong analytical ability to troubleshoot complex issues Communication skills with excellent verbal and written communication style to engage with clients effectively and collaborate with team members from diverse backgrounds Ability to work well in a team environment and contribute to collective goals Preferred: Knowledge of and experience with NoSQL DBs like MongoDB and data platforms like databricks and / or with industry standard ETL techniques will be a big plus Knowledge of and experience with transformer based models and generative AI models, including pre-trained large foundation models, fine-tuning of models using techniques like LoRA etc. is a plus Design and implement the API for AI model and applications integration and of Web applications / interfaces that utilize the AI models will also be a plus Master’s or PhD in Machine Learning, Artificial Intelligence, Computer Science, or related fields. Strong academic background with a focus on AI/ML technologies Additional relevant certifications in AI/ML from recognized institutions are a plus Requirements Roles and Responsibilities: Internal R&D to Develop and Deploy AI Systems:   - Design, build, and deploy AI and generative AI systems tailored to client needs   - Utilize and fine-tune open-source models to create generative AI applications   - Actively engage with our client's AI COE communities across APAC and beyond, and the open-source community to stay updated on the latest advancements and incorporate them into projects.   - Have an understanding of, and be confident enough to (attempt to) develop AI systems capable of handling multi-modal data and performing agentic tasks Client Work:   - Work closely with clients to understand their requirements and deliver customized AI solutions   - Collect requirements from internal or external departments and provide the analysis   - Participate in the High-level design and data collection and cleaning. The data can be structured, unstructured Data science:   - Data Management: Collect, pre-process, and analyze large datasets to train and evaluate models   - Performance Optimization: Continuously monitor and improve the performance of AI models   - Documentation and Reporting: Maintain comprehensive documentation of models, processes, and project progress. Prepare and present reports to stakeholders Requirements Knowledge of / Proficiency in  Python, TensorFlow, PyTorch, Flash Attention, Other relevant AI/ML frameworks Experience with Open-Source Models with demonstrated ability to leverage and fine-tune open-source models for specific applications Community engagement skills with a track record of active participation in the open-source community contributing to and utilizing community resources Multi-modal data handling experience in working with multi-modal data (text, image, audio, etc.) and developing models that integrate these data types Agentic AI development knowledge of developing AI systems capable of autonomous decision-making and task execution Problem-solving skills with strong analytical ability to troubleshoot complex issues Communication skills with excellent verbal and written communication style to engage with clients effectively and collaborate with team members from diverse backgrounds Ability to work well in a team environment and contribute to collective goals Preferred: Knowledge of and experience with NoSQL DBs like MongoDB and data platforms like databricks and / or with industry standard ETL techniques will be a big plus Knowledge of and experience with transformer based models and generative AI models, including pre-trained large foundation models, fine-tuning of models using techniques like LoRA etc. is a plus Design and implement the API for AI model and applications integration and of Web applications / interfaces that utilize the AI models will also be a plus Master’s or PhD in Machine Learning, Artificial Intelligence, Computer Science, or related fields. Strong academic background with a focus on AI/ML technologies Additional relevant certifications in AI/ML from recognized institutions are a plus

Data Scientist / Machine Learning Engineer

NEXUS CORPORATION
Shinjuku City, Tokyo
Full time

Published on 08/26/2024

Share this job now