Automatic Speech Recognition (Python/AI)
TRUONG MINH THINH TECHNOLOGY JOINT STOCK COMPANY
Tầng 2,4 Lô III-26, Đường 19/5A, Nhóm CN III, Khu công nghiệp Tân Bình, Tan Phu, Ho Chi Minh
Không xác định
2021-09-25 -> 2021-09-26
- Must – have technical skills
- - Being well versed in classical speech processing methodologies like hidden Markov models (HMMs), Gaussian mixture models (GMMs), Artificial neural networks (ANNs), Language modeling, etc.
- - Hands-on experience of current deep learning (DL) techniques like convolutional neural networks (CNNs), recurrent neural networks (RNNs), long-term short-term memory (LSTM), connectionist temporal classification (CTC), etc used for speech processing is essential
- - Strong programming skills in Python
- - Familiarity with any of the end-to-end ASR, TTS models such as DeepSpeech, Tacotron, Conformer, etc.
- - Good understanding of machine learning(ML) tools
- Must – have soft skills
- - Top-down thinker, excellent communicator, and great problem solver. Must be a strong team player
- - Open-minded and excited to learn new things
- - Ability to comprehend and independently implement latest research papers
- Good - to - have skills
- - Experience working with data related to text, voice and image analytics
- - Capable of presenting complex technical topics in a clear and structured way for even non-technical person
- Education and Experience
- - B.S. or M.S. degree in Computer Science, Software Engineering or related field
- - At least two year experience or equivalent in developing TTS and/or ASR solutions;
- Are you looking for a dynamic, creative and potential working environment? You want to work closer with AI-based solutions to automate routine transactions and processes?
- Welcome to TMT!!!
- Newly found but rapidly grown, the TMT’ AI Team is passionately working with close consultation and collaboration from a prestigious Professor from HCM University of Technology (Đại học Bách Khoa) whose expertise is in natural language processing and machine learning. OUR VISION is to apply the power of AI in Chatbot to finally understand the mysteries of human language, speech, communication, emotions, and real intent in real life.
- Ongoing investigations in our team revolve around:
- - Speech Processing: Process of transcribing utterances into texts, process of converting language texts into speeches
- - Natural Language Processing: Automatic question answering, text classification, etc.
- - Computer Vision: Object detection, Image-based product classification, etc.
- - Other AI-related techniques: Data pre-processing and cleaning, automatic labelling and model re-training
- Automatic Speech Recognition (ASR) & Text To Speech (TTS) Engineer (Senior)
- - Developing and validating the pipeline for robust Developing Speech to Text and Text to Speech systems using frameworks like DeepSpeech, Tacotron, etc.
- - Developing Speech to Text and Text to Speech systems
- - Tuning the individual modules to achieve the state-of-the-art metrics
- - Integrating the developed solutions into company existing and emerging products
- - Leading a team of 2-4 members