UZINFOCOM

Middle/Senior Data Engineer (Muxlisa AI)

Full-time

3-6

Main Office

10.02.2026

Working conditions

You will design and maintain the data pipelines behind Muxlisa AI's speech technologies. Your work will be the foundation for training Automatic Speech Recognition (ASR/STT), Text-to-Speech (TTS), and diarization models, supplying them with clean, high-quality datasets.

Responsibilities:

  • Capture & Upload: Organize the capture and upload of audio data from various sources (call centers, TTS recording studios, internal resources).

  • Dataset Collection: Collect open datasets using Python, web crawling libraries, and custom parsers.

  • Cleaning & Preprocessing: Perform audio cleaning and preprocessing (resampling, voice activity detection (VAD), silence removal, segmentation).

  • Data Preparation: Build verified audio-text pairs for the training, validation, and test sets (train/dev/test).

  • Storage Optimization: Optimize data structures and flows in MinIO/S3.

  • Labeling Support: Manage labeling processes (data export/import, validation).

  • ETL Processes: Create and support ETL processes specific to STT/TTS.

  • Delivery: Prepare and deliver data for Machine Learning Engineers.

Requirements:

  • Python: Strong proficiency in Python (pandas, numpy, librosa, soundfile, re, pydub).

  • Linux/Big Data: Experience working in Linux/bash environments and processing large volumes of data.

  • Audio Processing: Understanding of the basics of audio signal processing.

  • Object Storage: Experience with object storage systems (S3/MinIO).

  • Data Structuring: Deep understanding of data structuring principles: ability to segment, categorize, and label data, design clear schemas, and ensure format consistency.

  • Formats: Knowledge of the data formats used for STT/TTS.

Nice to have:

  • Familiarity with ETL orchestration tools (Airflow, Luigi).

  • Experience working with datasets for speech diarization.

Conditions:

  • Schedule: 5 days a week, from 09:00 to 18:00.

  • Employment: Official employment in accordance with the Labor Code of the Republic of Uzbekistan, including 28 calendar days of vacation.

  • Dress Code: No strict dress code; we aim to break stereotypes about government-related organizations.

  • Team: Work within a strong team of professionals ready to share knowledge and experience.

  • Impact: Take part in large-scale, significant projects that create services to improve the population's quality of life and optimize business processes at the country's leading enterprises.

  • Autonomy: Broad opportunities for independent decision-making and active influence on the company's development.

Interested in this vacancy?

Before applying, please review the responsibilities and working conditions.