AI Engineer

Hồ Chí Minh

Full-time

What you will do
  • Optimizing latency and throughput of model inference;
  • Building reliable production serving system to serve millions of users;
What you will need
  • Experience with programming languages such as C++ and Python;
  • Solid knowledge of Data Structures and Algorithms;
  • Proficiency with deep learning frameworks such as PyTorch and TensorRT;
  • Experience with system optimizations for model serving, such as batching, caching, load balancing, and model parallelism;
  • Experience with algorithmic optimizations for inference, such as quantization, distillation, and speculative decoding;
  • Experience with HTTP, gRPC, and Triton Inference Server;
  • Experience with large-scale, high-concurrency production serving;
  • Ability to quickly learn new technologies, frameworks, and algorithms;
Nice to have:
  • Experience with low-level optimizations for inference, such as GPU kernels;
  • Experience with building solutions with MLOps tools and frameworks such as Kubernetes, Kubeflow, etc;•

Information :

  • Company : Zalo
  • Position : AI Engineer
  • Location : Thành phố Hồ Chí Minh
  • Country : VN

Attention - In the recruitment process, legitimate companies never withdraw fees from candidates. If there are companies that attract interview fees, tests, ticket reservations, etc. it is better to avoid it because there are indications of fraud. If you see something suspicious please contact us: support@jobkos.com

Post Date : 2025-06-05 | Expired Date : 2025-07-05