AI Engineer
Zalo |
Thành phố Hồ Chí Minh |
VN
Hồ Chí Minh
Full-time
What you will do- Optimizing latency and throughput of model inference;
- Building reliable production serving system to serve millions of users;
- Experience with programming languages such as C++ and Python;
- Solid knowledge of Data Structures and Algorithms;
- Proficiency with deep learning frameworks such as PyTorch and TensorRT;
- Experience with system optimizations for model serving, such as batching, caching, load balancing, and model parallelism;
- Experience with algorithmic optimizations for inference, such as quantization, distillation, and speculative decoding;
- Experience with HTTP, gRPC, and Triton Inference Server;
- Experience with large-scale, high-concurrency production serving;
- Ability to quickly learn new technologies, frameworks, and algorithms;
- Experience with low-level optimizations for inference, such as GPU kernels;
- Experience with building solutions with MLOps tools and frameworks such as Kubernetes, Kubeflow, etc;•
Information :
- Company : Zalo
- Position : AI Engineer
- Location : Thành phố Hồ Chí Minh
- Country : VN
Attention - In the recruitment process, legitimate companies never withdraw fees from candidates. If there are companies that attract interview fees, tests, ticket reservations, etc. it is better to avoid it because there are indications of fraud. If you see something suspicious please contact us: support@jobkos.com
Post Date : 2025-06-05 | Expired Date : 2025-07-05