Friday Systems builds AI that allows industrial robots to adapt to dynamic warehouse environments.
Own the DRL stack end-to-end: formulation → algorithm design → large-scale training → evaluation → deployment.
~ Design & ship DRL algorithms (PPO/SAC/DDQN and variants, based on encoders/cross-attention/pointer networks) for complex control & combinatorial optimization.
~ GAE, normalization, entropy/KL control, distributional/value-loss tuning, curriculum learning and reward shaping, …
~ Launch multi-GPU training, parallel rollouts, efficient replay/storage, and reproducible experiment tooling.
~ Productionize: clean PyTorch code, profiling, Dockerized services (FastAPI), AWS deployments, experiment tracking, dashboards.
~ Provide mentorship and leadership to foster a culture of quality and innovation.
Extensive Deep Learning, Reinforcement Learning & PyTorch expertise: you can implement several DRL algorithms from scratch, diagnose the root causes of performance drops, and make informed decisions about next steps.
~ Python, Linux, Docker, Multi-GPU, Cloud (AWS).
~ Ownership: you’re comfortable being the primary owner of experiments, code quality, and results in a small team.
~ We are not considering entry-level or coursework-only profiles for this role.
Deep technical session with the CTO on your past RL work (no LeetCode, no homework).