Heungsub Lee
- Contact
- heungsub@subl.ee
- Web Sites
- subl.ee, github.com/sublee, linkedin.com/in/sublee
Interests
- Designing engineer-oriented modules or services
- MLOps and back-end development
- Optimizing software development culture
Skills
- Programming Languages
- Go, Python, JavaScript, Bash
- ML Engineering
- PyTorch, data or pipeline parallelism, Triton Inference Server, NVIDIA Nsight Systems
- Back-end Development
- Linux, K8s, AWS, Terraform, MongoDB, MySQL, Redis, etcd, ZeroMQ
Work Experience
- Project Manager & Software Engineer
- Clova in Naver, 2020–
- Industrializes AI.
-
Lead 25 software engineers developing MLOps platforms.
Designed and developed NSML an ML research platform on HPC infrastructures, focused on large-scale AI models.
Developed an inference API and checkpoint preprocessing tools for HyperCLOVA, which is a Korean variant of 82B GPT-3.
- Software Engineer
- Kakao Brain, 2018–2020
- The AI research lab in Kakao.
-
Focused on parallel training of convolutional neural networks, including large-scale data parallelism and pipeline parallelism. Developed and published a pipeline parallelism library in PyTorch named torchgpipe.
Developed a serverless training framework and a distributed hyperparameter search platform for an AutoML service.
- Game Server Engineer & Architect
- Nexon, 2011–2018
- Developed and launched Durango, KartRider Dash & Coin Rush.
-
Designed and developed cloud-based distributed game servers for Durango (MMORPG) and KartRider Dash & Coin Rush (online racing games) respectively. Durango achieved up to 70k concurrent users per game world.
Developed an internationalization and localization system focused on linguistic features of Korean and Indo-European languages.
Researched rating systems such as Elo, Glicko, and TrueSkill to develop a matchmaker.
Led a server engineering team including 15 engineers.
- Web Developer
- Npine, 2008–2011
- Supplies stock images for business on Iclickart.
- Developed web services from scratch. Maintained on-premise Linux servers.
- Front-end Web Developer
- Lunant, 2008–2011
- Served social media named VLAAH.
- Designed and implemented the UI/UX for social media.
Open Source Experience
- torchgpipe, 2019–2020
- A GPipe implementation in PyTorch.
-
Implemented GPipe in PyTorch with optimization for CUDA, PyTorch’s autograd engine, and long skip connections. GPipe is a scalable pipeline parallelism library for the training of a giant model.
This project has become a part of PyTorch. The story behind it can be found on Kakao Brain Blogko and the technical report.
- Hangulize, 2010–
- Automatically transcribes a non-Korean word into Hangul.
-
Implemented an automatic Hangul transcription algorithm to realize Brian Jongseong Park’s idea. Designed and implemented the web service and RESTful API.
Many professional Korean translators habitually use this tool to translate undocumented proper nouns. Netflix refers to this tool in the Korean timed-text style guide.
- TrueSkill, 2012–
- A TrueSkill™ implementation in Python.
- Implemented TrueSkill™, which is a rating algorithm for Xbox Live, in Python with a handy interface. This project was introduced in PyData Berlin 2019.
- Profiling, 2014–2018
- An interactive profiler for Python inspired by the Unity3D profiler.
- Developed a Python profiler with an interactive TUI inspired by the Unity profiler. On GitHub, this project has been starred by 3k people. Also, it was the 3rd daily trending repository on Sep 22, 2014.
- Others
-
- Tossi – A utility for Korean allomorphic particles.
- Flask-AutoIndex – mod_autoindex for Flask.
- SUBLEERUNKER – A simple parody game of SUBERUNKER.
- Contributions
-
- For PyTorch, fixed potential GPU memory violation (#27371); deprecated inconsistent API (#21006, #25985); discussed a counterintuitive behavior of the autograd engine (#18568).
- For ZeroMQ, discussed a PUB socket crash (#2942).
- For Flask, fixed a bug to generate a URL with a subdomain (#108).
- For jQuery 1.4.3, restored a missing part of the content negotiation header for Ajax.
Publications
- B. Kim et al., “What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers,” arXiv:2109.04650, Sep 2021.
- C. Kim*, H. Lee*, M. Jeong, W. Baek, B. Yoon, I. Kim, S. Lim, S. Kim, “torchgpipe: On-the-fly Pipeline Parallelism for Training Giant Models,” arXiv:2004.09910, Apr 2020.
*Contributed equally.
Public Speeches
- Remake of Hangulizeko at Golang Korea 2018 and Naver D2
- Server architecture of Durango Vol. 3ko at NDC 2018
- Python Survival Guideko at Nexon Talk 2016
- Server architecture of Durango Vol. 2ko at NDC 2016
- Profilingko at PyCon KR 2015
- Server architecture of Durangoko at NDC 2014
Languages
- Korean – native
- English – conversant in reading and writing
Education
Computer Software, Kwangwoon University, 2008 – completed the first year only