论文标题

迈向ML工程:TensorFlow扩展的简短历史(TFX)

Towards ML Engineering: A Brief History Of TensorFlow Extended (TFX)

论文作者

Konstantinos, Katsiapis, Karmarkar, Abhijit, Altay, Ahmet, Zaks, Aleksandr, Polyzotis, Neoklis, Ramesh, Anusha, Mathes, Ben, Vasudevan, Gautam, Giannoumis, Irene, Wilkiewicz, Jarek, Simsa, Jiri, Hong, Justin, Trott, Mitch, Lutz, Noé, Dournov, Pavel A., Crowe, Robert, Sirajuddin, Sarah, Warkentin, Tris Brian, Li, Zhitao

论文摘要

作为一门学科,软件工程在过去5年以上已经成熟。现代世界在很大程度上取决于它,因此软件工程的成熟度提高是最终的。测试和可靠技术等实践有助于使软件工程可靠地建立行业。同时,在过去的20多年中,机器学习(ML)也发展了。 ML越来越多地用于研究,实验和生产工作量。现在,ML通常会为我们生活不可或缺的产品提供广泛使用的产品。但是,作为一门学科,ML工程学并没有像其软件工程祖先那样广泛成熟。我们能否将我们学到的知识并帮助应用ML的新生领域演变为ML工程,以编程的方式演变为软件工程[1]?在本文中,我们将进行Sibyl [2]和Tensorflow Extended(TFX)[3]的旋风之旅,这是Alphabet的两个连续的端到端ML平台。我们将分享从十多年来应用的ML建立在这些平台上的经验教训,解释它们的相似之处和差异,并扩大帮助我们在旅途中的转变(心理和技术)。此外,我们将重点介绍TFX的某些功能,这些功能有助于实现ML工程的几个方面。我们认为,为了释放ML的收益,组织应通过投资于强大的ML基础设施并促进ML工程教育来提高其ML团队的成熟度。我们还建议在专注于尖端的ML建模技术之前,产品负责人应该投入更多时间来为其组织采用可互操作的ML平台。结束时,我们还将瞥见TFX的未来。

Software Engineering, as a discipline, has matured over the past 5+ decades. The modern world heavily depends on it, so the increased maturity of Software Engineering was an eventuality. Practices like testing and reliable technologies help make Software Engineering reliable enough to build industries upon. Meanwhile, Machine Learning (ML) has also grown over the past 2+ decades. ML is used more and more for research, experimentation and production workloads. ML now commonly powers widely-used products integral to our lives. But ML Engineering, as a discipline, has not widely matured as much as its Software Engineering ancestor. Can we take what we have learned and help the nascent field of applied ML evolve into ML Engineering the way Programming evolved into Software Engineering [1]? In this article we will give a whirlwind tour of Sibyl [2] and TensorFlow Extended (TFX) [3], two successive end-to-end (E2E) ML platforms at Alphabet. We will share the lessons learned from over a decade of applied ML built on these platforms, explain both their similarities and their differences, and expand on the shifts (both mental and technical) that helped us on our journey. In addition, we will highlight some of the capabilities of TFX that help realize several aspects of ML Engineering. We argue that in order to unlock the gains ML can bring, organizations should advance the maturity of their ML teams by investing in robust ML infrastructure and promoting ML Engineering education. We also recommend that before focusing on cutting-edge ML modeling techniques, product leaders should invest more time in adopting interoperable ML platforms for their organizations. In closing, we will also share a glimpse into the future of TFX.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源