Skip to content

Introduction

  • Several aspiring data scientists think Data science is just about training fancy models. While it's part of the job, what we are missing here is the huge additional effort that is required to make sure that a model is trainable, executable and deployable. Add to it the complexity of working in a team and now we have to make sure the code is well formatted and structured as well. In all, life of a Data scientist is similar to any software engineer, just with a caveat of having the luxury to play with the state-of-the-art AI algorithms once in a while 😄
  • Now, the industry is trying (or realizing) the capabilities, limitations and responsibilities of professionals in AI or ML field. This is giving rise to increase in requirements for diverse job profiles like ML engineer, Data Engineer, Data Scientist, Research Scientist, etc. That said, Data scientist (DS) are reinventing themselves as well, giving rise to the interesting profile of "Full stack Data scientists" - who while researching and experiment with AI models, are not afraid to dabble into the old school engineering aspect of the project. This article is for such aspirants or practitioners.

MLOps tools

  • Just like a mechanic should know about all the tools at his disposal, a data scientist should be aware of different ready-made and possibly free services available. You can quote me on this,

Quote

Never pay for what you can do for free, and never build something which has already been built! 😏

  • With this fortune cookie quote in mind, let's look into different tools and their segregation based on the where and how they are used,

Different MLOps tools covering Data, Training, Evaluation and Deployment. (fullstackdeeplearning)

  • The rest of this section will cover some of these tools in detail.