Monday, December 28, 2020

Building a Decision tree tensorflow.js application

 Building a Decision tree tensorflow.js application: 

Introduction: TensorFlow is a machine learning framework for JavaScript applications. It helps us build models that can be directly used in the browser or in the node.js server. We use this framework for building an application that can find similar requests so that they might be used for prediction. 

Description: The JavaScript application uses data from a CSV that has categorizations of requests and the resolution time. The attributes of each request include a category_id, a few attributes, and the execution time. The data used in this sample has 1200 records. 

As with any ML learning example, the data is split into 70% training set and 30% test set. There is no order to the data and the split is taken over a random set.  

The model chosen is a decision tree model. This model is appropriate for splitting data based on column value and recursively for each level of columns one after the other with all the rows. After training this model, it can be used to make a prediction. This model is described in detail here. TensorFlow makes it easy to construct this model using an API. It can only present the output after the model is executed. In this case, the model must be run before the weights are available.  The feature column and input functions are used to train the model. The output of each layer can be printed using the describe() method.  

With the model and training/test sets defined, it is now as easy to evaluate the model and run the inference.  The model can also be saved and restored. It executed faster when there is GPU added to the computing. 

The features are available with the feature_extractor. It is evaluated on the training set using model.compile() and model.fit(). The model can then be called on a test input. Additionally, if a specific layer was to be evaluated, we can call just that layer on the test input. 

When the model is trained, it makes several passes of the entire training dataset It is helpful to visualize the training with the help of a high chart that shows the reasoning for the decision tree split at that level. 

When the model is tested, it predicts the resolution time for the given attributes of category_id and parameter attributes 

Conclusion: Tensorflow.js is becoming a standard for implementing machine learning models. Its usage is fairly simple but the choice of model and the preparation of data takes significantly more time than setting it up, evaluating, and using it. 

 

 

No comments:

Post a Comment