machine learning Building a Convolution Neural Network (CNN) for handwritten digit recognition in Python using Keras We built a Convolution Neural Network (CNN) for handwritten digit recognition from scratch in python. We will be using Keras API with TensorFlow backend and use handwritten digits dataset from Kaggle.

machine learning Why Principal Component Analysis (PCA) works? We have demonstrated how and why Principal Component Analysis (PCA) works using the intuition behind the common operations used in the algorithm such as Variance, Covariance, Eigenvectors and Eigenvalues. Eigenvectors represent directions while Eigenvalues represent magnitude the importance

machine learning Algorithm of Principal Component Analysis (PCA) The algorithm of Principal Component Analysis (PCA) is based on a few mathematical ideas namely Variance, Convariance, Eigen Vectors and Eigen values. The algorithm is of eight simple steps including preparing the data set, calculating the covariance matrix, eigen vectors and values, new feature set

machine learning Basic Ideas of Principal component analysis Principal component analysis (PCA) is a technique to bring out strong patterns in a dataset by supressing variations. It is used to clean data sets to make it easy to explore and analyse. We have demonstrated an example of 17 dimensions and given the basic intuition of PCA

machine learning Top deep learning frameworks to explore In this article, we have explored some of the top Deep Learning frameworks that are out there and you should definitely try out. Some of them are TensorFlow, Keras, Caffe, Caffe2, MXNet, CNTK, BigDL, Torch, PyTorch, deeplearn.js and others

machine learning Popular Datasets in Machine Learning Data sets are important in Machine learning as the more better data we have, the better the model. The various popular data sets available for machine learning are ImageNet, MNIST, NIST, CIFAR-10 and YouTube 8M.

tensorflow Key ideas in TensorFlow TensorFlow is a popular Deep Learning library that is used to build Deep Learning models and is backed by Google. A few key ideas of TensorFlow are tensor, distributed computing, kernel abstraction, operation abstraction, computational graph, automatic gradient computation and others

tensorflow Build and install TensorFlow from source with MKL DNN support and AVX enabled In this guide, we will walk you through building and installing TensorFlow from source with support for MKL DNN and with AVX enabled. By following the six simple steps, you can build and install TensorFlow from source in 20 minutes

machine learning Differences between Torch and PyTorch deep learning libraries We have explored some of the differences between two popular frameworks namely Torch and PyTorch from the view of common origin, current development status, source code and implementation, usage, performance and ONNX support. As development of Torch has been paused, you should go with PyTorch

openmp OpenMP clauses: firstprivate, lastprivate, ordered There are three basic OpenMP clauses namely firstprivate, lastprivate and ordered. firstprivate clause is used to initialize a variable from the serial part of the code and private clause doesn't initialize the variable

openmp Basic OpenMP functions There are 3 basic functions in OpenMP namely omp_get_thread_num, omp_set_num_threads (nthreads) and omp_get_num_threads. We have given a basic C/ C++ example to demonstrate the use of the function and the observation of the output as well

openmp When to use OpenMP directives? If some section of your code can be parallelized and you have more than one processor, you should definitely speed up the execution of your program using OpenMP directives. We have demonstrated how to use Amdahl's law to calculate the expected speed up and profiling to find which section takes time

openmp Introduction to OpenMP OpenMP is an open source library which is used for multi-threaded parallel processing and shared-memory multi-processor (core) computers. In OpenMP, part of program is a single thread and part is multi-threaded We have given advantages, disadvantages, industry use example, approach of OpenMP library

machine learning ONNX format for interchangeable AI models The Open Neural Network Exchange Format (ONNX) is a format for exchanging deep learning/ artificial intelligence models. It will make deep learning models portable thus preventing vendor lock in. We have provided a real life use case of ONNX, benefits of ONNX and the key ideas and challenges of ONNX

machine learning MKL (Math Kernel Library), a Basic Linear Algebra Subprograms (BLAS) Library Intel's Math Kernel Library (Intel MKL) is a Basic Linear Algebra Subprograms (BLAS) Library that optimizes code with minimal effort for future generations of Intel processors. We have presented how to download and use MKL using an example of matrix multiplication in MKL Library

machine learning FLAME BLIS, a Basic Linear Algebra Subprograms Library FLAME BLIS is an open source portable software framework/ Basic Linear Algebra Subprograms (BLAS) library for instantiating high-performance BLAS-like dense linear algebra libraries. We have presented how to install and use BLIS using an example of matrix multiplication in FLAME BLIS

machine learning OpenBLAS, a Basic Linear Algebra Subprograms Library OpenBLAS is an open source optimized BLAS (Basic Linear Algebra Subprograms) library based on GotoBLAS2 1.13 BSD version. We have presented how to install and use OpenBLAS with a matrix multiplication example in OpenBLAS

machine learning Basic Linear Algebra Subprograms (BLAS) Library The BLAS (Basic Linear Algebra Subprograms) are routines that provide standard building blocks for performing basic vector and matrix operations. There are three levels within the BLAS library. The various implementations include Intel's MKL, BLIS, NetLib's BLAS, OpenBLAS, BLAS++ and others

machine learning PyTorch vs TensorFlow We have compared PyTorch and TensorFlow on the basis of various metrics to help you determine the framework you should go forward with. In short, TensorFlow gives you more control and high computational efficiency while PyTorch gives you the simplicity to develop applications.

machine learning Install and use NNVM Compiler NNVM compiler is a graph compiler for the TVM Stack that takes in models in NNVM Intermediate Representation format and compiles them for various backends such as LLVM, METAL, CUDA and others. We have presented how to install and build NNVM from source and how to use it with the configurations

machine learning NNVM Intermediate Representation NNVM is a reusable graph Intermediate Representation stack for deep learning systems. It provides useful API to construct, represent and transform computation graphs to get most high-level optimization needed in deep learning. NNVM is a part of TVM stack for deep learning and has a compiler as well

machine learning Run a ResNet34 model in ONNX format on TVM Stack with LLVM backend In this guide, we will run a ResNet34 model in ONNX format on the TVM Stack with LLVM backend. You do not need any specialized equipment like GPU and TPU to follow this guide. A simple CPU is enough.

machine learning Run a ResNet18 model in ONNX format on TVM Stack with LLVM backend In this guide, we will run a ResNet18 model in ONNX format on the TVM Stack with LLVM backend. You do not need any specialized equipment like GPU and TPU to follow this guide. A simple CPU is enough.

machine learning Run a ResNet101 model in ONNX format on TVM Stack with LLVM backend In this guide, we will run a ResNet101 model in ONNX format on the TVM Stack with LLVM backend. You do not need any specialized equipment like GPU and TPU to follow this guide. A simple CPU is enough.

machine learning Run a ResNet152 model in ONNX format on TVM Stack with LLVM backend In this guide, we will run a ResNet152 model in ONNX format on the TVM Stack with LLVM backend. You do not need any specialized equipment like GPU and TPU to follow this guide. A simple CPU is enough.