Deep Learning Checklist: 13 Months course

Week 3: Concepts in Inference

Once you have a DL model, run it.

Basics of Inference

Inference is the process of running a DL model. It may look simple, but it is a technically rich area. The main focus is on optimizing performance while keeping the accuracy (rate of correct output) of the model intact.
Training vs Inference

Training and Inference are the two parts of deep learning models. Inference is the simpler part, where we run a trained model. Remember that inference can be done on CPUs, whereas training usually requires special hardware like GPUs, TPUs, FPGAs and others. Training can also be done on CPUs, but for large models it may take years to complete.
Throughput

Throughput is a measurement in DL to determine the performance of various models for a specific application. Throughput refers to the number of data units processed in one unit of time.
Latency

Latency is a measurement in Machine Learning to determine the performance of various models for a specific application. Latency refers to the time taken to process one unit of data, provided that only one unit of data is processed at a time.
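To make the difference between the two measurements concrete, here is a minimal sketch of how they could be timed. The function `fake_model` is a hypothetical stand-in for a real DL model, and the batch size of 32 and the number of runs are arbitrary assumptions.

```python
import time

def fake_model(batch):
    # Hypothetical stand-in for a real model: a little arithmetic per input.
    return [sum(x * x for x in sample) for sample in batch]

def measure_latency(model, sample, runs=50):
    """Average seconds to process one input, one at a time (batch size 1)."""
    start = time.perf_counter()
    for _ in range(runs):
        model([sample])
    return (time.perf_counter() - start) / runs

def measure_throughput(model, batch, runs=50):
    """Inputs processed per second when the model is fed whole batches."""
    start = time.perf_counter()
    for _ in range(runs):
        model(batch)
    elapsed = time.perf_counter() - start
    return runs * len(batch) / elapsed

sample = [0.5] * 1000
batch = [sample] * 32
latency = measure_latency(fake_model, sample)
throughput = measure_throughput(fake_model, batch)
print(f"latency: {latency * 1e3:.3f} ms per input")
print(f"throughput: {throughput:.1f} inputs per second")
```

Latency is reported per input with batch size 1, while throughput counts all inputs finished per second and usually benefits from batching on real hardware.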
Floating Point (FP32) Format

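Floating Point Format describes how real-valued data such as weights and activations is represented. FP32, the 32-bit single-precision format, uses 1 sign bit, 8 exponent bits and 23 fraction bits. There are other floating point formats as well, such as FP16, which trade precision for memory and speed.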
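As an illustration of the FP32 format named in the heading, the sketch below splits a number into the sign, exponent and fraction fields of its 32-bit encoding. The helper names `fp32_fields` and `to_fp32` are made up for this example; only Python's standard `struct` module is used.

```python
import struct

def fp32_fields(value):
    """Split a number into the sign, exponent and fraction bits of its FP32 encoding."""
    # Pack as a big-endian 32-bit float, then reinterpret the 4 bytes as an unsigned int.
    (bits,) = struct.unpack(">I", struct.pack(">f", value))
    sign = bits >> 31
    exponent = (bits >> 23) & 0xFF   # 8 bits, stored with a bias of 127
    fraction = bits & 0x7FFFFF       # 23 bits
    return sign, exponent, fraction

def to_fp32(value):
    """Round a Python float (64-bit) to the nearest FP32 value."""
    return struct.unpack(">f", struct.pack(">f", value))[0]

print(fp32_fields(1.0))    # (0, 127, 0)
print(fp32_fields(-2.5))   # (1, 128, 2097152)
print(to_fp32(0.1))        # close to, but not exactly, 0.1
```

The last line shows why precision matters: FP32 keeps only about 7 significant decimal digits, so 0.1 is stored as a nearby value.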
Instruction set AVX2, AVX512-VNNI

SIMD instruction sets like SSE, AVX2 and AVX512 enable competitive performance of DL models on CPUs. For example, AVX512-VNNI instructions can result in 4X performance with INT8 models.
GCC intrinsics are an important concept: they let you use specific instructions directly instead of relying on the compiler to generate them.
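To show what a VNNI-style instruction computes, here is a rough Python emulation of the arithmetic only. It assumes the behaviour of the AVX512-VNNI instruction VPDPBUSD, which multiplies four unsigned 8-bit values by four signed 8-bit values and adds the sum to a 32-bit accumulator, for 16 such lanes at once. The function names are hypothetical, 32-bit wraparound is ignored, and the sketch gives no speedup; it only illustrates why INT8 data lets one instruction do more work.

```python
def vnni_dot_accumulate(acc, a_u8, b_s8):
    """Emulate one 32-bit lane: acc + sum of four (unsigned 8-bit * signed 8-bit) products."""
    assert len(a_u8) == len(b_s8) == 4
    assert all(0 <= a <= 255 for a in a_u8)
    assert all(-128 <= b <= 127 for b in b_s8)
    return acc + sum(a * b for a, b in zip(a_u8, b_s8))

def int8_dot(a_u8, b_s8):
    """Dot product of two INT8 vectors, consumed four elements per step."""
    acc = 0
    for i in range(0, len(a_u8), 4):
        acc = vnni_dot_accumulate(acc, a_u8[i:i + 4], b_s8[i:i + 4])
    return acc

activations = [1, 2, 3, 4, 250, 0, 7, 8]        # unsigned 8-bit values
weights = [10, -20, 30, -40, 1, 5, -128, 127]   # signed 8-bit values
print(int8_dot(activations, weights))           # 270
```

In C, the same operation would be reached through an intrinsic rather than a Python loop, which is where the GCC intrinsics mentioned above come in.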
Pb Format

Models need to be saved so that you can use them directly at any time. This is done using specific file formats like the Pb format, the ONNX format and many others.
A Pb (protocol buffer) file is a serialized version of a TensorFlow model that can be saved to disk and loaded back into memory. It contains all the information needed to reconstruct the TensorFlow model, including the model's architecture, variables, and operations.
ONNX Format

ONNX Format is designed to allow framework interoperability. There are many excellent machine learning libraries in various languages, and ONNX lets a model built in one of them be used from another.

Week 7: DL use cases

Understand the core tasks DL has mastered and the architectures involved.

Medical Imaging Diagnosis

Medical imaging diagnosis is changing with the help of deep learning. Nowadays, deep learning can be used to alert patients that they may be ill, or to help doctors identify that a patient has a certain disease.
Transportation

Transportation is a crucial part of human activity. With the help of deep learning, we can increase the efficiency of transportation control and manage traffic flows.
Differentiating Fake Faces

Fake images and deepfakes are a common problem on the internet. Learn how to differentiate fake faces using machine learning and computer vision. This project uses Jupyter Notebook along with the OpenCV, NumPy, Matplotlib and Scikit-Learn libraries.
Object Detection

Object Detection is an image processing task that refers to the identification of objects in digital images. It is also referred to by related terms such as object recognition, object identification and image detection.
Health Care

Machine learning may assist in the analysis of huge amounts of data, the identification of patterns and trends, and the prediction of outcomes based on that data. Machine learning has the potential to enhance patient outcomes, lower costs, and boost efficiency in healthcare.
Pancreatic Volumetry

Pancreatic Volumetry (measuring the volume of the pancreas) can be performed using deep learning, which can help increase the efficiency of the process.
Chest X Rays

Diagnoses from Chest X Rays can be predicted using deep learning, which can help increase the efficiency of the process.
Laptops

Different uses of deep learning in laptops are covered, and we will go over 6 different cases of how deep learning is helping the laptop industry.
Media

Media Industry is also affected by deep learning. This article at OpenGenus delves into how deep learning is being applied in the media industry, revolutionizing the way we create, consume, and interact with media content.

Week 10: Advanced Concepts

With a strong foundation, you can arrive at the advanced concepts on your own and contribute to the growth of DL.

Transposed Convolution

Transposed Convolution is also known as upsampled convolution, a name that refers to the task it accomplishes: upsampling the input feature map.
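The upsampling effect can be seen in a small 1D sketch. This is a simplified illustration, not a framework implementation: it assumes no padding, and the function name `transposed_conv1d`, the stride of 2 and the kernel values are chosen only for the example.

```python
def transposed_conv1d(x, kernel, stride=2):
    """Upsample a 1D signal: each input value scatters a scaled copy of the kernel."""
    out_len = (len(x) - 1) * stride + len(kernel)
    out = [0.0] * out_len
    for i, value in enumerate(x):
        for k, weight in enumerate(kernel):
            # Overlapping contributions from neighbouring inputs are summed.
            out[i * stride + k] += value * weight
    return out

x = [1.0, 2.0, 3.0]
print(transposed_conv1d(x, [1.0, 1.0, 1.0]))  # [1.0, 1.0, 3.0, 2.0, 5.0, 3.0, 3.0]
print(transposed_conv1d(x, [1.0, 0.5]))       # [1.0, 0.5, 2.0, 1.0, 3.0, 1.5]
```

An input of length 3 becomes an output of length 7 or 6, following `(len(x) - 1) * stride + len(kernel)`. In a real network the kernel values are learned.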
TVM

TVM is an open source deep learning compiler stack for CPUs, GPUs, and specialized accelerators. It aims to close the gap between the productivity-focused deep learning frameworks, and efficiency-oriented hardware backends. We have provided a brief introduction to the TVM Stack.
Floating Point Operations Per Second

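FLOPs (floating point operations) measure how much computation a model needs. This article covers the FLOPs of various machine learning models like VGG19, VGG16, GoogleNet, ResNet18, ResNet34, ResNet50, ResNet152 and others. The values range from 0.72 billion to 19.6 billion. Note that FLOPs, a count of operations, is distinct from FLOPS, operations per second, which describes hardware speed.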
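As a rough sketch of where such numbers come from, the FLOPs of a single convolution layer can be estimated from its shape. The formula below is a common approximation that ignores bias and activation and counts one multiply-accumulate as 2 operations (some sources count it as 1, which halves the result). The layer dimensions are an assumed example, a 3x3 convolution from 3 to 64 channels on a 224x224 output.

```python
def conv2d_flops(in_channels, out_channels, kernel_size, out_height, out_width):
    """Approximate FLOPs of one convolution layer (bias and activation ignored)."""
    macs_per_output = kernel_size * kernel_size * in_channels
    total_macs = macs_per_output * out_channels * out_height * out_width
    return 2 * total_macs  # one multiply plus one add per multiply-accumulate

flops = conv2d_flops(in_channels=3, out_channels=64, kernel_size=3,
                     out_height=224, out_width=224)
print(flops)                         # 173408256
print(f"{flops / 1e9:.3f} GFLOPs")   # 0.173 GFLOPs
```

Summing this estimate over every layer gives the total for a model.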
RefineDet

RefineDet generates a predetermined number of bounding boxes and scores indicating the presence of different classes of objects in those boxes, followed by non-maximum suppression (NMS).
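The non-maximum suppression (NMS) step mentioned above can be sketched as follows. This is a generic greedy NMS, not code taken from RefineDet itself; the box format `(x1, y1, x2, y2)` and the IoU threshold of 0.5 are assumptions made for the example.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def nms(boxes, scores, iou_threshold=0.5):
    """Return indices of the boxes kept, highest score first."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep

boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
print(nms(boxes, scores))  # [0, 2]
```

The second box overlaps the first with an IoU of about 0.68, so it is suppressed in favour of the higher-scoring one.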
Image Segmentation

Image Segmentation evaluation metrics are covered in this article. It includes Panoptic quality (PQ), segmentation quality (SQ) and recognition quality (RQ).
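The three metrics are related by PQ = SQ x RQ. A minimal sketch for one class, assuming the usual definitions where predicted and ground-truth segments count as a true-positive match when their IoU exceeds 0.5; the function name and the sample numbers are illustrative only.

```python
def panoptic_quality(matched_ious, num_false_positives, num_false_negatives):
    """Return (PQ, SQ, RQ) for one class.

    matched_ious holds the IoU of every true-positive segment match.
    """
    tp = len(matched_ious)
    denom = tp + 0.5 * num_false_positives + 0.5 * num_false_negatives
    if tp == 0 or denom == 0:
        return 0.0, 0.0, 0.0
    sq = sum(matched_ious) / tp   # average IoU of the matched segments
    rq = tp / denom               # F1-style detection score
    return sq * rq, sq, rq

pq, sq, rq = panoptic_quality([0.8, 0.9, 0.7], num_false_positives=1, num_false_negatives=1)
print(f"PQ={pq:.2f} SQ={sq:.2f} RQ={rq:.2f}")  # PQ=0.60 SQ=0.80 RQ=0.75
```

SQ rewards well-shaped masks, RQ rewards finding the right segments, and PQ combines the two.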
Hinge Loss for SVM

Hinge Loss for SVM is a type of loss function that is used to penalize the SVM for misclassifying data points.
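A minimal sketch of the hinge loss, assuming the standard form max(0, 1 - y * f(x)) with labels of +1 or -1 and raw SVM scores f(x). The sample labels and scores are made up.

```python
def hinge_loss(labels, scores):
    """Mean hinge loss; labels are +1 or -1, scores are raw SVM outputs."""
    losses = [max(0.0, 1.0 - y * s) for y, s in zip(labels, scores)]
    return sum(losses) / len(losses)

labels = [1, -1, 1, -1]
scores = [2.0, -0.5, -1.0, 0.3]
loss = hinge_loss(labels, scores)
print(f"{loss:.2f}")  # 0.95
```

The first point is correct with a margin of at least 1, so it costs nothing. The second is correct but inside the margin and is penalized lightly, while the two misclassified points are penalized most.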
One Shot Learning

One Shot Learning is a classification task where one, or a few, examples per class are used to classify many new examples in the future. Let us learn about it with the help of an example.
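One simple way to picture this is a nearest-neighbour lookup over embeddings, with a single stored example per class. The sketch below assumes the embeddings already come from some trained network (for instance a Siamese network); the 2D vectors and class names are invented for illustration.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def one_shot_classify(query, support):
    """support maps each class label to ONE example embedding; return the nearest label."""
    return min(support, key=lambda label: euclidean(query, support[label]))

# One (hypothetical) embedding per class is all the classifier gets to see.
support = {"cat": [0.9, 0.1], "dog": [0.1, 0.9]}
print(one_shot_classify([0.8, 0.3], support))  # cat
print(one_shot_classify([0.2, 0.7], support))  # dog
```

Adding a new class only requires storing one more embedding, with no retraining of the classifier itself.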
He initialization

He initialization, also known as Kaiming Initialization, is a widely used technique in deep learning for initializing the weights of neural networks.
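A minimal sketch of the normal-distribution variant, which draws each weight from a Gaussian with mean 0 and variance 2 / fan_in, where fan_in is the number of inputs to the layer. The layer sizes and the fixed seed are arbitrary choices for the example.

```python
import math
import random

def he_normal(fan_in, fan_out, seed=0):
    """Weight matrix (fan_in x fan_out) drawn from N(0, 2 / fan_in)."""
    rng = random.Random(seed)
    std = math.sqrt(2.0 / fan_in)
    return [[rng.gauss(0.0, std) for _ in range(fan_out)] for _ in range(fan_in)]

weights = he_normal(fan_in=512, fan_out=256)
flat = [w for row in weights for w in row]
mean = sum(flat) / len(flat)
variance = sum((w - mean) ** 2 for w in flat) / len(flat)
print(f"target variance: {2.0 / 512:.5f}, sample variance: {variance:.5f}")
```

Scaling the variance by the number of inputs keeps the size of activations roughly stable from layer to layer in networks that use ReLU.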
Top 50 Interview Questions

The 50 most frequently asked interview questions are covered in this article.

Week 12: LLMs

LLMs are the hottest topic today and DL has laid the foundation.

Large Language Models (LLMs)

Large Language Models have been one of the most significant and disruptive innovations of the 21st Century in the field of technology, with the potential to revolutionize a wide range of domains, from natural language processing and machine translation to content creation and even distant, seemingly unrelated domains such as literature and finance.
People Who Started LLM Revolution

People who started LLM Revolution will be discussed in the article. We will go over how each individual revolutionized the industry.
Transformers

Transformer questions are explored in this article. After going through the article, you will have a thorough understanding of the model. Vision Transformer is a transformer model used to build a new network for image recognition.
Self Attention

Self Attention is a process that makes each input pay attention to the other inputs in the same sequence. The attention given to each other input is weighted, and these weights are trainable.
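The idea can be sketched with scaled dot-product self-attention, the form used in Transformers. This is a plain-Python illustration on a tiny sequence. In a real model the projection matrices `w_q`, `w_k` and `w_v` are the trainable parameters; here they are fixed to the identity purely as an assumption for the example.

```python
import math

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    m = max(row)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(x, w_q, w_k, w_v):
    """Scaled dot-product self-attention over x of shape (seq_len, d_model)."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]
    scores = matmul(q, k_t)  # scores[i][j]: how strongly input i attends to input j
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    return matmul(weights, v), weights

x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]   # 3 inputs, each of dimension 2
identity = [[1.0, 0.0], [0.0, 1.0]]
output, weights = self_attention(x, identity, identity, identity)
for row in weights:
    print([round(w, 3) for w in row])
```

Each row of `weights` sums to 1 and says how much one input draws from every input in the sequence, including itself.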
ERNIE 3.0 TITAN LLM

ERNIE 3.0 TITAN LLM is a model developed by Baidu. Pre-trained language models such as ERNIE, GPT and BERT have revolutionized the field of Natural Language Processing (NLP) by improving language generation, analysis and understanding. This article at OpenGenus aims to provide you an overview of Baidu's ERNIE 3.0 TITAN LLM and briefly explore its architecture.
ChatGPT vs BARD

ChatGPT and Google BARD were created by two separate businesses, OpenAI and Google, respectively. Even though they have certain things in common, they also differ greatly.
GPT Models

All GPT Models are covered in this article. We will compare the developments, advantages and disadvantages of the individual models. For example, DistilGPT2 is a light version of GPT-2 with 6 layers and 82 million parameters, and its word embedding size is 768. GPT-3.5 is a fine-tuned version of the GPT-3 (Generative Pre-Trained Transformer) model. GPT-3.5 was developed in January 2022 and has 3 variants with 1.3B, 6B and 175B parameters respectively.

Deep Learning Checklist: 13 Months course

Week 1: Basics of DL

Week 2: Basic operations (ops)

Week 3: Concepts in Inference

Week 4: Concepts in Training DL models

Week 5: CNN Models

Week 6: Other DL Architecture

Week 7: DL use cases

Week 8: TensorFlow/ PyTorch

Week 9: Optimization

Week 10: Advanced Concepts

Week 11: NLP Model

Week 12: LLMs

Week 13: DL Projects

Best of Luck.