Ehsan Yousefzadeh-Asl-Miandoab

109 Followers


Sep 21

Learning about your system’s hardware setup within the Ubuntu terminal

In this post, I gathered the commands needed to learn about the system’s CPU, GPU, system memory, disk, and how CPUs and GPUs are connected. CPU: The primary processor’s information can be revealed with the following command: lscpu. In the output, you will see the info as follows. or…

System

3 min read
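
The teaser above names lscpu as the command for CPU information. As a minimal sketch, assuming lscpu prints "Key: value" lines (the helper names parse_lscpu and read_lscpu and the SAMPLE text are hypothetical, not taken from the post), the output could be collected into a dictionary like this:

```python
import shutil
import subprocess


def parse_lscpu(text: str) -> dict[str, str]:
    """Turn 'Key: value' lines of lscpu-style output into a dictionary."""
    info = {}
    for line in text.splitlines():
        # Split on the first colon only, so values may contain colons.
        key, sep, value = line.partition(":")
        if sep and key.strip():
            info[key.strip()] = value.strip()
    return info


def read_lscpu() -> dict[str, str]:
    """Run lscpu if it is installed; return an empty dict otherwise."""
    if shutil.which("lscpu") is None:
        return {}
    result = subprocess.run(
        ["lscpu"], capture_output=True, text=True, check=False
    )
    return parse_lscpu(result.stdout)


# Hypothetical sample text, not real output from any machine.
SAMPLE = """Architecture:        x86_64
CPU(s):              8
Model name:          Example CPU @ 3.00GHz
"""

if __name__ == "__main__":
    # Fall back to the sample when lscpu is unavailable (e.g. not on Linux).
    cpu = read_lscpu() or parse_lscpu(SAMPLE)
    for field in ("Architecture", "CPU(s)", "Model name"):
        print(f"{field}: {cpu.get(field, 'unknown')}")
```

The field names looked up at the end are common lscpu labels, but the exact set varies by version and locale, so the sketch falls back to "unknown" rather than assuming a field exists.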



Published in

MLearning.ai

·Aug 11

“AntMan: Dynamic Scaling on GPU Clusters for Deep Learning” paper summary

Introduction: GPUs, the main accelerators for deep learning training tasks, suffer from under-utilization. First, under-utilization means spending more money on purchasing GPUs. Second, it means wasting energy. Third, better use of the same hardware could mean higher performance! The authors of AntMan [1] propose…

Deep Learning

5 min read



Published in

MLearning.ai

·Aug 7

“Looking beyond GPUs for DNN Scheduling on Multi-Tenant Clusters” paper summary

Introduction: Training deep learning models is a heavy task in terms of computation and memory requirements. Enterprises and research and development teams share GPU clusters for this purpose. Usually, there is a resource manager and scheduler (e.g., SLURM, LSF, Kubernetes, Apache YARN, etc.) on the clusters to receive jobs and allocate GPUs…

Deep Learning

3 min read



Jun 15

Metrics in Machine Learning

This post reviews machine learning and its subbranches very concisely as an introduction, then delves into metrics. It is important to be careful when choosing, using, or coming up with new metrics for evaluating how well an ML approach performs. Machine Learning (ML): ML is about learning patterns in data instead of explicitly programming…

Machine Learning

10 min read



May 3

Profiling PyTorch model with NVIDIA Nsight Systems gives Error when MIG is enabled for any of the GPUs

This post aims to spread the word about disabling MIG mode on all GPUs when using Nsight Systems to profile a deep learning model developed with PyTorch. If MIG is not disabled, you will see errors about a device that is in MIG mode, followed by a bunch of bucket errors!

Pytorch

1 min read



Published in

MLearning.ai

·May 3

Profiling a Training Task with PyTorch Profiler and viewing it on Tensorboard

This post briefly shows, with an example, how to profile a model’s training task with the help of the PyTorch profiler. Developers use profiling tools to understand the behavior of their code so that they can optimize it. The TensorFlow framework provides a good ecosystem for machine learning developers…

Profiling

5 min read



Published in

MLearning.ai

·Feb 18

The Extinction of Programming

When the terms “traditional programming” or “programming” are used, we usually mean using programming languages like Python, Java, C#, etc. to re-express a programmer’s solution to a problem in a form that a computer system can execute. …

Programming

3 min read



Published in

MLearning.ai

·Jan 27

Setting up Conda, TensorFlow, and PyTorch for your Deep Learning Experimentation

This post reviews how to build an experimentation environment for deep learning with conda. I compile the commands to make setting up the environment easier. Step 1: Get miniconda installed. Conda is a package and environment management tool that eases the life of developers by eliminating the necessity of changing versions of packages…

Deep Learning

2 min read



Published in

MLearning.ai

·Jan 2

Multi-Instance GPU (MIG) of NVIDIA GPUs

Introduction and History: This post delves into the MIG feature of NVIDIA GPU devices. GPUs are identified as the primary parallel general-purpose processors for tasks like rendering graphics and training deep learning models. If you need to learn more about how these processors work, read the following post: How do GPUs work?

Mig

5 min read



Published in

MLearning.ai

·Sep 7, 2022

How do GPUs work?

This post reviews all that is needed to understand how GPUs execute code. The assumption is that you know the basics of computer architecture to grasp this concept fully. This post starts with Flynn’s taxonomy, then goes into the single instruction multiple data (SIMD) processor. Afterwards, x86 instruction extension for…

Multithreading

7 min read



My Resume https://ehsanyousefzadehasl.github.io/ehsanyousefzadehasl/
