Changes between Version 101 and Version 102 of Other/Summer/2023/Inference


Timestamp: Aug 21, 2023, 8:59:40 PM
Author: LakshyaG42
  • Other/Summer/2023/Inference

}}}
== Overview
As machine learning models grow larger and more complex, running them on less powerful devices is becoming increasingly difficult, especially under latency constraints. However, offloading everything to the cloud is also inefficient, since it creates too much network traffic. A viable middle ground is edge computing, where the edge (the network infrastructure between user devices and the cloud) handles part of the computation.

1. Trained a small and a large neural network (DenseNet and MobileNetV2) on the CIFAR-10 dataset (see the training sketch after this list)
2. Performed PCA and SVM on the NNs to familiarize ourselves with PyTorch
3. Loaded the MNIST image database onto an ORBIT node
4. Connected two nodes in a client-server architecture and extracted data for time and accuracy measurements (see the socket sketch below)
5. Compared the performance of both neural networks on the CIFAR-10 dataset
  * Established a connection between the two nodes
  * Communicated test data between the nodes to compare the accuracy and delay of our NN models
6. Worked with professors/mentors and read papers to understand the concepts of early exit, split computing, the accuracy/latency tradeoff, and distributed DNNs over the edge cloud
7. Split ResNet-18 across two different devices using split computing and ran inference over the network (see the split-computing sketch below)
8. Used the Network Time Protocol (NTP) and sent data in "packages" (chunks) to collect latency and delay data (see the NTP sketch below)
9. Explored different research questions with the data collected: __________
10. Limited CPU power in the terminal to imitate mobile devices (see the throttling sketch below)
11. Implemented different confidence-based threshold values for deciding when to send data to the edge and server for inference (see the thresholding sketch below)
  * Generated graphs for threshold vs. latency, accuracy vs. latency, etc.
12. Retrained the neural network to achieve 88% accuracy and collected new graphs
13. Introduced a delay in both inference and data transfer to simulate a queue (see the queue-delay sketch below)
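A minimal sketch of how the CIFAR-10 training in step 1 can look in PyTorch. The hyperparameters (batch size, learning rate, epoch count) are illustrative assumptions, not the exact values we used.
{{{#!python
import torch
import torch.nn as nn
import torchvision
import torchvision.transforms as transforms

# Standard CIFAR-10 normalization constants
transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize((0.4914, 0.4822, 0.4465), (0.2470, 0.2435, 0.2616)),
])
trainset = torchvision.datasets.CIFAR10(root="./data", train=True,
                                        download=True, transform=transform)
trainloader = torch.utils.data.DataLoader(trainset, batch_size=128, shuffle=True)

# MobileNetV2 with its classifier resized to CIFAR-10's 10 classes;
# DenseNet can be swapped in via torchvision.models.densenet121(num_classes=10).
model = torchvision.models.mobilenet_v2(num_classes=10)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

for epoch in range(10):  # illustrative epoch count
    for images, labels in trainloader:
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()
}}}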
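For the client-server link in steps 4-5, a hedged sketch of one way to frame tensors over TCP. The IP address, port, and pickle-based framing are assumptions about our setup, not a record of the exact code we ran.
{{{#!python
import pickle
import socket
import struct
import time

import torch

def send_msg(sock, obj):
    data = pickle.dumps(obj)
    sock.sendall(struct.pack("!I", len(data)) + data)  # length-prefixed frame

def recv_exact(sock, n):
    buf = b""
    while len(buf) < n:
        chunk = sock.recv(n - len(buf))
        if not chunk:
            raise ConnectionError("socket closed")
        buf += chunk
    return buf

def recv_msg(sock):
    (length,) = struct.unpack("!I", recv_exact(sock, 4))
    return pickle.loads(recv_exact(sock, length))

# Client side: send a test batch to the server node and time the round trip.
test_batch = torch.randn(16, 3, 32, 32)  # stand-in for a CIFAR-10 batch
with socket.create_connection(("10.0.0.2", 5000)) as sock:  # server IP/port assumed
    start = time.time()
    send_msg(sock, test_batch)
    predictions = recv_msg(sock)
    print("round-trip latency:", time.time() - start)
}}}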
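The split-computing experiment in step 7 can be sketched by cutting torchvision's ResNet-18 into a head and a tail. The split point after layer2 is an illustrative assumption, as is running both halves in one process here rather than on two devices.
{{{#!python
import torch
import torch.nn as nn
from torchvision.models import resnet18

model = resnet18(num_classes=10)
layers = list(model.children())  # conv1, bn1, relu, maxpool, layer1..layer4, avgpool, fc

head = nn.Sequential(*layers[:6])    # client half: conv1 .. layer2 (assumed split point)
tail = nn.Sequential(*layers[6:-1])  # server half: layer3, layer4, avgpool
fc = layers[-1]                      # final classifier, also on the server

x = torch.randn(1, 3, 32, 32)             # one CIFAR-10-sized input
z = head(x)                               # intermediate activation: what crosses the network
out = fc(torch.flatten(tail(z), 1))       # server completes the inference
}}}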
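For the NTP timing in step 8, one way to get comparable timestamps on both nodes is to correct each clock by its NTP offset. The use of the third-party ntplib package and the pool.ntp.org server are assumptions, not confirmed from our logs.
{{{#!python
import time
import ntplib  # third-party: pip install ntplib (assumed)

offset = ntplib.NTPClient().request("pool.ntp.org").offset  # clock offset in seconds

def ntp_now():
    # NTP-corrected wall-clock time; comparable across nodes applying the same correction
    return time.time() + offset

# The sender stamps each chunk with ntp_now(); the receiver computes
# one-way delay = ntp_now() - stamp for every chunk it receives.
}}}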
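Step 10's CPU throttling can be approximated in two ways; the exact terminal command we ran is not recorded here, so both variants below are illustrative.
{{{#!python
import torch

# In-process: restrict PyTorch to a single CPU thread
torch.set_num_threads(1)

# From the terminal (illustrative shell commands):
#   taskset -c 0 python client.py      # pin the process to one core
#   cpulimit -l 50 -p <pid>            # cap that process at 50% of a core
}}}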
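The confidence thresholding in step 11 amounts to answering locally when the small model's top softmax probability is high enough and offloading otherwise. The threshold value 0.8 and the function names here are illustrative assumptions.
{{{#!python
import torch
import torch.nn.functional as F

THRESHOLD = 0.8  # illustrative; a range of values was swept for the graphs

def classify(x, local_model, offload_fn):
    with torch.no_grad():
        probs = F.softmax(local_model(x), dim=1)
    confidence, pred = probs.max(dim=1)
    if confidence.item() >= THRESHOLD:
        return pred.item()   # confident enough: answer on the weak device
    return offload_fn(x)     # uncertain: forward the input to the edge/server
}}}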
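Step 13's simulated queue can be as simple as sleeping for a random service time before inference and before each transfer; the exponential distribution and its mean below are assumptions.
{{{#!python
import random
import time

def queue_delay(mean_s=0.05):
    # Draw an exponential service time (distribution and mean are illustrative)
    time.sleep(random.expovariate(1.0 / mean_s))

# Called once before running inference and once before each data transfer.
}}}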
== Networking Setup for Our Experiment
     
}}}
== References