Differences

This shows you the differences between two versions of the page.

Link to this comparison view

ep:labs:09 [2021/12/03 21:17]
andrei.mirciu [Introduction]
ep:labs:09 [2025/05/15 21:17] (current)
andrei.mirciu [Resources]
Line 1: Line 1:
-====== Lab 09 - Memory, CPU and Network Monitoring (Windows) ​======+====== Lab 09 - Machine Learning ​======
  
 ===== Objectives ===== ===== Objectives =====
  
-  * **Find the root cause if the system runs out of memory** ​with Task Manager, Windows Performance Recorder, Windows Performance Analyzer, Visual Studio, VMMap. +  * Understand basic concepts of machine learning 
-  * **Monitor ​the CPU usage** with Task Manager, Windows Performance Recorder, Windows Performance Analyzer. +  ​Remember examples ​of real-world problems that can be solved ​with machine learning 
-  * **Check ​the amount ​of network traffic generated by processes** with Task Manager, Windows Performance Recorder, Microsoft Network Monitoring, Wireshark.+  * Learn the most common performance evaluation metrics for machine learning models 
 +  * Analyse ​the behaviour ​of typical machine learning algorithms using the most popular techniques 
 +  ​Be able to compare multiple machine learning models
  
-<note important>​+===== Resources =====
  
-You can download the **Windows 10 VM** via [[https://​ctipub-my.sharepoint.com/:​u:/​g/​personal/​radu_mantu_upb_ro/​EXSrHQMCkWBEpGYseFEmnnABCA1hyb1oGWMUhnnHx8LIdQ?​e=I0pxHg | OneDrive]].+In this lab, we will study basic performance evaluation techniques used in machine learning, covering elementary concepts such as classification,​ regression, data fitting, clustering and much more
  
-If you need to use VirtualBoxyou can use this //.ovf// version to import the VM (just on OneDrive) +You will work in an environment that is easy to use, and provides a couple of tools like manipulating data and visualizing results. We will use a **Jupyter Notebook** hosted ​on **Google Colab**, which comes with a variety of useful tools already installed.
-[[https://​ctipub-my.sharepoint.com/:​u:/​g/​personal/​cezar_craciunoiu_upb_ro/​EZYR_YFyHx5GiHf5yBNuiyYB-zXhIaTNzJ8o8Ri2M8l5Mw?​e=9qxrde | OneDrive]].+
  
-There is also the option to download as a torrent {{:ep:​labs:​ep_win10_vm.7z.torrent.txt}}. +The exercises will be solved in Python, using popular libraries that are usually integrated in machine learning projects:
-DokuWiki is not configured to accept //​.torrent//​ files so remove the //.txt// extension. +
-After that, you know what to do...+
  
-Alternatively,​ you can install the following on your own Windows machine: +  ​* [[https://scikit-learn.org/stable/documentation.html|Scikit-Learn]]: fast model development,​ performance metrics, pipelines, dataset splitting 
-  * **[[https://go.microsoft.com/fwlink/?​linkid=2120254 ​ADK]]** - make sure to check //**Windows Performance Analyser**// and //**Windows Performance Recorder**//​. +  ​[[https://pandas.pydata.org/pandas-docs/stable/|Pandas]]: data frames, csv parser, data analysis 
-  ​* **[[https://visualstudio.microsoft.com/downloads/ | Visual Studio Community Edition]]** - select //C++ development//​. ​ +  * [[https://numpy.org/doc/|NumPy]]: scientific computation 
-  ​* **[[https://docs.microsoft.com/en-us/sysinternals/​downloads/​sysinternals-suite ​Sysinternals suite]]**+  * [[https://matplotlib.org/3.1.1/users/index.html|Matplotlib]]: data plotting
  
-</​note>​+As datasets, we will use some public corpora provided by the Kaggle community:
  
-<note important>​ +  * [[https://​www.kaggle.com/​uciml/​pima-indians-diabetes-database/​data|Classification Dataset]] 
- If Visual Studio prompts you with an "​Expired"​ message, you will have to log in with your (university) account+  * [[https://​www.kaggle.com/​zaraavagyan/weathercsv|Regression dataset]]
-</note>+
  
-===== Contents ===== +You can also check out these cheet sheets for fast reference to the most common libraries:
-{{page>:ep:​labs:​09:​meta:​nav&​nofooter&​noeditbutton}}+
  
-===== Introduction =====+**Cheat sheets:**
  
-The Windows operating system contains plenty of built-in tools to analyze its resource usageThe most famous one is probably the Windows Task Manager, as it highlights resource usage of individual processes and gives admins and users the option to kill any misbehaving ones.+  * [[https://​perso.limsi.fr/​pointal/​_media/​python:​cours:​mementopython3-english.pdf)|python]] 
 +  * [[https://​s3.amazonaws.com/​assets.datacamp.com/​blog_assets/​Numpy_Python_Cheat_Sheet.pdf|numpy]] 
 +  * [[https://​s3.amazonaws.com/​assets.datacamp.com/​blog_assets/​Python_Matplotlib_Cheat_Sheet.pdf|matplotlib]] 
 +  * [[https://​s3.amazonaws.com/​assets.datacamp.com/​blog_assets/​Scikit_Learn_Cheat_Sheet_Python.pdf|sklearn]] 
 +  * [[https://​github.com/​pandas-dev/​pandas/​blob/​master/​doc/​cheatsheet/​Pandas_Cheat_Sheet.pdf|pandas]]
  
-The Performance Monitor and Resource Monitor are two additional tools that admins and experienced Windows users may use to analyze performance or any resources related issues on Windows machines.+<​solution -hidden>​ 
 +[[https://​colab.research.google.com/​drive/​1aeV9PGF_uxBA3FoKNMEzsiXMxjVSCcm4?​usp=sharing|Solution]] 
 +</​solution>​
  
 ===== Tasks ===== ===== Tasks =====
  
-<note warning>​ +==== Google Colab Notebook ====
-The password for {{:​ep:​laboratoare:​logs2.zip | log2.zip}} and {{:​ep:​labs:​build.zip | build.zip}} is: //​parola//​ +
-</​note>​+
  
-{{namespace>​:ep:​labs:​09:​contents:​tasks&​nofooter&​noeditbutton}}+For this lab, we will use Google Colab for exploring performance evaluation in machine learning. Please solve your tasks [[https://​github.com/​vladastefanescu/​machine-learning-introduction/​blob/​main/​Machine_Learning_Introduction.ipynb|here]] by clicking "​**Open in Colaboratory**"​.
  
-== 01[30p] RAM Monitoring ==+You can then export this python notebook as a PDF (**File -> Print**) and upload it to **Moodle**.
  
-The memory usage by processes is dynamic, thus it is possible to have memory allocation spikes. **Task Manager** shows the amount of memory allocated to a process. To see this information,​ check the **Peak Working Set** value in the Details tab. It can be noticed that sometimes **Peak Working Set** can be significantly larger than **Working Set**. ​+===== Feedback =====
  
-<​spoiler>​ +Please take a minute to fill in the **[[https://forms.gle/NpSRnoEh9NLYowFr5 | feedback form]]** for this lab.
-{{ :ep:​laboratoare:​ep5_taskmanager-ram.jpg?400 |}}  +
-</spoiler>  ​+
  
-== [15p] Task A - Identify the problem == 
  
-  * What can you do if the system runs out of memory, but you do not know what causes this? 
  
-Imagine a situation when a system encounters issues if some conditions are met, and these conditions can be reproduced. To find out the source of this problem, start **Windows Performance Recorder** configured as shown below.  ​ 
  
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wpa-ram-rec.jpg?​400 |}}  ​ 
-</​spoiler> ​ 
  
-Run in parallel a program that allocates 1MB of memory every 100 milliseconds for a while and then stops. After the program stops running, save the capture and open it in **Windows Performance Analyzer**. You should get the following. 
  
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wpa-ram-select.jpg?​400 |}}    
-</​spoiler>  ​ 
  
-Click **Add all Memory graphs to Analysis View** and scroll down to the **Virtual Memory Snapshot** graph which shows the memory usage for all processes. 
  
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wpa-ram-graphs.jpg?​400 |}}  ​ 
-</​spoiler>  ​ 
  
-After installing, it requires including the //vld.h// file. When writing the code, the following functions need to be overwritten:​ malloc, free, new, and delete. This allows each memory allocation and deallocation to be tracked. All the detected leakages (having an allocation that is not followed by a deallocation) will be saved in a log file that can be viewed after the program stops running. In the bottom part of the screenshot shown below, it can be noticed where the allocation took place and that it is not followed by a deallocation. 
  
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_visualstudio-vld.jpg?​400 |}}  ​ 
-</​spoiler>  ​ 
  
-However, it might be the case that this is not a memory leak, and somewhere at the end of the program the memory gets unallocated. If this is the case, how can you determine which part of the program is responsible for generating the spike? This requires using another tool from SysInternals,​ [[https://​technet.microsoft.com/​en-us/​sysinternals/​vmmap.aspx | VMMap]]. With this tool you can view a process'​s memory allocations and usage. Use it to run the program that allocates 1 MB of memory every 100 milliseconds. 
  
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_vmmap-configure.jpg?​400 |}}    
-</​spoiler>  ​ 
-  
-Prior to starting the tool, go to Options, Configure Symbols and set the paths to the program'​s,​ to the Microsoft Symbol Server, and to the program'​s source files. Start the tool, select **Launch and trace a new process**, select the process, select the directory where it will run, and let it run. You will see something similar to the screenshot below. To view the latest memory allocations,​ you need to double-click **Heap** in the upper-part of the screenshot, and hit F5 (refresh) from time to time. In the bottom part of the screenshot, you can view the memory allocations. If you click one and press **Heap Allocations** you can see the stack where the allocation occurred. By pressing the "​Source"​ button, you can view the actual code for the allocation. ​ 
  
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_vmmap-analyze.jpg?​400 |}}    
-</​spoiler> ​ 
- 
-== [15p] Task B - Conclusions == 
- 
-:!: :!: NON-DEMO TASK  
- 
-  * Provide screenshots with your output and add your explanation. 
- 
-<​hidden>​ 
-  * It can be noticed the steep line representing the increase of our process'​s memory usage. This leads to the conclusion that this process is the cause of the problem since all the other ones seem stable. 
-  * So far, it was identified the process that causes problems. If this process is not written by you, you can check who is launching it. If you don't need it, you can deactivate it. If you need it, check if there are any updates to fix it. If there aren't any updates to fix the problem, you can try reporting the problem to the producer and hope they will fix it. 
-</​hidden>​ 
- 
-== 02. [30p] CPU Monitoring == 
- 
-== Task Manager == 
-Monitoring the CPU usage presents similar issues to the ones encountered when monitoring the memory usage. Task Manager can help find out the current CPU usage for a process. ​ 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_taskmanager-cpu.jpg?​400 |}}  ​ 
-</​spoiler> ​ 
- 
-== Windows Performance Recorder == 
-Considering the same scenario as the one presented in the previous section: what if the CPU usage briefly spikes up and then goes back to normal, how can you catch this? Using Task Manager would mean having someone continuously watching what is happening to catch the moment when the spike occurs. Use Windows Performance Recorder, with the same settings as in the screenshot below. ​ 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wpr-cpuusagestart.jpg?​400 |}}  ​ 
-</​spoiler>  ​ 
- 
-== Windows Performance Analyzer ==  
-After clicking start, run CPUUsage.exe (part of the resources provided at the end of the tutorial), which generates CPU usage. Save and open the catch. 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wpr-cpuusageselect.jpg?​400 |}}  ​ 
-</​spoiler>​ 
- 
-Right-click on to the Computation area and click "Add All Calculation graphs to Analysis View". 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wpr-cpuusagegraphs.jpg?​400 |}}  ​ 
-</​spoiler>  ​ 
- 
-The graph shows the impact of CPUUsage.exe,​ which inflicts a 12% CPU usage. Part of the problem is solved, it was determined who is generating the CPU usage. Now, to further debug the situation, as in the previous case of the memory, if this process was not written by you, check if it is useful, and if not, make sure to stop it. If it is useful, but it's not yours, you can try to find an update to fix the problem, or report the problem to the producer. If the program is written by you (this course - Performance Evaluation - targets the processes written by us), then it is important to determine what causes this problem. Unfortunately,​ unlike in the case of monitoring the memory usage, there is no tool that shows the stack with the problem, so you need to create one. Open EvenimenteProcMon,​ which has the purpose of integrating your messages with ProcessMonitor so they can be viewed as the process unfolds. It is necessary to understand any code, not perfectly, but at least to get the big picture of what is going on. 
- 
-A ProcessMonitor class with 5 functions was created: 
- 
-  * **OpenProcMon** opens up a handle for the ProcessMonitor'​s message interface. 
-  * **CloseProcMon** closes this handle. 
-  * **ProcMonLog** writes the message that is passed as a parameter to the ProcessMonitor interface. 
-  * **MyProcMon** is the class constructor. It is called when a MyProcMon object is declared. 
-  * **~ MyProcMon** is the class destructor. It is called to destroy the MyProcMon object. 
- 
-The code below highlights that it was declared globally: 
- 
-<​code>​ 
-MyProcMon __procMon; 
-</​code>​ 
- 
-This means that at the start of the process, before executing the main function, when the global variables are initialized,​ our class instance will be constructed along with the implicit handle for the ProcessMonitor message interface. The handle is closed when the object is destroyed, after the program'​s execution ends. 
- 
-Another class was declared, ProcMonLogFunc,​ with the purpose of highlighting when entering and leaving a function. This led to defining the following macro, which declares a ProcMonLogFunc object and passes it the name of the current function as a parameter. ​ 
- 
-<​code>​ 
-#define DBGTRACE_FN_ () ProcMonLogFunc __my_log __ (__ FUNCTIONW__) 
-</​code>​ 
- 
-| Start ProcessMonitor and change the filter to ProcessName contains EvenimenteProcMon. Select the profiling button as shown below. | {{ :​ep:​laboratoare:​ep5_butonprofiling.png?​400 |}} || 
- 
-| Running the program, generates a Process Monitor capture like this one. | {{ :​ep:​laboratoare:​ep5_procmon-profiling.jpg?​400 |}} || 
- 
-Notice messages such as Output: ==> Func1 and Output: <== Func1, with the associated times for these events in the left-hand side of the screenshot, in the Time of Day column. The difference between the times (4:​42:​07.1848883 and 4:​42:​07.1848955) is 72, and since the times after the comma are expressed in hundreds of nanoseconds,​ this means that func1 took 7.2 microseconds. 
- 
-As it is inefficient to calculate by hand the times for each function, save the output in csv format (File -> Save and choose the "​Comma-Separated Values"​ option). The generated file will look like this: 
- 
-<​code>​ 
-"​4:​42:​07.1846936 PM","​EvenimenteProcMon.exe","​6352","​Debug Output Profiling","","","​Output:​ ==>​main"​ 
-"​4:​42:​07.1848812 PM","​EvenimenteProcMon.exe","​6352","​Debug Output Profiling","","","​Output:​ Acesta e logul meu 1" 
-"​4:​42:​07.1848883 PM","​EvenimenteProcMon.exe","​6352","​Debug Output Profiling","","","​Output:​ ==>​Func1"​ 
-"​4:​42:​07.1848955 PM","​EvenimenteProcMon.exe","​6352","​Debug Output Profiling","","","​Output:​ <​==Func1"​ 
-"​4:​42:​07.1848990 PM","​EvenimenteProcMon.exe","​6352","​Debug Output Profiling","","","​Output:​ ==>​Func2"​ 
-"​4:​42:​07.1849038 PM","​EvenimenteProcMon.exe","​6352","​Debug Output Profiling","","","​Output:​ <​==Func2"​ 
-"​4:​42:​07.1849069 PM","​EvenimenteProcMon.exe","​6352","​Debug Output Profiling","","","​Output:​ ==>​Func3"​ 
-"​4:​42:​07.1849105 PM","​EvenimenteProcMon.exe","​6352","​Debug Output Profiling","","","​Output:​ <​==Func3"​ 
-"​4:​42:​07.1849148 PM","​EvenimenteProcMon.exe","​6352","​Debug Output Profiling","","","​Output:​ Acesta e logul meu 2" 
-"​4:​42:​07.1849184 PM","​EvenimenteProcMon.exe","​6352","​Debug Output Profiling","","","​Output:​ <​==main"​ 
-</​code>​ 
- 
-Making a parser in Python would make it easy to notice in which of the functions was spent the most time. If you only want to take into account the CPU usage, you need to have logging messages before and after every I/O operation, in order to not count in their time. 
- 
-:!: :!: NON-DEMO TASK  
- 
-Integrate CPUUsage with ProcessMonitor and find out the total time spent in every function. 
- 
-== 03. [40p] Network Monitoring == 
- 
-== Task A [20p] - Go through tutorial == 
- 
-== Task Manager ==  
-The amount of network traffic generated by a process can be seen using Task Mananger. 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_taskmanagernetworking.jpg?​400 |}}    
-</​spoiler>  ​ 
- 
-== Windows Performance Recorder ==  
-The resources for this tutorial include Winhttp.exe,​ a program that downloads putty.exe. The above screenshot displays its network activity. However, if the process generating the network activity is unknown, you can use Windows Performance Recorder with the following settings. Save and open the capture to view it. The statistics offered by Windows Performance Analyzer are for the total use of the network, rather than per process statistics. 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wpr-cpustart.jpg?​400 |}}    
-</​spoiler>  ​ 
- 
-== Microsoft Network Monitoring ==  
-For this reason, we are calling upon another tool developed by Microsoft. Install it, start it using "Run as administrator",​ and select the network interface through which the traffic is expected to pass (cable, wifi, ...). You should get a capture such as this one:  
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_netmon.jpg?​400 |}}  ​ 
-</​spoiler>  ​ 
- 
-==  Wireshark ==  
-As in the case of the CPU, inspecting the events taking place on the network involves some amount of work for the analyst. However, this being a simple case, you can just expand the view on the traffic generated by Winhttp.exe,​ and notice the request for //​putty.exe//​. If it is not clear why some requests are there or why they last so long, you can integrate the application that you wish to investigate with ProcessMonitor. This way you can insert logging elements to find out what request are made and how long they take. The part with timing the requests and traffic can be determined straight from Network Monitor by considering the times of the packets. For displaying all traffic on a http connection (it can also be https as long as you control the server, but this in not in the scope of this tutorial), you can use another tool, [[https://​www.wireshark.org/​download.html | Wireshark]]. Install Wireshark (**64bit!!!**) accepting the default settings. Start it and select the interface that you want to listen to. 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wireshark-start.jpg?​400 |}}  ​ 
-</​spoiler> ​ 
- 
-Click the //Start// button and run Winhttp.exe. After Winhttp.exe stops, click the Stop button in Wireshark. ​ 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wireshark-captura.jpg?​400 |}}  
-</​spoiler>  ​ 
- 
-This way you have obtained a traffic capture while winhttp.exe was running. Viewing the code for winhttp.exe,​ it can be noticed that it makes a request to www.sociouman-usamvb.ro. Use the ping command to get the IP address for this url. 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_findip.jpg?​400 |}}  
-</​spoiler> ​ 
- 
-Switching back to Wireshark, add a filter for ip.addr = 86.106.30.115 (make sure to use the IP address identified using ping command). Right click Get documents and choose Follow TCP Stream. 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wireshark-captura2.jpg?​400 |}} 
-</​spoiler>  ​ 
- 
-In the bottom part of the Wireshark window, at the "//​Show and save data as//" option choose "​Raw"​. Save the capture (using the "Save as" button) as "//​my.pdf//"​. ​ 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wireshark-rawdata.jpg?​400 |}} 
-</​spoiler>  ​ 
- 
-Use Notepad++ to open the my.pdf file and remove the headers as shown in the screenshot below. 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wireshark-extractdata.jpg?​400 |}}  
-</​spoiler> ​ 
- 
-Save it, close Notepad++ and double-click on the newly saved file (my.pdf). ​ 
- 
-<​spoiler>​ 
-{{ :​ep:​laboratoare:​ep5_wireshark-extractdata-result.jpg?​400 |}} 
-</​spoiler>  ​ 
- 
-== Task B [20p] - Conclusions == 
- 
-:!: :!: NON-DEMO TASK  
- 
-  * Follow the steps above and provide screenshots of the pdf file. 
- 
-<​hidden>​ 
-  * It was possible to obtain a valid pdf file. That means that you were able to extract the conversation data from the packet exchange. This was possible due to dealing with a http communication. Otherwise, it would have been much more complicated if https was used for the communication and you would not control the server. 
-</​hidden>​ 
- 
- 
-=== 04. [10p] Feedback === 
- 
-:!: :!: NON-DEMO TASK  
- 
-  * Please take a minute to fill in the **[[https://​forms.gle/​KHMVUhNfCPoR71Ew7 | feedback form]]** for this lab. 
- 
-{{ :​ep:​laboratoare:​ep4_logo_bitd2.png?​300 |}} 
- 
-{{namespace>:​ep:​labs:​09:​contents:​tasks&​nofooter&​noeditbutton}} 
  
ep/labs/09.1638559044.txt.gz · Last modified: 2021/12/03 21:17 by andrei.mirciu
CC Attribution-Share Alike 3.0 Unported
www.chimeric.de Valid CSS Driven by DokuWiki do yourself a favour and use a real browser - get firefox!! Recent changes RSS feed Valid XHTML 1.0