# Differences

This shows you the differences between two versions of the page.

Link to this comparison view

ep:labs:05 [2019/10/27 17:09]
ep:labs:05 [2020/11/10 16:44] (current)
ioan_adrian.cosma [Python Scientific Computing Resources]
Line 1: Line 1:
-====== Lab 05 - Plotting ======+====== Lab 05 - Plotting ​(Numpy & Matplotlib) ​======

===== Objectives ===== ===== Objectives =====

-  * Offer an introduction to Gnuplot +  * Offer an introduction to Numpy & matplotlib
-  * Get you familiarised with basic plots in Gnuplot+  * Get you familiarised with the numpy API
+  * Understand ​basic plotting with matplotlib

-===== Contents =====

-{{page>:​ep:​labs:​05:​meta:​nav&​nofooter&​noeditbutton}}

-===== Gnuplot Introduction ​=====+===== Python Scientific Computing Resources ​=====

-Gnuplot is freecommand-driveninteractive,​ function ​and data plotting program. It can be downloaded at https://​sourceforge.net/​projects/​gnuplot/​. The official Gnuplot documentation can be found at http://​gnuplot.sourceforge.net/​documentation.html.+In this lab, we will study new library in python that offers fastmemory efficient manipulation of vectorsmatrices ​and tensors**numpy**We will also study basic plotting of data using the most popular data visualization libraries in the python ecosystem**matplotlib**

-It was originally created ​to allow scientists and students to visualize mathematical functions ​and data interactively, but has grown to support many non-interactive uses such as web scripting. It is also used as a plotting engine by third-party applications such as Octave.+For scientific computing we need an environment that is easy to use, and provides a couple of tools like manipulating ​data and visualizing results.
+Python is very easy to use, but the downside ​is that it's not fast at numerical computing. Luckily, we have very eficient libraries for all our use-cases.

-The command language of Gnuplot is case sensitive, i.e. commands and function names written in lowercase are not the same as those written in capitals. All command names may be abbreviated as long as the abbreviation is not ambiguous. Any number of commands may appear on a line, separated by semicolons (;). Strings may be set off by either single or double quotes, although there are some subtle differences. See syntax (p. 44) and quotes (p. 44) for more details in the Gnuplot documentation (http://​gnuplot.sourceforge.net/​docs_5.0/​gnuplot.pdf).+**Core computing libraries**

-Commands may extend over several input lines by ending each line but the last with a backslash (\). The backslash must be the last character on each line. The effect is as if the backslash ​and newline were not there. That is, no white space is implied, nor is a comment terminated. ​+  * numpy and scipy: scientific computing
+  * matplotlib: plotting library

-For built-in help on any topic, type **help** followed by the name of the topic or **help ?** to get a menu of available topics.+**Machine Learning**

+  * sklearn: machine learning toolkit
+  * tensorflow: deep learning framework developed by google
+  * keras: deep learning framework on top of tensorflow for easier implementation
+  * pytorch: deep learning framework developed by facebook

-===== Tutorial =====

+**Statistics and data analysis**

-{{namespace>​:ep:labs:​05:​contents:​tutorial&​nofooter&​noeditbutton}}+  * pandasvery popular data analysis library
+  * statsmodelsstatistics

+We also have advanced interactive environments:​

-===== Tasks =====+  * IPython: advanced python console
+  * Jupyter: notebooks in the browser

-{{namespace>:​ep:​labs:​04:​contents:​tasks&​nofooter&​noeditbutton}}+There are many more scientific libraries available.

-==== 05. [20p] Gnuplot graphs ====

-Using the Gnuplot documentation,​ implement a script that plots four graphs. Use the data from {{:​ep:​laboratoare:​data4.txt|}} as follows: the first graph should plot columns 1 and 2, the second columns 1 and 3, the third one columns 1 and 4, and the fourth one should be a 3D graph plotting columns 1, 2 and 3. +Check out these cheetsheets ​for fast reference ​to the common libraries:
-  * Use different colours ​for the data in each graph. +
-  * Remove the keys for each graph. +
-  * Give a title to each graph. +
-  * Give names to each of the axesX or Y (or Z for the 3D graph). In the case of the 3D graph: Make the numbers on the axes readable, and correct the position of the names of the axes if these are displayed over the axes numbers. +
-  * Make the script generate the .eps file containing your plot.+

-<note tip> +**Cheat sheets:**
- **Hint:** Consider the following commands: set key, xlabel, set title, set xtics, set terminal, set output, using. +
-</​note>​+

-<​solution ​-hidden> +  ​[[https://​perso.limsi.fr/​pointal/​_media/​python:​cours:​mementopython3-english.pdf)|python]]
-<​code>​+  - [[https://​s3.amazonaws.com/​assets.datacamp.com/​blog_assets/​Numpy_Python_Cheat_Sheet.pdf|numpy]]
+  - [[https://​s3.amazonaws.com/​assets.datacamp.com/​blog_assets/​Python_Matplotlib_Cheat_Sheet.pdf|matplotlib]]
+  - [[https://​s3.amazonaws.com/​assets.datacamp.com/​blog_assets/​Scikit_Learn_Cheat_Sheet_Python.pdf|sklearn]]
+  - [[https://​github.com/​pandas-dev/​pandas/​blob/​master/​doc/​cheatsheet/​Pandas_Cheat_Sheet.pdf|pandas]]

-reset #flush all variables+**Other:**

-set term postscript color eps enhanced +  - [[https://​stanford.edu/​~shervine/​teaching/​cs-229/​refresher-probabilities-statistics|Probabilities & Stats Refresher]]
-set output '​myplot.eps' +  - [[https://​stanford.edu/​~shervine/​teaching/​cs-229/​refresher-algebra-calculus|Algebra]]
-set size 1,1 #use default pallet size(100% of width and height) +
-set multiplot +
-unset key+

-#Graph 1
-set size 0.5,0.5 #half the width and height
-set origin 0,0.5 #x,y
-set title '​First'​
-plot '​data4.txt'​ using 1:2  w l lw 0.5

-#Graph 2 +<note>This lab is organized in a Jupyer Notebook hosted on Google ColabYou will find there some intuitions ​and applications ​for numpy and matplotlibCheck out the Tasks section below.</​note>​
-set size 0.5,0.5  +
-set origin 0.5,0.5 +
-set xlabel '​X'​ +
-set ylabel '​Y'​ +
-set title '​Second'​ +
-plot '​data4.txt'​ using 1:3 w l lw 0.5 +
- +
-#Graph 3 +
-set size 0.5,0.5 +
-set origin 0,0 +
-set xlabel '​X'​ +
-set ylabel '​Y'​ +
-set title '​Third'​ +
-plot '​data4.txt'​ using 1:4 w l lw 0.5 +
- +
-#Graph 4 +
-set size 0.5,0.5 +
-set origin 0.5,0 +
-set view 60,15 +
-set xtics 0,1000 +
-set ytics 1.8,0.3 +
-set ztics 1,0.2 +
-set xlabel '​X'​ offset 0,-1 +
-set ylabel '​Y'​ +
-set zlabel '​Z'​ +
-set title '​Fourth'​ +
-splot '​data4.txt'​ using 1:2:3 w l lw 0.5 lc rgb '​black'​ +
-set xtics auto +
-set ytics auto +
-set ztics auto +
- +
-unset multiplot +
- +
-</code> +
-</​solution> +
- +
-==== 06[30p] Gnuplot bar graphs ==== +
- +
-Use Gnuplot ​and the data from {{:​ep:​labs:​ep_lab5_autodata.txt|autoData.txt}} to generate separate **bar** graphs ​for the following:​ +
-  * The "//​MidPrice//"​ of all the "//​small//"​ cars. +
-  * The average fuel consumption (MPG - miles per gallon) for all the "//​large//"​ cars. +
-  * The "//​MaxPrice//"​ over the average fuel consumption for all "//​chevrolet//"​ and "//​ford//"​ cars. +
- +
-The graphs should be as complete as possible (title, axes names, etc.) +
- +
-<note tip> +
-**Hint:** Gnuplot conditional plotting. +
-</note+
- +
-<​solution -hidden>​ +
-<​code>​ +
- +
-with boxes - pentru bar chart (set style fill solid - sa fie pline) +
- +
-reset +
- +
-set size 1, 1 +
-set multiplot layout 2,2 rowsfirst +
- +
-set title 'Graph Small Cars'​ +
-set xlabel '​Car'​ +
-set ylabel '​MidPrice'​ +
-unset key +
-plot "​data5.txt"​ using (strcol(4) eq "​small"​ ? $6 : 0) w l lw 0.5 lc rgb '​red'​ + - + -set title 'Graph Fuel Consumption'​ + -set xlabel '​Car'​ + -set ylabel '​Fuel'​ + -unset key + -plot "​data5.txt"​ using (strcol(4) eq "​large"​ ? ($8 + $9)/2 : 0) w l lw 0.5 lc rgb '​blue'​ + - + -set title 'Graph Avg' + -set xlabel '​X'​ + -set ylabel '​Y'​ + -unset key + -plot "​data5.txt"​ using (strcol(2) eq '​chevrolet'​ || strcol(2) eq "​ford"​ ?$7 : 0) w l lw 0.5 lc rgb '​green',​ \ +
-     "​data5.txt"​ using (strcol(2) eq '​chevrolet'​ || strcol(2) eq "​ford"​ ? ($8 +$9)/2 : 0) w l lw 0.5 lc rgb '​orange'​ +
- +
-unset multiplot +
-</​code>​ +
-</​solution>+ 