This shows you the differences between two versions of the page.
ewis:laboratoare:11 [2021/05/05 13:03] alexandru.predescu |
ewis:laboratoare:11 [2023/04/26 19:18] (current) alexandru.predescu [Resources] |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== Lab 11. Project / Case Study ====== | + | ===== Lab 11. Project / Case Study ===== |
This week is for working on the laboratory project and preparing the final presentation. | This week is for working on the laboratory project and preparing the final presentation. | ||
- | ===== Requirements ===== | + | ==== Requirements ==== |
Select an algorithm and a dataset to make a presentation / case study of 10-25 slides. | Select an algorithm and a dataset to make a presentation / case study of 10-25 slides. | ||
Line 11: | Line 11: | ||
<note important> | <note important> | ||
An algorithm can be selected by at most 3 students. | An algorithm can be selected by at most 3 students. | ||
- | If you choose an algorithm that was used in the Laboratory, you will have to use an additional dataset and compare the results. | + | * If you choose an algorithm that was used in the Laboratory, you will have to use another dataset. |
+ | * If you choose another algorithm, you can also use a dataset from the Laboratory. | ||
</note> | </note> | ||
The structure of the presentation should be as follows: | The structure of the presentation should be as follows: | ||
- | ==== Introduction ==== | + | === Introduction === |
- | Describe the context of your case study. Introduce the algorithm and possible applications | + | Describe the context of your case study. Introduce the algorithm and possible applications in real world scenarios. |
- | ==== Methodology ==== | + | === Methodology === |
- | Describe the algorithm in more detail (text, pseudocode, flowcharts) and the way it can be used on a real data set. | + | Describe the algorithm in more detail (text, pseudocode, flowcharts), with use cases and methods for working with real data sets. |
- | ==== Case Study ==== | + | === Case Study === |
Describe the selected data set(s) and the implementation of the algorithm in Python. | Describe the selected data set(s) and the implementation of the algorithm in Python. | ||
Present the results (text and visual output) and your observations. | Present the results (text and visual output) and your observations. | ||
- | ==== Conclusion ==== | + | === Conclusion === |
Describe the outcome of the case study and applications in real world scenarios in broader terms. | Describe the outcome of the case study and applications in real world scenarios in broader terms. | ||
- | ===== Resources ===== | + | === Resources === |
- | ==== Algorithms and Libraries ==== | + | Present the resources used for the case study: algorithm, data sets and other references |
+ | |||
+ | ==== Resources ==== | ||
+ | |||
+ | === Algorithms and Libraries === | ||
You can refer to the Laboratories or use any other implementation in Python. | You can refer to the Laboratories or use any other implementation in Python. | ||
Line 41: | Line 46: | ||
Useful libraries for data science in Python include: | Useful libraries for data science in Python include: | ||
- | === [Machine Learning] === | + | **Machine Learning** |
* [[https://scikit-learn.org/stable/|Scikit-learn]] | * [[https://scikit-learn.org/stable/|Scikit-learn]] | ||
+ | |||
+ | **Deep Learning** | ||
* [[https://www.tensorflow.org/learn|TensorFlow]] | * [[https://www.tensorflow.org/learn|TensorFlow]] | ||
* [[https://keras.io/|Keras]] | * [[https://keras.io/|Keras]] | ||
- | === [Data Processing] === | + | **Data Processing** |
* [[https://pandas.pydata.org/|Pandas]] | * [[https://pandas.pydata.org/|Pandas]] | ||
* [[https://numpy.org/|NumPy]] | * [[https://numpy.org/|NumPy]] | ||
* [[https://www.scipy.org/|SciPy]] | * [[https://www.scipy.org/|SciPy]] | ||
- | === [Visualizations] === | + | **Visualizations** |
* [[https://matplotlib.org/|matplotlib]] | * [[https://matplotlib.org/|matplotlib]] | ||
* [[https://seaborn.pydata.org/|Seaborn]] | * [[https://seaborn.pydata.org/|Seaborn]] | ||
Line 61: | Line 68: | ||
* [[https://machinelearningmastery.com/clustering-algorithms-with-python/|Machine Learning Mastery]] | * [[https://machinelearningmastery.com/clustering-algorithms-with-python/|Machine Learning Mastery]] | ||
- | ==== Datasets ==== | + | === Datasets === |
You can use any open dataset. Some common sources include: | You can use any open dataset. Some common sources include: | ||
- | === [Dataset aggregators for Machine Learning projects] === | + | **Dataset aggregators for Machine Learning projects** |
* [[https://www.kaggle.com/datasets|Kaggle]] | * [[https://www.kaggle.com/datasets|Kaggle]] | ||
* [[https://data-flair.training/blogs/machine-learning-datasets/|Data Flair]] | * [[https://data-flair.training/blogs/machine-learning-datasets/|Data Flair]] | ||
* [[https://www.openml.org/search?type=data|OpenML]] | * [[https://www.openml.org/search?type=data|OpenML]] | ||
+ | * [[https://archive.ics.uci.edu/ml/datasets.php|UCI Machine Learning Repository]] | ||
- | === [Open Data] === | + | **Open Datasets** |
* [[https://data.europa.eu/data/datasets?locale=data&minScoring=0|data.europa.eu]] | * [[https://data.europa.eu/data/datasets?locale=data&minScoring=0|data.europa.eu]] | ||
* [[https://data.worldbank.org/|World Bank Open Data]] | * [[https://data.worldbank.org/|World Bank Open Data]] |