This repository contains example data and scripts showcasing data collection and processing for projects of the Software Development, Maintenance and Operations.
The three projects are:
- Project 1: Developer de-duplication
- Project 2: Temporal centrality and monitoring metrics
- Project 3: Temporal state detection and anomaly detection
project1devs/: Directory with data for Project 1devs.csv: List of developers mined from eShopOnContainersProjectdevs_similarity.csv: Similarity tests for each pair of developersdevs_similarity_t=0.7.csv: Similarity tests for each pair of developers with similarity threshold 0.7
project1developers.py: Script demonstrating mining developer information and Bird heuristic to determine duplicate developersproject2&3traces/: Directory with data for Projects 2 & 3train-ticket-traces.csv: CSV file with traces for train-ticket system from an open datasetproject2edgeflow.csv: CSV file providing a list of real-time service calls for Project 2project3snapshots.csv: CSV file providing snapshot networks for service calls grouped by intervals for Project 3project2_katz_exponential.csv: CSV file with Temporal Katz Centrality with exponential decayproject2_katz_constant.csv: CSV file with Temporal Katz Centrality with constant decayproject2_katz_truncated.csv: CSV file with Temporal Katz Centrality with truncated exponential decay
project2&3data.py: Script processing raw traces into formats required by Projects 2&3project2centrality.py: Script demonstrating temporal katz centrality on temporal edge flow network of microservice calls for Project 2project3anomaly.py: Script demonstrating temporal state detection on temporal networks of microservice calls for Project 3requirements.txt: List of used libraries with specified versions
The scripts were developed and tested on a Mac (UNIX) environment with Python 3.10. There should be no compatibility issues with running the scripts on Windows.
The versions of imported libraries are provided in requirements.txt.
It is recommended to create a Python virtual environment and install the exact versions there.