Download Observation Images Documentation
This file serves to access and download the observation images from the provided URL of each observation.
This file saves the downloaded JPG images to the data/images/raw
directory for further processing.
The observations are sourced from the directory data/observations
.
Each image is saved in the following format
Attributes:
Name | Type | Description |
---|---|---|
root_path |
str
|
The absolute path to the root of the project directory. |
csv_path |
str
|
The complete path (absolute + relative) to the observation data in |
raw_image_path |
str
|
The save location of the downloaded images. The directory is |
length |
int
|
This value represents the quantity of observations in the dataset. Used in the calculation of the progress bar. |
count |
int
|
This value represents the total images downloaded. Used in the calculation of the progress bar. |
create_dataset(observations)
This method created the DataFrame of the dataset and pre-processes it for image extraction.]
Parameters:
Name | Type | Description | Default |
---|---|---|---|
observations |
str
|
The name of the CSV file to convert into a DataFrame and pre-process for image extraction. |
required |
Returns:
Type | Description |
---|---|
DataFrame
|
The pre-processed DataFrame containing observations ready for image extraction. |
Source code in raw_data_access.py
29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 |
|
image_download(x)
This method extracts the URL from an observation, and downloads the image.
This method is used in conjunction with the DataFrame.apply method within a lambda expression.
This method saves the image in the following format: images/raw
save path.
Each image downloaded increases the global count variable to update the status bar.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
x |
row
|
The row of the dataframe. |
required |
Source code in raw_data_access.py
49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 |
|
status_bar_update()
This method updates the visual status bar to represent the status of the image download.
Source code in raw_data_access.py
74 75 76 77 78 79 80 81 82 83 84 |
|