Date Extractor Python Package
This project provides a Python utility package to extract specific components (year, month, day, and time) from ISO 8601 date strings. The input can either be in a string format or a pandas series. These functions can be applied individually or integrated with data analysis workflows in Pandas, simplifying date manipulation and analysis.
Setup Instructions
1. Use Conda Environment
Make sure you have conda installed. If you don’t have it, install it from here.
If needed, set up conda:
bash ${HOME}/Downloads/Miniforge3.sh -b -p "${HOME}/miniforge3"
source "${HOME}/miniforge3/etc/profile.d/conda.sh"
conda activate
conda init
Create and activate the conda environment:
conda env create -f environment.yml
conda activate date_extractor_env
2. Install Poetry
Poetry is the tool used to manage dependencies. You need to install Poetry globally.
Run the following command to install Poetry:
curl -sSL https://install.python-poetry.org | python3 -
3. Set Up the Project
Clone the repository:
git clone https://github.com/yourusername/date_extractor_mds.git
cd date_extractor_mds
Running Tests
1. Run Poetry Install
In the root of the folder, run in terminal to install dependencies:
poetry install
2. Run Tests
Install Dependencies
Run the follow commands sequentially to check that the tests pass, and to check test coverage:
poetry run pytest
poetry run pytest --cov=src/date_extractor_mds
poetry run pytest --cov-branch --cov=src/date_extractor_mds
Package Installation
To install the package, use the following command:
$ pip install date_extractor_mds
Usage
extract_year: Extracts the year as a four-digit integer from an ISO 8601 date string.
from date_extractor_mds import extract_year date_string = "2025-02-02T14:30:00" year = extract_year(date_string) print(year) # Output: 2025
extract_month: Retrieves the month as an integer (1-12) from the ISO 8601 date.
from date_extractor_mds import extract_month date_string = "2025-02-02T14:30:00" month = extract_month(date_string) print(month) # Output: 2
extract_day: Captures the day as an integer (1-31) from the ISO 8601 date.
from date_extractor_mds import extract_day date_string = "2025-02-02T14:30:00" day = extract_day(date_string) print(day) # Output: 2
extract_time: Returns the time component as a string in hh:mm:ss format.
from date_extractor_mds import extract_time date_string = "2025-02-02T14:30:00" time = extract_time(date_string) print(time) # Output: 14:30:00
Position in Python Ecosystem:
This package complements existing Python libraries like datetime and pandas by offering specialized, lightweight utilities focused solely on ISO 8601 string manipulation. While datetime provides similar functionality, this package simplifies usage by bypassing full date parsing for basic extraction tasks, increasing performance in large-scale data analysis.
Contributing
Interested in contributing? Check out the Contributing Guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.
Contributors
Rashid Mammadov
Derek Rodgers
Yibin Long
Fazeeia Mohammed
License
The Date Extractor Python Package was created by Rashid Mammadov, Derek Rodgers, Yibin Long, and Fazeeia Mohammed. It is licensed under the terms of the MIT license, linked here.
Credits
date_extractor_mds was created with cookiecutter and the py-pkgs-cookiecutter template.