GPC Pre-Installed Packages
The General Purpose Container (GPC) is built with several tools pre-installed. You can call any of these tools from your scripts, and you can specify additional apt-get and Python packages to install at order runtime.
Pre-installed apt-get tools
The following apt-get packages come pre-installed in DataKitchen's GPC:
| Apt-get tools | | |
|---|---|---|
| build-essential | jq | python3 |
| curl | libncurses5-dev | python3-dev |
| dialog | libpq-dev | python3-pip |
| freetds-bin | libxml2-dev | rsync |
| freetds-dev | libxslt1-dev | tar |
| git | nano | tzdata |
| gpgv | net-tools | wget |
| | python-distribute | |
For the GPC container specific to Python3.9, the following packages are also installed:
python3.9, python3.9-distutils, python3.9-dev, software-properties-common
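A script can confirm that the command-line tools it relies on are actually on the `PATH` before it calls them. The following is a minimal sketch using only the Python standard library. The `EXPECTED_TOOLS` list and the `missing_tools` helper are hypothetical names chosen for illustration, and the executable names are assumptions, since an apt package name does not always match the command it installs.

```python
import shutil

# Executable names assumed to be provided by some of the apt packages above.
# Package names and command names do not always match (for example,
# build-essential provides compilers rather than a "build-essential" command),
# so adjust this list to the commands your script actually calls.
EXPECTED_TOOLS = ["curl", "git", "jq", "nano", "rsync", "tar", "wget", "python3"]


def missing_tools(tools):
    """Return the entries of `tools` that are not found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools(EXPECTED_TOOLS)
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All expected tools are available.")
```

If a tool is reported as missing, it may need to be added to the packages installed at order runtime.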
Pre-installed Python packages
The following Python packages come pre-installed in the GPC. Each entry lists the package name and its version specifier.
| Python packages | | |
|---|---|---|
| awscli>=1.18.137 | google-cloud-bigquery>=3.11.4 | pyyaml>=6.0.1 |
| azure-cli>=2.11.1 | google-cloud-storage>=2.10.0 | requests>=2.31.0 |
| beautifulsoup4>=4.12.2 | jupyter>=1.0.0 | scikit-learn>=1.3.0 |
| boto>=2.49.0 | oauth2client>=4.1.3 | scipy>=1.11.2 |
| boto3>=1.28.4 | openpyxl>=3.1.2 | setuptools>=50.3.0 |
| Cython>=3.0.2 | matplotlib>=3.7.2 | simple-salesforce>=1.12.4 |
| cryptography>=3.4.5 | numpy>=1.25.2 | sqlalchemy==1.4.46 |
| DKUtils>=2.11.6 | pandas>=2.1.0 | traitlets<=5.9.0 |
| google-api-python-client>=2.97.0 | paramiko>=3.3.1 | tableauserverclient>=0.25 |
| google-cloud>=0.34.0 | psycopg2>=2.9.7 | xlrd>=2.0.1 |
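Because most entries are minimum-version specifiers rather than exact pins, the version actually installed in a given container may be newer than the number shown. One way to check from inside a script is sketched below. It assumes Python 3.8 or later (for `importlib.metadata`), and `installed_versions` is a hypothetical helper name, not part of the GPC or DKUtils.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The distribution is not installed in this environment.
            found[name] = None
    return found


if __name__ == "__main__":
    # A few names taken from the table above; extend the list as needed.
    names = ["pandas", "numpy", "requests", "boto3"]
    for name, ver in installed_versions(names).items():
        print(f"{name}: {ver or 'not installed'}")
```

Running this at the start of a script gives a record of the versions in use, which can help when a result depends on a specific library release.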
DKUtils
The DKUtils Python package includes additional libraries pre-installed for use in a GPC container node.
| Python packages | |
|---|---|
| dataclasses==0.6 | Sphinx==4.5.0 |
| events_ingestion_client==1.0.0 | sphinx-rtd-theme==0.4.3 |
| jira==2.0.0 | google-api-python-client==1.10.1 |
| pandas==1.1.2 | google-auth-httplib2==0.0.4 |
| paramiko==2.10.4 | google-auth-oauthlib==0.4.2 |
| scp==0.13.2 | sqlalchemy==1.4.27 |