GPC Pre-Installed Packages

The General Purpose Container (GPC) is built with several tools pre-installed. You can call any of these tools from your scripts, and you can specify additional apt-get and Python packages to be installed at order runtime.

Pre-installed apt-get tools

The following apt-get packages come pre-installed in DataKitchen's GPC:

Apt-get tools
build-essential
curl
dialog
freetds-bin
freetds-dev
git
gpgv
jq
libncurses5-dev
libpq-dev
libxml2-dev
libxslt1-dev
nano
net-tools
python-distribute
python3
python3-dev
python3-pip
rsync
tar
tzdata
wget

For the GPC container specific to Python 3.9, the following packages are also installed:

  • python3.9
  • python3.9-distutils
  • python3.9-dev
  • software-properties-common
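Before a script relies on one of these tools, it can help to confirm that the command is actually on the PATH inside the container. The following is a minimal sketch using only the Python standard library. The command names in `EXPECTED_COMMANDS` are an assumption: an apt package name does not always match the executable it installs, so adjust the list to the commands your script calls.

```python
import shutil

# Assumed executable names for a few of the apt-get packages listed above.
# Package names and command names can differ, so edit this list as needed.
EXPECTED_COMMANDS = ["curl", "git", "jq", "nano", "rsync", "tar", "wget", "python3"]


def missing_commands(commands):
    """Return the commands that cannot be found on PATH."""
    return [cmd for cmd in commands if shutil.which(cmd) is None]


if __name__ == "__main__":
    missing = missing_commands(EXPECTED_COMMANDS)
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All expected commands are available.")
```

Running this at the start of a script gives an early, readable failure message instead of a "command not found" error partway through a run.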

Pre-installed Python packages

The following Python packages come pre-installed in the GPC. Each entry lists the package name and its version specifier.

Python packages
awscli>=1.18.137
azure-cli>=2.11.1
beautifulsoup4>=4.12.2
boto>=2.49.0
boto3>=1.28.4
Cython>=3.0.2
cryptography>=3.4.5
DKUtils>=2.11.6
google-api-python-client>=2.97.0
google-cloud>=0.34.0
google-cloud-bigquery>=3.11.4
google-cloud-storage>=2.10.0
jupyter>=1.0.0
oauth2client>=4.1.3
openpyxl>=3.1.2
matplotlib>=3.7.2
numpy>=1.25.2
pandas>=2.1.0
paramiko>=3.3.1
psycopg2>=2.9.7
pyyaml>=6.0.1
requests>=2.31.0
scikit-learn>=1.3.0
scipy>=1.11.2
setuptools>=50.3.0
simple-salesforce>=1.12.4
sqlalchemy==1.4.46
traitlets<=5.9.0
tableauserverclient>=0.25
xlrd>=2.0.1
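Because most entries above give a minimum version rather than an exact one, the version installed in a given container may be newer than the number shown. A script can report what is actually installed with the standard library's `importlib.metadata` (available in Python 3.8 and later). This is a sketch only: the `PACKAGES` list is an illustrative subset of the names above, and `installed_versions` is a hypothetical helper, not part of the GPC or DKUtils.

```python
from importlib import metadata

# Illustrative subset of the pre-installed packages; extend as needed.
PACKAGES = ["boto3", "numpy", "pandas", "requests"]


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions(PACKAGES).items():
        print(f"{name}: {version or 'not installed'}")
```

Logging these versions at the start of a run makes it easier to reproduce results later if the container image is rebuilt with newer releases.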

DKUtils

The DKUtils Python package brings in the following additional libraries, which are pre-installed for use in a GPC container node.

Python packages
dataclasses==0.6
events_ingestion_client==1.0.0
jira==2.0.0
pandas==1.1.2
paramiko==2.10.4
scp==0.13.2
Sphinx==4.5.0
sphinx-rtd-theme==0.4.3
google-api-python-client==1.10.1
google-auth-httplib2==0.0.4
google-auth-oauthlib==0.4.2
sqlalchemy==1.4.27