Skip to content

Scripts

Overview

The integrated scripting support in AP allows data analysts and data scientists to create data transformations and analytics. AP supports the R scripting language. Python support is coming soon.

In AP, data pipelines automates the ingestion of data from multiple sources into the central data store. Scripts have access to the entire catalog of datasets and tables, allowing for integrated and sophisticatd data transformations.

AP provides a web-based editor for scripting with syntax highlighting and code completion and a console area which displays script output. This makes it straight-forward to write scripts and get access to data.

Scripts can be included as jobs in workflows in combination with data pipelines, SQL transformations and destinations, allowing for flexible and orcestrated data movement and transformation.

Script flow

The sripting editor provides a function for aquiring a connection to the AP data warehouse. The function name is connect_datawarehouse and is available for all supported scripting languages. This function provides a connection to the data warehouse, making it easy to read and write data to the data warehouse. For scripts, a typical flow is described below.

  • Retrieve data from the data warehouse with a SQL using the data warehouse connection provided by the connect_datawarehouse function.
  • The SQL query will specify aggregation and filters to retrieve the relevant data, and ensure that an appropriate set of data records is pulled into the script environment.
  • Perform data computation such maching learning, forecasting and data modelling.
  • Load the resulting data into a data frame.
  • Write the data frame to a table in the data warehouse.

This section covers the supported scripting languages.

R scripting

The R scripting language is supported and is ideal for statistical computing and graphics.

Script R editor

Python scripting

The Python scripting language is supported and is ideal for machine learning, data modelling and statistics.

Script Python editor

Natural text script generation

The script editor allows for specifying a natural text description of the outcome you want to achieve in a conversational style and have the AI-powered editor generate a script automatically. For example, ask the platform to perform outlier detection using the modified Z-score statistical algorithm, build a light-weight API data connector and provide forecasting for a particular data table.

Generate script

  1. In the script editor area, click the right side panel.
  2. Specify the natural text description in the input area.
  3. Click the generate icon or click Enter.
  4. In the script output area, click the Copy icon to copy the text.
  5. Alternatively, click the Insert icon to insert the script directly into the script area.
  6. Click the Close icon or click anywhere outside the right side panel to close it.

Natural text script generation

Manage scripts

The following section covers how to view, create, update and remove scripts.

View scripts

  1. Click Scripts in the left side menu to list all scripts.
  2. Click the name of a script to see more information.

Script overview

Create script

  1. Click the Create new button from the top-right corner.
  2. Enter the following information.

    Field Description
    Name The name of the script
    Language The script language
    Refresh schedule The interval for when to run the script (required)
    Description A description of the script
    Tags Free text tags which categorizes the script
    Script The script code

Script view

Edit script

  1. Find and click the script to edit in the list.
  2. Open the context menu by clicking the icon in the top-right corner.
  3. Click Edit.
  4. Edit values in the relevant sections.
  5. Click Save at the bottom of the section.
  6. Close the dialog by clicking the close icon in the top-left corner.

Edit script code

  1. Find and click the view to edit in the list.
  2. Click the context menu in the top-right corner.
  3. Click Edit script.
  4. In the script editor, edit the script query.
  5. Click Save.

Remove script

  1. Find and click the script to remove in the list.
  2. Open the context menu by clicking the icon in the top-right corner.
  3. Click Remove.

Manage access for script

  1. Find and click the script in the list.
  2. Open the context menu by clicking the icon in the top-right corner.
  3. Click Share.
  4. Grant appropriate access levels to users and user groups.
  5. Click Save.

Using variables in scripts

Scripts support the use of variables, allowing you to reference dynamic values within your script code. To reference a variable, use the following syntax.

${VAR_NAME}

For example, if you have a variable named START_DATE, you can use it in your script as ${START_DATE}. When the script is executed, ${START_DATE} will be replaced with the actual value of the variable.

Packages

Packages commonly used for data analysis and data science are available in the AP scripting environment.

Python

The available Python packages are listed below.

Package Description
numpy Scientific computing including arrays, matrice and math functions.
scipy Scientific and technical computing, built on numpy.
pandas Data manipulation and analysis including data frames.
scikit-learn Classical machine learning and predictive data analysis.
nltk Natural language toolkit working with human language data.
joblib Pipeline persistence and parallel computing.
tqdm Extensible progress bar for long running processes.
tabulate Print data in pretty, readable tables.
prophet Forecasts for univariate time series datasets.
python-dateutil Handling of time-series data and time zone conversions.