Overview

Welcome to the alfred-python SDK, the official Python library for interfacing with Alfred, your intelligent process automation platform. This SDK provides a simple and efficient way to integrate Alfred's capabilities into your Python applications.

Prerequisites

Python v3.8+

Usage

Check out this simple example to get up and running:

from alfred.rest import AlfredClient
from alfred.base import Configuration

config = Configuration.default()
auth_config = {"api_key": "AXXXXXXXXXXXXXXXXXXXXXX"}

client = AlfredClient(config, auth_config)

values = client.data_points.get_values("XXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXX")
print(values)
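
To keep the API key out of source control, the `auth_config` dictionary can be built from an environment variable. The following is a minimal sketch; the variable name `ALFRED_API_KEY` and the helper `load_auth_config` are assumptions for illustration and are not part of the SDK.

```python
import os

# Hypothetical variable name; the SDK does not prescribe one.
API_KEY_ENV_VAR = "ALFRED_API_KEY"


def load_auth_config(environ=os.environ):
    """Build the auth_config dict shown above from an environment variable."""
    api_key = environ.get(API_KEY_ENV_VAR, "").strip()
    if not api_key:
        raise RuntimeError(f"Set {API_KEY_ENV_VAR} before creating the client")
    return {"api_key": api_key}
```

The result can then be passed in place of the literal dictionary, for example `AlfredClient(Configuration.default(), load_auth_config())`.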

Sessions

A Session is a mechanism designed for asynchronous file uploads. It serves as a container or grouping for files that are uploaded at different times or from various sources, but are all part of a single Job. To see more information visit our official documentation.

Get session by ID

# Get a session by ID
result = client.sessions.get("XXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXX")
print(result)

Create session

# Create a session
result = client.sessions.create()
print(result)

Jobs

A Job represents a single unit of work that groups one or more Files within Alfred. To see more information visit our official documentation.

Get job by ID

# Get a job by ID
result = client.jobs.get("XXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXX")
print(result)

Create job

job = {
   "session_id": "session-id",
   "propagate_metadata": True,
   "merge": True,
   "decompose": True,
   "metadata": {
      "key": "value",
   },
   "channel": "channel",
   "parent_file_prefix": "prefix",
   "page_rotation": 90,
   "container": "container",
   "file_name": "file-name",
   "file_names": ["file-name-1", "file-name-2"],
}

# Create a job
result = client.jobs.create(job)
print(result)

Here is a description for each valid argument when creating a job:

Parameter	Type	Description
session_id	string	Session ID
metadata	any	Metadata of the job
propagate_metadata	boolean	If `true`, ensures that the metadata provided at the Job level is attached to all the specified Files.
merge	boolean	If `true`, when all provided Files are either images or PDFs, the system combines them into a single file for the purpose of processing.
decompose	boolean	If `true`, when the provided File is a PDF, the system will decompose it into individual pages for processing.
channel	string	Channel
parent_file_prefix	string	The `parent_file_prefix` parameter is used to specify a virtual folder destination for the uploaded files.
page_rotation	number	Page rotation
container	string	Virtual container where the referenced remote file is located.
file_name	string	Unique name of the file within an object storage source.
file_names	string[]	Array of unique names of the files within an object storage source.
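
A small client-side check can catch misspelled or mistyped parameters before the request is sent. The sketch below mirrors the table above; `validate_job_payload` and `JOB_FIELD_TYPES` are hypothetical helpers, not part of the SDK, and the server may apply additional rules that are not reflected here.

```python
from numbers import Number

# Expected Python types, taken from the parameter table above.
JOB_FIELD_TYPES = {
    "session_id": str,
    "metadata": object,  # documented as "any"
    "propagate_metadata": bool,
    "merge": bool,
    "decompose": bool,
    "channel": str,
    "parent_file_prefix": str,
    "page_rotation": Number,
    "container": str,
    "file_name": str,
    "file_names": list,
}


def validate_job_payload(job):
    """Return a list of problems found in a job dict (empty if none)."""
    problems = []
    for key, value in job.items():
        expected = JOB_FIELD_TYPES.get(key)
        if expected is None:
            problems.append(f"unknown parameter: {key}")
        elif key == "page_rotation" and isinstance(value, bool):
            # bool is a subclass of int in Python, so reject it explicitly.
            problems.append("page_rotation: expected number, got bool")
        elif not isinstance(value, expected):
            problems.append(
                f"{key}: expected {expected.__name__}, got {type(value).__name__}"
            )
    names = job.get("file_names")
    if isinstance(names, list) and not all(isinstance(n, str) for n in names):
        problems.append("file_names: every entry must be a string")
    return problems
```

Calling `validate_job_payload(job)` before `client.jobs.create(job)` and raising if the list is non-empty keeps typos such as `sesion_id` from reaching the API.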

Files

A File is an individual document or data unit undergoing specialized operations tailored for document analysis and management. To see more information visit our official documentation.

Get file by ID

# Get a file by ID
result = client.files.get("XXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXX")
print(result)

Download file by ID

# Download a file by ID
result = client.files.download("XXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXX")

with open(result.get("original_name"), "wb") as f:
   f.write(result.get("file").getvalue())
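
Because `original_name` comes from the server, it can be worth reducing it to a bare file name before writing to disk. The following sketch assumes only what the example above shows: that the result behaves like a dictionary with an `original_name` string and a `file` object exposing `getvalue()`. The helper `save_download` and the fallback name `download.bin` are hypothetical.

```python
from pathlib import Path


def save_download(result, target_dir="."):
    """Write a download result into target_dir and return the path written."""
    # Keep only the final path component so a name such as "../x"
    # cannot escape target_dir.
    name = Path(str(result.get("original_name") or "")).name
    if name in ("", ".", ".."):
        name = "download.bin"  # fallback when no usable name is provided
    destination = Path(target_dir) / name
    destination.write_bytes(result["file"].getvalue())
    return destination
```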

Upload remote file

# Upload a remote file
result = client.files.upload({
   "url": "<File URL>",
   "metadata": {}
})
print(result)

Upload a local file

with open("<Path to local file>", "rb") as upload_file:
   result = client.files.upload_file({
      "file": upload_file,
      "session_id": "XXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXX",
      "metadata": {}
   })
   print(result)
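
Since a Session groups files that belong to a single Job, a common pattern is to upload several local files under the same session ID. The sketch below simply repeats the `upload_file` call shown above for each path; `upload_files_to_session` is a hypothetical helper, and the shape of each returned result is whatever the SDK returns.

```python
def upload_files_to_session(client, session_id, paths, metadata=None):
    """Upload each local path into the same session and collect the results."""
    results = []
    for path in paths:
        # Open in binary mode, as in the single-file example above.
        with open(path, "rb") as upload_file:
            results.append(
                client.files.upload_file({
                    "file": upload_file,
                    "session_id": session_id,
                    "metadata": metadata or {},
                })
            )
    return results
```

Uploads run one after another, and an exception from any call stops the loop, so the caller decides how to handle partial uploads.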

Data Points

Data Points are the core of Alfred's platform and represent data that you want to extract. To see more information visit our official documentation.

Important

Data Points were previously known as Metadata.

Get Data Point by File ID

# Get a data point by file ID
result = client.data_points.get_values("XXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXX")
print(result)

Configuration

This section provides detailed instructions and guidelines for configuring the SDK to interface effectively with the target API.

Retry Policy

In this SDK, we implement automatic retries to enhance the reliability of network requests. However, to maintain the integrity of data transactions, retries are only enabled for HTTP methods that are considered idempotent. Idempotent methods are those that can be called multiple times and leave the server in the same state as a single call. Thus, retries are applied only to the following HTTP methods:

GET: Retrieves data from the server without changing any state.
PUT: Updates a resource in a way that it can be repeatedly updated without changing the outcome beyond the initial application.
DELETE: Removes a resource; repeating the deletion of the same resource has no further effect.
HEAD: Fetches metadata about a resource without side-effects.
OPTIONS: Retrieves supported communication options for a given URL or server without causing any side effects.

For non-idempotent methods like POST and PATCH, the SDK does not perform retries by default because doing so could potentially result in unwanted side effects or duplicate operations. If you need to enable retries for these methods under specific circumstances, please handle them cautiously in your application logic.
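
The policy described above can be illustrated with a short sketch. This is not the SDK's internal implementation: the function `call_with_retries`, the choice of `ConnectionError` as the retryable failure, the attempt count, and the exponential backoff are all assumptions made for illustration.

```python
import time

# Methods listed above as safe to retry.
IDEMPOTENT_METHODS = frozenset({"GET", "PUT", "DELETE", "HEAD", "OPTIONS"})


def call_with_retries(method, send, max_attempts=3, base_delay=0.5, sleep=time.sleep):
    """Call send(); retry on ConnectionError only for idempotent methods."""
    # Non-idempotent methods (POST, PATCH, ...) get exactly one attempt.
    attempts = max(1, max_attempts) if method.upper() in IDEMPOTENT_METHODS else 1
    for attempt in range(1, attempts + 1):
        try:
            return send()
        except ConnectionError:
            if attempt == attempts:
                raise
            # Exponential backoff: base_delay, 2 * base_delay, 4 * base_delay, ...
            sleep(base_delay * 2 ** (attempt - 1))
```

Here `send` is any zero-argument callable that performs the request. A POST that must be retried in application code would typically need a safeguard such as checking whether the resource was already created before sending it again.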

Real-time Events

The alfred-python library provides a way to listen to events emitted by Alfred IPA in real-time through a websockets implementation. This feature is particularly useful when you need to monitor the progress of a Job, File, or any other event that occurs within the Alfred platform. To see more information visit our official documentation.

Getting started

To get started, you need to create an instance of the AlfredRealTimeClient class.

from alfred import AlfredRealTimeClient
from alfred.base import Configuration
from alfred import AuthConfiguration

config = Configuration.default()

auth_config = AuthConfiguration({
    "api_key": "AXXXXXXXXXXXXXXXXXXXXXX"
})

client = AlfredRealTimeClient(config, auth_config, verbose=True)

File Events

These events are specifically designed to respond to a variety of actions or status changes related to Files. To see more details about File events, visit our official documentation.

# Listen to all File events
client.on_file_event(lambda data: print(data))

Job Events

Alfred performs asynchronous document classification, extraction, and indexing on a variety of file types. The events detailed here offer insights into how a Job progresses, fails, retries, or completes its tasks. To see more details about Job events, visit our official documentation.

# Listen to all Job events
client.on_job_event(lambda data: print(data))

Specific Events

The `on` method lets you subscribe to a single event from the list of supported events. It's particularly useful when you want to listen to one specific event instead of all events of a particular type.

Here's an example of how to listen to a specific event:

from alfred.base import FileEvent, JobEvent

# Listen to the specific File Done event
client.on(FileEvent.FILE_DONE_EVENT.value, lambda data: print(data))

# Listen to the specific Job Finished event
client.on(JobEvent.JOB_FINISHED_EVENT.value, lambda data: print(data))
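
When a script needs to block until a particular event arrives, the callback can be paired with a `threading.Event`. The sketch below relies only on the `client.on(event_name, callback)` call shown above; `make_event_waiter` is a hypothetical helper, and it assumes callbacks may be invoked from a different thread than the one that waits.

```python
import threading


def make_event_waiter(client, event_name):
    """Register a handler for event_name and return a wait(timeout) function."""
    received = threading.Event()
    payloads = []

    def _handler(data):
        payloads.append(data)
        received.set()

    client.on(event_name, _handler)

    def wait(timeout=None):
        # Returns the first payload seen, or None if the timeout expires first.
        return payloads[0] if received.wait(timeout) else None

    return wait
```

For example, `wait = make_event_waiter(client, JobEvent.JOB_FINISHED_EVENT.value)` followed by `wait(timeout=300)` would return the first Job Finished payload, or `None` after five minutes.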

Here is a list of all supported events:

Event Type	Event Name	Description
FileEvent	`FILE_ADD_TO_JOB_EVENT`	Triggered when a file is added to a job for processing.
FileEvent	`FILE_CATEGORY_CREATE_EVENT`	Occurs when a new category is created for a file.
FileEvent	`FILE_CATEGORY_DELETE_EVENT`	Signals the deletion of a file's category.
FileEvent	`FILE_CHANGE_TAG_EVENT`	Indicates a change in the tag associated with a file.
FileEvent	`FILE_DONE_EVENT`	Marks the completion of file processing.
FileEvent	`FILE_EXTRACTED_DATA_CREATE_EVENT`	Triggered when new data is extracted from a file.
FileEvent	`FILE_EXTRACTED_DATA_DELETE_EVENT`	Occurs when extracted data from a file is deleted.
FileEvent	`FILE_FAILED_EVENT`	Indicates a failure in file processing.
FileEvent	`FILE_MOVE_EVENT`	Signals the movement of a file within the system.
FileEvent	`FILE_MOVE_TO_PENDING_EVENT`	Triggered when a file is moved to a pending state.
FileEvent	`FILE_MOVE_TO_RECYCLE_BIN_EVENT`	Indicates movement of a file to the recycle bin.
FileEvent	`FILE_PROPERTY_CREATE_EVENT`	Reflects the creation of a file property.
FileEvent	`FILE_PROPERTY_DELETE_EVENT`	Signals the deletion of a file property.
FileEvent	`FILE_REMOVE_TAG_EVENT`	Signals the removal of a tag from a file.
FileEvent	`FILE_STATUS_UPDATE_EVENT`	Indicates an update in the file's status.
FileEvent	`FILE_UPDATE_EVENT`	Triggered when a file is updated in any manner.
JobEvent	`JOB_CREATE_EVENT`	Triggered when a new job is instantiated for file operations.
JobEvent	`JOB_EXCEEDED_RETRIES_EVENT`	Fires when a job exceeds the maximum number of retry attempts for a stage.
JobEvent	`JOB_FAILED_EVENT`	Occurs when a job halts due to an unrecoverable error.
JobEvent	`JOB_FINISHED_EVENT`	Triggered when a job successfully completes all workflow stages.
JobEvent	`JOB_INVALID_EVENT`	Fires when a job fails initial validation of input files or parameters.
JobEvent	`JOB_RETRY_EVENT`	Triggered when a job retries a stage after a recoverable failure.
JobEvent	`JOB_STAGE_UPDATE_EVENT`	Occurs when a job transitions from one workflow stage to another.
JobEvent	`JOB_START_EVENT`	Triggered when a job begins its workflow and state machine.

Development Setup

Setting up the development environment

To contribute to alfred-python, you'll need to set up a Python development environment. We recommend using Conda, a popular package and environment management system. Here’s how to set it up:

Install Conda: If you do not have Conda installed, you can download and install it from Miniconda (a minimal installer) or Anaconda (a full-featured distribution).
Create a new Conda environment: Open your terminal and run the following command to create a new environment named alfred-python:
```
conda create --name alfred-python python=3.8 -y
```
Activate the environment: Activate the newly created environment by running:
```
conda activate alfred-python
```
Install dependencies: With the environment activated, install the required dependencies by running:
```
pip install -r requirements.txt
```

This sets up a basic Python environment tailored for development purposes.

Optional: Setting up a testing environment

For testing and development, it's often useful to install the package in a way that reflects changes in real-time. To do this, you can install the package in editable mode. Here’s how to create a separate environment for testing and install the SDK:

Create a testing environment: It's a good practice to separate testing and development environments to avoid conflicts. Create a new environment named alfred-python-test:
```
conda create --name alfred-python-test python=3.8 -y
```
Activate the testing environment:
```
conda activate alfred-python-test
```
Install the SDK in editable mode: Navigate to the root directory of the alfred-python project and run:
```
pip install --editable .
```

Building the Project

To package alfred-python into distributable formats such as source archives and wheels, you will need to use the build module, a modern tool for building packages that adheres to PEP 517. Follow these steps to build the project:

Preparing the build environment

Before building the project, ensure that your development environment is activated and up-to-date. If you don’t have an environment set up, please refer to the Development Setup section to create and activate one.

Installing the build tool

With your environment ready, install the build package. This package provides a simple, reliable way to build your project. Install it using pip:

pip install -U build

Running the build process

Once build is installed, you can generate the build artifacts by running the following command from the root directory of your project:

python -m build

This command will produce a source distribution (`.tar.gz`) and a wheel file (`.whl`) in the dist/ directory. These files are what you would upload to a package index like PyPI, or distribute to other developers.
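
To confirm that both artifacts were produced, the contents of dist/ can be inspected with a few lines of Python. This is an optional sketch; `list_build_artifacts` is a hypothetical helper and not part of the project.

```python
from pathlib import Path


def list_build_artifacts(dist_dir="dist"):
    """Group the files in dist_dir into source distributions and wheels."""
    dist = Path(dist_dir)
    return {
        "sdist": sorted(p.name for p in dist.glob("*.tar.gz")),
        "wheel": sorted(p.name for p in dist.glob("*.whl")),
    }
```

An empty list under either key after running `python -m build` suggests that the corresponding build step did not complete.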

tagshelfsrl/alfred-python