Jobs

Tasks related to running jobs for groups of Tamr projects

tamr_toolbox.workflow.jobs.run(project_list, *, run_apply_feedback=False, run_estimate_pair_counts=False)

Run multiple projects one after another, in the order they appear in project_list

Parameters
  • project_list (List[Project]) – A list of Tamr projects

  • run_apply_feedback (bool) – Whether to train the pair matching model or the categorization model, depending on the project type

  • run_estimate_pair_counts (bool) – Whether to run an estimate pairs job

Return type

List[Operation]

Returns

The operations that were run

Raises

NotImplementedError – Raised if run() is called on an unsupported project type
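
The behaviour described above (visit projects in list order, choose steps by project type, raise NotImplementedError for anything unsupported) can be illustrated with a small stand-in. This is a sketch only, not the tamr_toolbox implementation: FakeProject, run_sketch, the "mastering" and "categorization" type labels, and the step names (including the "update results" step) are all hypothetical, and strings stand in for the Operation objects that the real function returns.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class FakeProject:
    """Hypothetical stand-in for a Tamr Project; not the real client class."""

    name: str
    kind: str  # hypothetical type label, e.g. "mastering" or "categorization"


def run_sketch(
    project_list: List[FakeProject],
    *,
    run_apply_feedback: bool = False,
    run_estimate_pair_counts: bool = False,
) -> List[str]:
    """Visit projects in list order and record the steps each one would run."""
    operations: List[str] = []
    for project in project_list:
        if project.kind == "mastering":
            # Assumed step order for illustration only.
            if run_estimate_pair_counts:
                operations.append(f"{project.name}: estimate pairs")
            if run_apply_feedback:
                operations.append(f"{project.name}: train pair matching model")
            operations.append(f"{project.name}: update results")
        elif project.kind == "categorization":
            if run_apply_feedback:
                operations.append(f"{project.name}: train categorization model")
            operations.append(f"{project.name}: update results")
        else:
            # Mirrors the documented failure mode for unsupported types.
            raise NotImplementedError(f"Unsupported project type: {project.kind}")
    return operations


ops = run_sketch(
    [FakeProject("customers", "mastering"), FakeProject("products", "categorization")],
    run_apply_feedback=True,
)
```

With real Project objects, the equivalent call under the documented signature would be tamr_toolbox.workflow.jobs.run([project_a, project_b], run_apply_feedback=True), where project_a and project_b are placeholders for projects fetched from a Tamr instance. Both flags are keyword-only, so they cannot be passed positionally.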

tamr_toolbox.workflow.jobs.get_upstream_projects(project)

Check for upstream projects associated with a specified project

Parameters

project (Project) – The Tamr project for which associated upstream projects are retrieved

Return type

List[Project]
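
As a rough illustration of what collecting upstream projects can involve, the sketch below walks a hand-written dependency map. It makes several assumptions that this reference does not confirm: that upstream projects are followed transitively, that the result is ordered furthest upstream first, and that the project itself is excluded. The FEEDS map, the project names, and upstream_sketch are hypothetical. The real function takes a Project and returns a List[Project].

```python
from typing import Dict, List, Set

# Hypothetical dependency map: project name -> projects that feed it directly.
FEEDS: Dict[str, List[str]] = {
    "golden_records": ["mastering"],
    "mastering": ["schema_mapping"],
    "schema_mapping": [],
}


def upstream_sketch(project: str, feeds: Dict[str, List[str]]) -> List[str]:
    """Collect every project upstream of `project`, furthest upstream first."""
    ordered: List[str] = []
    seen: Set[str] = set()

    def visit(name: str) -> None:
        for parent in feeds.get(name, []):
            if parent in seen:
                continue  # already handled; also keeps the walk finite
            seen.add(parent)
            visit(parent)           # resolve the parent's own upstream first
            ordered.append(parent)  # then record the parent itself

    visit(project)
    return ordered


upstream = upstream_sketch("golden_records", FEEDS)
```

A list ordered this way could be passed to a sequential runner so that each project runs after the projects that feed it. Whether the list returned by get_upstream_projects is ordered suitably for passing straight to run is not stated here and should be checked against the library source.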