Microsoft PowerPoint
Note: Users must procure and maintain valid licenses to integrate this commercial DE tool with the Istari Digital platform. Please contact your local IT administrator for assistance.
Supported Functions:
Getting Started
The PowerPoint integration provides support for Microsoft Office 2019 and 2021 PowerPoint, allowing users to extract data from .ppt
and .pptx
files.
Methods to Link to Istari Digital Platform
Upload: Yes
Link: No
Files Supported
The istari Digital Platform can extract from the following file types:
.ppt
.pptx
All other PowerPoint file types not supported at this time. Please submit a feature request if an important file type is not supported.
Example Files
Download Example Presentation: example_presentation.pptx
Setup for Administrators
Ensure that Istari Digital Agent and appropriate Istari Digital Software is installed on the machine.
Version Compatibility
This software was tested with Microsoft Office 2019 and Office 2021, and is intended to run in a Windows environment due to reliance on the PowerPoint interop assembly.
Function Coverage and Outputs
The PowerPoint software can produce a number of artifacts extracted from the PowerPoint presentation. The table below describes each output artifact and its type.
Route | Coverage | Artifact Content Example |
---|---|---|
Extract slides - PNG | Yes | |
Extract slides text - JSON | Yes | |
Extract whole deck - PDF | Yes | |
Extract slides - PDF | Yes | |
Extract slides - PPTX | Yes | |
Extract whole deck - ODP | Yes | |
Extract Embedded Images - PNG | Coming Soon | Coming Soon |
Detailed SDK Reference
Prerequisite: Install Istari Digital SDK and initialize Istari Digital Client per instructions here
Step 1: Upload and Extract the File(s)
Upload the file as a model
model = client.add_model(
path="example_presentation.pptx",
description="Powerpoint example Model",
display_name="Powerpoint Model Name",
)
print(f"Uploaded base model with ID {model.id}")
Extract once you have the model ID
extraction_job = client.add_job(
model_id = model.id,
function = "@istari:extract",
tool_name = "microsoft_office_powerpoint",
tool_version = "2019",
operating_system = "Windows Server 2019",
)
print(f"Extraction started for model ID {model.id}, job ID: {extraction_job.id}")
Please choose appropriate tool_name, tool_version, and operating_system for your installation of this software.
Above is an example of how to call the function
Step 2: Check the Job Status
extraction_job.poll_job()
Step 3: Retrieve Results
Example
for artifact in model.artifacts:
output_file_path = f"c:\\extracts\\{artifact.name}"
if artifact.extension in ["txt", "csv", "md", "json", "html"]:
with open(output_file_path, "w") as f:
f.write(artifact.read_text())
else:
with open(output_file_path, "wb") as f:
f.write(artifact.read_bytes())
Troubleshooting
- For general Agent and Software Troubleshooting Click Here
- If experiencing errors while extracting data from the .ppt or .pptx file, test that your file successfully opens in Microsoft Office 2019 and 2021 PowerPoint.
FAQ
- Are macro enabled documents supported? No, macro enabled documents are not supported.