Seeking Assistance with Reading Indexed Data from Attachments (PDF, DOCX, XLSX) Using Jira API

Get involved · January 5, 2024

Hello,

I am currently working on a project where I'm using the Jira API to read data indexed in attached files (such as PDF, DOCX, XLSX, etc.). I'm able to successfully retrieve data from text files. However, when I try to fetch data from other file types like DOCX, PDF, and XLSX, I'm unable to retrieve the correct data. It seems I cannot load content in any language, including English, from these files. The loaded content does not include the main text of the documents, but rather seems to only contain metadata related to these file extensions.

However, I am aware that there are ways to correctly load this data through Jira plugins. Could you advise me on how I should make API calls to correctly retrieve the indexed contents of these file types? Here is an example of the code I am using:

fileContentUrl = "https://" + MASTERURL + ".atlassian.net/rest/api/3/attachment/content/" + file['id']
response_content = requests.request(
"GET",
fileContentUrl,
headers=headers,
auth=auth
)
response_content.encoding = 'utf-8'
print(response_content.text)

Any insights or suggestions on how to resolve this issue would be greatly appreciated. Thank you!

Forums

Product Q&A

Community resources

Support

Top groups

Community resources

Support

Learn

Community resources

Support

Events

Community resources

Support

Seeking Assistance with Reading Indexed Data from Attachments (PDF, DOCX, XLSX) Using Jira API

1 answer

Suggest an answer

Was this helpful?

Thanks!

TAGS

Community showcase

Atlassian Community Events