Hi guys,
I am trying to build a conversational chatbot for my org based on our confluence space and for that, I need to extract the whole space and then build a RAG implementation pipeline, which I am planning to build on Azure.
Please tell me how to effectively extract the whole page tree. I don't have admin access, so the option to export the whole space is not available to me.
Are there any other ways that have worked for you?
If something can be done, it can usually be done with REST API Calls.
Its been a while since I've done any Confluence REST Calls. This chart comes from makeing a call for the spaces in my instance, then the pages within each Space. There is plenty of parent/child/verison/author/date type information available.
My example is somewhat limited; I don't have access to any large confluence installations at the moment.
The Confluence REST API documentation is here: https://developer.atlassian.com/cloud/confluence/rest/v2/intro/#about
Hope this gets you started. You will need an API key, but I do not think Admin Rights are needed.
Online forums and learning are now in one easy-to-use experience.
By continuing, you accept the updated Community Terms of Use and acknowledge the Privacy Policy. Your public name, photo, and achievements may be publicly visible and available in search engines.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.