We are creating an ML model that needs the confluence data as training inputs. Please suggest what is the best way to load the confluence document data of my organization to the ML model. We also want to use Confluence data as a corpus of documents for RAG(Retreval Augmented Generation).
Is there any way to generate JSON formatted input files from Confluence pages?
@Disha Patel Welcome to the Atlassian community
It really depends how you need to feed the data. Every time I have done this in the past we have always used the Confluence API. This way we can consume the Confluence pages in a format that we can control. https://docs.atlassian.com/atlassian-confluence/REST/6.6.0/ This does provide you with JSON as well.
This helps. Thank you
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
Online forums and learning are now in one easy-to-use experience.
By continuing, you accept the updated Community Terms of Use and acknowledge the Privacy Policy. Your public name, photo, and achievements may be publicly visible and available in search engines.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.