Export pages as text only

Hello everyone,

I'm trying to extract only the text from my wiki, without any wiki syntax and without any references to other files in the markup itself. I tried exporting to HTML and XML and, as expected, that didn't do the job. I need only the text for a new tool I want to implement, meaning I need only the information on my pages.
The tool would treat the wiki markup as normal text and would just turn it into regular text that appears on the page.

I would be more than happy to receive a solution for this :=)

You can get plain text by adding this query string to your URL:

/?xpage=plain&outputSyntax=plain

Here is an example:

bin/view/Sandbox/?xpage=plain&outputSyntax=plain

Thx. See also https://www.xwiki.org/xwiki/bin/view/Documentation/DevGuide/Architecture/URL%20Architecture/Standard%20URL%20Format/ for the full reference doc.

This is a good idea. I would just suggest the following more up-to-date way to get this result (using the dedicated /get/ action instead of switching the template used by the /view/ action):

bin/get/Sandbox/?outputSyntax=plain

Just one thing to keep in mind: this will not give you the whole content of the wiki, only the result of executing the page. So it all depends on whether your goal is to extract all the data in the wiki or only what users see.
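
If the new tool needs to pull the text from a script, the /get/ URL above can be built and fetched with a few lines of Python. This is only a rough sketch: the function names and the base URL in the comments are made up for illustration, and it assumes the page can be viewed without logging in.

```python
from urllib.parse import quote, urlencode
from urllib.request import urlopen


def plain_text_url(base_url: str, space: str, page: str = "") -> str:
    """Build a /get/ URL that requests the plain-text rendering of a page.

    base_url is the root of the wiki (hypothetical, e.g.
    "http://localhost:8080/xwiki"). An empty page name yields
    ".../bin/get/<space>/", the same shape as the URL in this thread.
    """
    # Percent-encode each path segment so names with spaces stay valid.
    path = "/".join(quote(part, safe="") for part in (space, page))
    query = urlencode({"outputSyntax": "plain"})
    return f"{base_url.rstrip('/')}/bin/get/{path}?{query}"


def fetch_plain_text(url: str, timeout: float = 10.0) -> str:
    """Download the rendered page and decode it as text.

    Assumes the page is readable anonymously; a protected wiki would
    need credentials added to the request.
    """
    with urlopen(url, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)
```

Calling fetch_plain_text(plain_text_url(base, "Sandbox")) with your own base URL should return the same text you see in the browser with ?outputSyntax=plain. As noted above, that is the rendered result of the page, not its stored source.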
