I have now been testing XWiki both stable and LTS versions for a long time, but I am just starting to grasp what happens to the URLs contained in the pages of the imported spaces.
The context : we test XWiki latest up to date versions of the 15.x and 16.6.0. This past August I removed and did a fresh import of about 50 spaces containing altogether several hundreds of pages, and retrieving the full logs generated while using Filter Streams Converter in the process (select/copy/paste each to a text file, which isn’t trivial).
After importing a Confluence Space and while doing the conversion with Filter Streams Converter lots of error messages come up, relating to missing pages in specific spaces.
The missing pages appear to be related to the URLs in the pages that are being converted.
In other words, the pages imported in a given space contain URLs pointing to other spaces. These URLs do refer to existing pages, but
these URLs appear to have been transcribed the wrong way, thus pointing to pages yet to be created : which is wrong. These pages do exist
but not with the URL obtained from the migration into XWiki.
I have been studying what happens. To do this, I looked for the name of a page declared as missing as per the log, using the XWiki search engine.
Then I found the page, and compared the two URLs : the one found in the imported page, and the one which is the right address for the page.
This is what happens in the construction of the URLs:
The beginning of the URL imported in pages should be:
https://mydomaine.com/xwiki/bin/view/SpaceID/ but instead it is https://mydomaine.com/xwiki/bin/create/SpaceID/
Notice we get create instead of view. But it is not the only difference : Additionnaly, there is a long
suite of characters added at the end which should not exist, and which starts with:
/WebHome?parent=AndHereMore+Characters.year.Full+Title+Topic
So the final URL should look like:
(https://)mydomain.com/xwiki/bin/view/SpaceID/Page_name/Sub_page_name/Final_sub_page_name/
Instead of :
(https://)mydomaine.com/xwiki/bin/create/SpaceID/Final_sub_page_name/WebHome?parent=AndHereMore+Characters.year.Full+Title+Topic.Webhome
And I am not even sure how exactly the part between WebHome?parent= and .Webhome is made from.
So some questions arise : in order to import Confluence spaces, and get it right with everything including the links in the pages, how could we proceeed to get the URLs right?
I have read about scripts related to nesting, can that help? https://www.xwiki.org/xwiki/bin/view/Documentation/UserGuide/Features/ContentOrganization/NestedPagesMigration/#HFixthehierarchy
If so, once a script created, what would be the steps to be taken in order to make it work?
I know we are not the first Confluence users working on a migration project to use XWiki, there must be some methods to fix issues such as this one?