When USMAI migrated from Drupal to Confluence, we did a bulk import of HTML
that worked very well. I wrote a script to scrape the web page and name
files based on their HTML title. The only thing is I can't remember exactly
how we did the import -- either through the UI or directly on the server.
If this sounds like something you'd be interested in trying, let me know
and I can scrounge my notes and speak with the admin I worked with to get
it all working.
On Wed, Sep 15, 2021 at 10:26 AM Pikas, Christina K. <
[log in to unmask]> wrote:
> This is very promising -thank you! Turns out I can't even do the Word
> version because the export to word plugin requires access to the cloud.
>
> Thinking now wordpress xml xport > markdown > pandoc > word ... ew!
>
> -----Original Message-----
> From: Code for Libraries <[log in to unmask]> On Behalf Of Voß, Jakob
> Sent: Wednesday, September 15, 2021 8:42 AM
> To: [log in to unmask]
> Subject: [EXT] [CODE4LIB] AW: Wordpress > Atlassian Confluence?
>
> APL external email warning: Verify sender [log in to unmask]
> before clicking links or attachments
>
> Hi Christina,
>
> Maybe this works:
>
> 1. export Markdown from WordPress (e.g.
> https://github.com/lonekorean/wordpress-export-to-markdown)
> 2. convert Markdown to Confluence syntax with pandoc 3. import pages
>
> Step 2 likely requires additional cleanup with pandoc filters or manual
> search and replace to not include junk and to keep internal links.
>
> Jakob
>
> ________________________________________
> Von: Code for Libraries <[log in to unmask]> im Auftrag von Pikas,
> Christina K. <[log in to unmask]>
> Gesendet: Dienstag, 14. September 2021 14:31:25
> An: [log in to unmask]
> Betreff: [CODE4LIB] Wordpress > Atlassian Confluence?
>
> Hi All,
> Recent discussions reminded me I might have good luck asking here (fingers
> crossed!) I'm faced with moving 2 internal WordPress sites to an internal
> hosted Confluence wiki. I'm thinking post > page. As far as I can tell, the
> Confluence community says to use a plugin to convert posts to Word and then
> use the Confluence tool to import from Word into pages.... This sounds...
> awful? I wouldn't be doing this if there were any other choice (well,
> besides SharePoint).
>
> Any suggestions? I know there's a Confluence XML to move from hosted to
> cloud instances? Has anyone successfully done this?
>
> Thanks,
> Christina
>
> ------
> Christina K. Pikas, BS, MLS, PhD
> (she/her/hers)
> Librarian
> The Johns Hopkins University Applied Physics Laboratory
> [log in to unmask]<mailto:[log in to unmask]>
>
--
*Joseph A. Koivisto*
*Systems Librarian*
*Consortial Library Applications Support*
*University of Maryland*
*McKeldin Library, Room B0238*
*7649 Library Ln., College Park, MD 20742-7011*
*[log in to unmask] <[log in to unmask]>*
*301-314-4812**[image: ORCID iD icon]orcid.org/0000-0002-7515-2522*
<https://orcid.org/0000-0002-7515-2522>
|