Blog content migration
This task is part of the FellowshipHacks/Projects/Blog project.
Find the way to migrate the content of the Fellows blogs from the current ez-publish platform to the new Wordpress MU platform.
People working on this task.
Volunteers are always welcome! Have a look at FellowshipHacks to know how you can help
Last updated on: 090202
The migration procedure is based on exporting the blog content as RSS file from EZ and import it into WPMU. Since not all the information is exported from EZ, some data must be migrated from the EZ database to the WPMU one.
A migration route has been found for all components (posts, comments, tags, etc) and has been tested on all blogs.
- Find (and read) useful documentation on EZ and WPMU internals
- Current EZ platform assessment
- Complete first list of migration routes and collect information about them
Check if it's worth to use the WPDB class to do the SQL scripting work
- Not really worth unless you are: proficient in OO-PHP and willing to develop a WP plugin
- EZ RSS feeds contain only the latest 15 entries of each blog
The EZ admin panel -> RSS (and the EZ configuration file) lets you set the maximum and default number of entries, but that applies to RSS feeds that you can build with the admin panel, not to the RSS feeds automatically generated for blogs.
- Found: the parameter is set in line 4 of /design/fsfe/templates/rss_pagelayout.tpl
- Author: all posts are titled as made by "admin".
Fixed using this patch to /wp-admin/import/rss.php: http://mu.wordpress.org/forums/topic.php?id=3066
- Check problems in Georg's blog
- Tried importing Georg's feed into my account, without problems; so it must be a problem on Georg's account
The WP RSS importer can import tags (using the <category> RSS element, since there's no <tag> element in the RSS spec); WP will import them as Categories, not as Tags (while EZ only has tags, WP has both "tags" and "categories"; the difference is that "categories" can be arranged hierarchically). If the user is fine with that, OK. Otherwise, WP provides a command to automatically convert (=move, not copy) categories to tags (and vice-versa), preserving the relationship between tag/category and posts.
cri: I hacked /design/fsfe/templates/rss_pagelayout.tpl to include the <category> element to the RSS feed, and tested import and subsequent conversion to tags
Blog entries migrated to WP contain links to pictures hosted on the ez platform: see e.g. https://www.fsfe.org/var/fsfe/storage/images/fellows/mk/pictures/booth_blit_08/214498-1-eng-GB/booth_blit_08.jpg
cri: I've searched the database for all links to fsfe.org contained in all current blog posts
90% of them point directly to the (ezpublish)/var/fsfe/storage... tree, so all we have to do is to copy this tree to the new webserver
- The rest point to generic fsfe.org pages, or to blogs (we already have a plan for migration / URL rewriting for these), or to stuff that is no longer available (e.g. to the old fsfe.org implementation in Plone, the old forums)
about 15-20 links point to other EZ objects (files, articles, etc) hosted in folders under the Fellows pages outside the /var/fsfe/storage... tree; we can add some custom URL-rewriting rules for these, or ask Fellows to upload the media to Wordpress (the list is in ~/wrk/0812_blog_test/images on cavendish)
- Test the RSS migration procedure on all blogs and collect feedback from the bloggers
Notes: discarded migration routes
Note: I've moved here the initial notes about other possible migration routes. These are no more relevant, since we've gone the RSS way.
Route #2: CSV
See Ivan s-h@ 081115.
Ivan has installed one ez plugin to export in CSV, but it seems that only thing it can do is export from one database table.
See if this WP plugin is useful: http://www.zackpreble.com/2008/03/28/importing-posts-from-csv-file-into-wordpress/
- Export from ez data in CSV format (through an ez plugin)
- Import CSV data in WPMU
Route #3: EZ packages
cri: I didn't try it, but I guess it could be useful to migrate other kinds of content: e.g. objects in Fellows home directories: create a package and upload it to the Fellow home page in the wiki.
(from the f-h@ archives)
Via Administration you can create packages: http://ez.no/doc/ez_publish/technical_manual/3_8/features/packages these content object packages are tar.gz xml and Images etc.
Route #4: XML -> eXtended RSS
Has anybody tried it?
cri: I would leave this as the last option, since it requires developing XSL code to transform EZ's XML into WP's XML; it's much more convenient for me to work with SQL...
(from the f-h@ archives)
The content in eZ can be exported as XML and then transformed into WordPress eXtended RSS. eZ allows you to export all data as XML - transforming to another XML schema should be fairly straightforward. Reference: http://ez.no/doc/ez_publish/technical_manual/3_8/reference/modules/content But you can use the extension: http://svn.ez.no/svn/extensions/ezxmldesign/