Non-ASCII from Posterous to WordPress

If you are moving from Posterous to WordPress, like I did, and you write your posts using non-ASCII characters (such as those very weird from el idioma español) you will notice that everything becomes garbled after you import.

The solution is actually quite easy. Inside the posts/ folder of your zipped Posterous backup you’ll find a bunch of XML files. Just concatenate them together, prepend an XML header (like this, but you’ll find the exact one for you reading your wordpress_export_1.xml file) and append </channel></rss>.

Finally, you have to cut all the carriage returns. I used Cygwin’s cat for the first part of the process and Notepad++ for removing the \n’s. That’s it.

About these ads

Posted on February 21, 2013, in Uncategorized. Bookmark the permalink. Leave a comment.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

Follow

Get every new post delivered to your Inbox.

Join 1,800 other followers

%d bloggers like this: