If you process a lot of text, you know the value of programs that make the job easier. I used to use Awk for text cleanup jobs, but now I have found a better friend: TextSoap! Creating and sharing TextSoap cleaners is easy and fun.
As I write new cleaners, I will add them to the list below. You can use them as they are or modify them as you need. It is really easy. I also have an Alfred workflow for working with TextSoap.
|Fix Bible Text.tscleaner||This cleaner takes text copied from the theWord Bible program and formats it the way I like it.|
|Make Title from URL.tscleaner||This takes the slug out of a URL and creates a title from it.|
|Name:Address converter.tscleaner||This one takes a list of <name>:<address> and creates an unordered list of anchors.|
|PostURL to Title.tscleaner||Same as Make Title from URL.tscleaner except it assumes the slug is the text given.|
|Remove Hyphens.tscleaner||Replaces all hyphens with spaces.|
|Remove dashes.tscleaner||Replaces all dashes with spaces.|
|Remove p tag.tscleaner||Removes all HTML p tags so the text can be pasted into WordPress.|
|list of addresses to unordered list anchors.tscleaner||Takes a list of URLs and makes an unordered list of anchors for them.|
|CleantoMinHTML||This cleaner takes HTML, removes all styling tags and all paragraph tags, but leaves list tags and anchor tags. It is great for stripping expressive HTML down to the bare minimum for inclusion in a CMS.|
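
To give a feel for what two of these cleaners do, here is a rough Python sketch of the "Make Title from URL" and "Name:Address converter" transformations. This is not how TextSoap implements them, and it is not the contents of the .tscleaner files; the function names `title_from_url` and `anchors_from_pairs` are made up for illustration, and the capitalization rule (first letter of every word) is an assumption.

```python
from html import escape
from urllib.parse import urlparse


def title_from_url(url: str) -> str:
    """Take the last path segment (the slug) of a URL and turn it into a title.

    Hypothetical helper: the capitalization rule is an assumption.
    """
    path = urlparse(url).path.rstrip("/")
    slug = path.rsplit("/", 1)[-1]
    # Drop a file extension such as ".html" if one is present.
    if "." in slug:
        slug = slug.rsplit(".", 1)[0]
    words = slug.replace("-", " ").replace("_", " ").split()
    return " ".join(word.capitalize() for word in words)


def anchors_from_pairs(text: str) -> str:
    """Turn lines of <name>:<address> into an unordered list of anchors."""
    items = []
    for line in text.splitlines():
        if ":" not in line:
            continue  # skip blank or malformed lines
        # Split on the first colon only; the address usually contains colons too.
        name, address = line.split(":", 1)
        href = escape(address.strip(), quote=True)
        items.append(f'  <li><a href="{href}">{escape(name.strip())}</a></li>')
    return "<ul>\n" + "\n".join(items) + "\n</ul>"


if __name__ == "__main__":
    print(title_from_url("https://example.com/blog/my-first-post/"))
    print(anchors_from_pairs("Example:https://example.com"))
```

Because a bare slug has no scheme or slashes, `title_from_url("my-first-post")` also works, which is roughly what the "PostURL to Title" variant does.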
I have also written an article on writing cleaners for Mac Tuts+. It is called:
- How to Effortlessly Create Markdown With TextSoap
The cleaner in that article is a complicated one: it takes Markdown text, translates it to HTML, and fixes it up to work in a WordPress page post.