DOM manipulation in PHP

I've found PHP Simple HTML DOM to be the most useful and straightforward library yet. Better than PECL, I would say.

I've written an article on how to use it to scrape MySpace artist tour dates (just an example). Here's a link to the PHP Simple HTML DOM Parser.
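To give a rough idea of what scraping with it looks like, here is a minimal sketch. It assumes simple_html_dom.php has been downloaded into the same directory as the script; the markup, the `li.date` class and the `$shows` array are made up for illustration and are not taken from the article.

```php
<?php
// Assumption: simple_html_dom.php (PHP Simple HTML DOM Parser) sits next to this script.
require_once __DIR__ . '/simple_html_dom.php';

// Hypothetical markup standing in for a scraped tour-dates page.
$markup = '<ul id="tour">'
        . '<li class="date"><a href="/shows/1">Berlin</a></li>'
        . '<li class="date"><a href="/shows/2">Paris</a></li>'
        . '</ul>';

// str_get_html() parses a string; file_get_html() does the same for a URL or file.
$html = str_get_html($markup);
if ($html === false) {
    throw new RuntimeException('Could not parse the markup');
}

// find() takes a CSS-like selector and returns the matching elements.
$shows = [];
foreach ($html->find('li.date a') as $link) {
    $shows[] = [
        'city' => $link->plaintext, // text content of the element
        'url'  => $link->href,      // attributes are exposed as properties
    ];
}

// Release the parser's internal references once done.
$html->clear();

print_r($shows);
```

For a real page you would swap the string for file_get_html() with the page's address and adjust the selector to whatever markup the site actually uses.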


PHP has a PECL extension that gives you access to the features of HTML Tidy. Tidy is a pretty powerful library that should be able to take code like that and close tags in an intelligent manner.

I use it to clean up malformed XML and HTML sent to me by a classified ad system prior to import.
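Something along these lines is what I mean; treat it as a sketch rather than my exact import code. It assumes the tidy extension is enabled, and `repairFragment()` and the sample markup are hypothetical names made up for the example.

```php
<?php
// Assumption: the tidy extension is loaded (check with `php -m`).

/**
 * Repair a malformed HTML fragment and return the cleaned markup.
 * repairFragment() is a hypothetical helper, not part of the extension.
 */
function repairFragment(string $html): string
{
    $config = [
        'show-body-only' => true, // return just the fragment, not a full document
        'output-xhtml'   => true, // emit well-formed XHTML
        'wrap'           => 0,    // do not wrap long lines
    ];

    $clean = tidy_repair_string($html, $config, 'utf8');
    if ($clean === false) {
        throw new RuntimeException('Tidy could not repair the input');
    }
    return $clean;
}

// Hypothetical broken input: unclosed <p>, <li> and <ul> tags.
$broken = '<div><p>Unclosed paragraph<ul><li>First<li>Second</div>';
$fixed  = repairFragment($broken);

echo $fixed, PHP_EOL;
```

For XML input, Tidy also has `input-xml` and `output-xml` options that can be set in the same config array in place of `output-xhtml`.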

Tags: HTML, PHP, DOM