Hearaman Hearaman - 5 months ago 33
PHP Question

How to get values inside <![CDATA[values]] > using php DOM?

How can i get values inside <![CDATA[values]] > using php DOM.
This is few code from my xml.

<Destinations>

<Destination>
<![CDATA[Aghia Paraskevi, Skiatos, Greece]]]]>><![CDATA[
<CountryCode>GR</CountryCode>
</Destination>

<Destination>
<![CDATA[Amettla, Spain]]]]>><![CDATA[
<CountryCode>ES</CountryCode>
</Destination>

<Destination>
<![CDATA[Amoliani, Greece]]]]>><![CDATA[
<CountryCode>GR</CountryCode>
</Destination>

<Destination>
<![CDATA[Boblingen, Germany]]]]>><![CDATA[
<CountryCode>DE</CountryCode>
</Destination>

</Destinations>

Answer

Working with PHP DOM is fairly straightforward, and is very similar to Javascript's DOM.

Here are the important classes:

  • DOMNode — The base class for anything that can be traversed inside an XML/HTML document, including text nodes, comment nodes, and CDATA nodes
  • DOMElement — The base class for tags.
  • DOMDocument — The base class for documents. Contains the methods to load/save XML, as well as normal DOM document methods (see below).

There are a few staple methods and properties:

  • DOMDocument->load() — After creating a new DOMDocument, use this method on that object to load from a file.
  • DOMDocument->getElementsByTagName() — this method returns a node list of all elements in the document with the given tag name. Then you can iterate (foreach) on this list.
  • DOMNode->childNodes — A node list of all children of a node. (Remember, a CDATA section is a node!)
  • DOMNode->nodeType — Get the type of a node. CDATA nodes have type XML_CDATA_SECTION_NODE, which is a constant with the value 4.
  • DOMNode->textContent — get the text content of any node.

Note: Your CDATA sections are malformed. I don't know why there is an extra ]] in the first one, or an unclosed CDATA section at the end of the line, but I think it should simply be:

<![CDATA[Aghia Paraskevi, Skiatos, Greece]]>

Putting this all together we:

  1. Create a new document object and load the XML
  2. Get all Destination elements by tag name and iterate over the list
  3. Iterate over all child nodes of each Destination element
  4. Check if the node type is XML_CDATA_SECTION_NODE
  5. If it is, echo the textContent of that node.

Code:

$doc = new DOMDocument();
$doc->load('test.xml');
$destinations = $doc->getElementsByTagName("Destination");
foreach ($destinations as $destination) {
    foreach($destination->childNodes as $child) {
        if ($child->nodeType == XML_CDATA_SECTION_NODE) {
            echo $child->textContent . "<br/>";
        }
    }
}

Result:

Aghia Paraskevi, Skiatos, Greece
Amettla, Spain
Amoliani, Greece
Boblingen, Germany