CFLib.org – Common Function Library Project

detableize(string)

Last updated August 25, 2005

Version: 1 | Requires: ColdFusion MX | Library: StrLib

 
Rated 1 time(s). Average Rating: 5.0

Description:
Handy for screen scraping. Strips all opening and closing table, tr, td, and th tags from a string. It also removes leading whitespace from each line and collapses extra newline characters, so the indentation of the original markup is eliminated.

Return Values:
Returns a string.

Example:

<cfset htmlString = "<table>
    <tr>
        <th width=""18"" height=""11"">
            this
        </th>
    </tr>
    <tr>
        <td>
     | | is (test whitespace stripping)
        </td>
    </tr>
    <tr bgcolor=""red"">
        <td>
            some
        </td>
    </tr>
    <tr>
        <td>
            test
        </td>
    </tr>
    <tr>
        <td>
            code
        </td>
    </tr>
</table>">


<cfoutput>
#htmlCodeFormat(htmlString)#
<br><br>
#htmlCodeFormat(detableize(htmlString))#
</cfoutput>
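
As a smaller illustration of the same call, the sketch below runs detableize on a one-line table. It assumes the UDF from this page has already been defined or included in the request; the variable names row and cells are made up for this example.

```cfml
<!--- Assumes detableize() from this page is already defined or cfincluded. --->
<!--- row and cells are hypothetical names used only in this sketch. --->
<cfset row = "<table><tr><td>alpha</td><td>beta</td></tr></table>">
<cfset cells = detableize(row)>

<cfoutput>
<!--- Every table, tr, and td tag is removed, so only the cell text remains. --->
#htmlEditFormat(cells)#
</cfoutput>
```

Because the tags are deleted rather than replaced, text from adjacent cells on the same line is joined with no separator, so this prints alphabeta. When the cells need to stay distinguishable, put each cell on its own line in the input, as the larger example above does.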

Parameters:

Name     Description         Required
string   String to format.   Yes

Full UDF Source:

<!---
 Strips all table and table content tags and extra whitespace from a string.
 
 @param string      String to format. (Required)
 @return Returns a string. 
 @author Jared Rypka-Hauer (jared@web-relevant.com) 
 @version 1, August 25, 2005 
--->

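<cffunction name="detableize" returntype="string" output="false">
    <cfargument name="string" type="string" required="true" />
    <cfset var outputString = arguments["string"]>
    <!--- Remove opening and closing table tags, with or without attributes. --->
    <cfset outputString = reReplaceNoCase(outputString, "</?table(\s[^>]*)?>", "", "all")>
    <!--- Remove opening and closing tr, th, and td tags, with or without attributes. --->
    <cfset outputString = reReplaceNoCase(outputString, "</?t[rhd](\s[^>]*)?>", "", "all")>
    <!--- Strip leading whitespace from every line; \s also consumes blank lines. --->
    <cfset outputString = reReplaceNoCase(outputString, "(?m)^\s*", "", "all")>
    <!--- Collapse any remaining runs of newlines into a single newline. --->
    <cfset outputString = reReplaceNoCase(outputString, "\n{2,}", chr(10), "all")>
    <cfreturn outputString />
</cffunction>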
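
To see the whitespace step in isolation, the sketch below applies the same (?m)^\s* pattern from the source above to a small string. The variable names sample and trimmed are hypothetical, and the sample text is invented for illustration.

```cfml
<!--- Hypothetical sample: two indented lines separated by a blank line. --->
<cfset sample = "    first" & chr(10) & chr(10) & "        second">

<!--- (?m) makes ^ match at the start of every line; \s* then consumes the
      indentation and, because \s also matches newlines, any blank lines. --->
<cfset trimmed = reReplaceNoCase(sample, "(?m)^\s*", "", "all")>

<cfoutput>#htmlCodeFormat(trimmed)#</cfoutput>
```

The result should be the two words on consecutive lines with no indentation. Since \s matches newline characters as well as spaces and tabs, this step already swallows most blank lines, which leaves the final \n{2,} replacement to catch any runs that remain.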
Created by Raymond Camden / Design by Justin Johnson