html-to-md
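Cannot parse an HTML table wrapped in <recordset> <record><![CDATA[ ]]> </record> </recordset>
I ran into an odd piece of HTML: a table containing two links. The problem is that the table is wrapped in the following markup: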
<div id="7383225"><script type="text/xml">
<datastore>
<nextgroup><![CDATA[<a href="/module/jpage/dataproxy.jsp?page=1&appid=1&appid=1&webid=1568&path=/&columnid=1071207&unitid=7383225&permissiontype=0"></a>]]></nextgroup>
<recordset>
<table width="100%" border="0" cellspacing="0" cellpadding="0" align="center" style="margin-bottom:15px;">
</table>
<record><![CDATA[
<tr>
<td height="23" align="left" style="border-bottom:dashed 1px #ccc">
<span style="padding-right:8px;"><img src="/picture/176/1701181528363439739.gif"></span>
<a style="line-height:45px;font-size:16px;" href='/art/2025/1/13/art_1071207_59031776.html' class='bt_link' title='xxxxxxxxxxxxxxx' target="_blank">xxxxxxxxxxxxxxxxx</a>
</td>
<td align="right" class="bt_time" style="font-size:16px;border-bottom:dashed 1px #ccc">2025-01-13</td> </tr>]]></record>
<record><![CDATA[
<tr>
<td height="23" align="left" style="border-bottom:dashed 1px #ccc">
<span style="padding-right:8px;"> <img src="/picture/176/1701181528363439739.gif"></span>
<a style="line-height:45px;font-size:16px;" href='/art/2024/12/11/art_1071207_59031343.html' class='bt_link' title='xxxxxxxxxxxxxxxx' target="_blank">xxxxxxxxxxxxxxxxxxx</a>
</td>
<td align="right" class="bt_time" style="font-size:16px;border-bottom:dashed 1px #ccc">2024-12-11</td> </tr>]]>
</record>
</recordset>
</datastore>
After converting this to Markdown, the result is essentially empty. How should this case be handled? Thanks.
@helxsz
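html-to-md does not support XML; there is no code in it to handle that. Could you try converting the content to HTML first and then converting that to Markdown? Thanks.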
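One possible way to do the "convert to HTML first" step is sketched below. It is only a rough illustration, assuming every row lives in a `<record><![CDATA[ ... ]]></record>` block as in the snippet above. The helper name `recordsetToHtml` and the shortened sample are hypothetical and not part of html-to-md, and a regular expression is used here for brevity where a real XML parser would be more robust.

```javascript
// Hypothetical helper (not part of html-to-md): pulls the HTML rows out of
// each <record><![CDATA[ ... ]]></record> block and wraps them in one <table>.
function recordsetToHtml(xml) {
  // Non-greedy match so each record's CDATA payload is captured separately.
  const recordPattern = /<record>\s*<!\[CDATA\[([\s\S]*?)\]\]>\s*<\/record>/g;
  const rows = [];
  for (const match of xml.matchAll(recordPattern)) {
    rows.push(match[1].trim());
  }
  // Return an empty string when no records are found.
  return rows.length === 0 ? '' : `<table>\n${rows.join('\n')}\n</table>`;
}

// Shortened sample in the same shape as the snippet in the question.
const sample = `<datastore>
<recordset>
<record><![CDATA[
<tr><td><a href='/art/2025/1/13/art_1071207_59031776.html'>first</a></td><td>2025-01-13</td></tr>]]></record>
<record><![CDATA[
<tr><td><a href='/art/2024/12/11/art_1071207_59031343.html'>second</a></td><td>2024-12-11</td></tr>]]>
</record>
</recordset>
</datastore>`;

const html = recordsetToHtml(sample);
console.log(html);
```

The resulting string is ordinary HTML, so it could then be handed to html-to-md in the usual way (for example `require('html-to-md')(html)`). Whether the table then renders as wanted has not been checked here.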