feedparser icon indicating copy to clipboard operation
feedparser copied to clipboard

Published date string duplicated

Open adamn opened this issue 8 years ago • 0 comments

When going to this feed, the published field is populated as the date twice in a row (u'2016-05-24 14:17:57.02016-05-24 14:17:57.0').


<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="http://www.unodc.org/misc/feed.xsl"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:content="http://purl.org/rss/1.0/modules/content/" version="2.0"><channel><title>UNODC Publications</title><link>http://www.unodc.org/unodc/en/feed/publications.xml</link><description>UNODC Publications</description><item><title>World wildlife crime report 2016</title><link>http://www.unodc.org/documents/data-and-analysis/wildlife/World_Wildlife_Crime_Report_2016_final.pdf</link><guid>http://www.unodc.org/documents/data-and-analysis/wildlife/World_Wildlife_Crime_Report_2016_final.pdf</guid><description></description><pubDate>Tue, 24 May 2016 2:17:57 PM CEST</pubDate></item><item><title>The Afghan Opiate Trade and Africa - A Baseline Assessment- 2016</title><link>http://www.unodc.org/documents/data-and-analysis/Afghanistan/Afghan_Opiate_trade_Africa_2016_web.pdf</link><guid>http://www.unodc.org/documents/data-and-analysis/Afghanistan/Afghan_Opiate_trade_Africa_2016_web.pdf</guid><description></description><pubDate>Wed, 16 Mar 2016 4:34:12 PM CET</pubDate></item><item><title>Afghanistan Opium Survey 2015 - Socio-economic analysis</title><link>http://www.unodc.org/documents/crop-monitoring/Afghanistan/Afghanistan_opium_survey_2015_socioeconomic.pdf</link><guid>http://www.unodc.org/documents/crop-monitoring/Afghanistan/Afghanistan_opium_survey_2015_socioeconomic.pdf</guid><description> Afghanistan Opium Survey 2015 - Socio-economic analysis </description><pubDate>Wed, 16 Mar 2016 2:19:37 PM CET</pubDate></item><item><title>Afghanistan Opium Survey 2015 - Cultivation and Production</title><link>http://www.unodc.org/documents/crop-monitoring/Afghanistan/_Afghan_opium_survey_2015_web.pdf</link><guid>http://www.unodc.org/documents/crop-monitoring/Afghanistan/_Afghan_opium_survey_2015_web.pdf</guid><description></description><pubDate>Fri, 18 Dec 2015 1:19:09 PM CET</pubDate></item><item><title>Southeast Asia Opium Survey 2015 - Lao PDR, Myanmar</title><link>http://www.unodc.org/documents/crop-monitoring/sea/Southeast_Asia_Opium_Survey_2015_web.pdf</link><guid>http://www.unodc.org/documents/crop-monitoring/sea/Southeast_Asia_Opium_Survey_2015_web.pdf</guid><description>Southeast Asia Opium Survey 2015 - Lao PDR, Myanmar</description><pubDate>Tue, 15 Dec 2015 4:30:42 PM CET</pubDate></item><item><title>Drug Money - the illicit proceeds of opiates trafficked on the Balkan route</title><link>http://www.unodc.org/documents/data-and-analysis/Studies/IFF_report_2015_final_web.pdf</link><guid>http://www.unodc.org/documents/data-and-analysis/Studies/IFF_report_2015_final_web.pdf</guid><description></description><pubDate>Thu, 26 Nov 2015 3:15:21 PM CET</pubDate></item><item><title>Strengthening the medico-legal response to sexual violence</title><link>http://www.unodc.org/documents/publications/WHO_RHR_15.24_eng.pdf</link><guid>http://www.unodc.org/documents/publications/WHO_RHR_15.24_eng.pdf</guid><description></description><pubDate>Wed, 25 Nov 2015 11:07:00 AM CET</pubDate></item><item><title>Afghanistan Opium Survey 2015 - Executive Summary</title><link>http://www.unodc.org/documents/crop-monitoring/Afghanistan/Afg_Executive_summary_2015_final.pdf</link><guid>http://www.unodc.org/documents/crop-monitoring/Afghanistan/Afg_Executive_summary_2015_final.pdf</guid><description>Afghanistan Opium Survey 2015 - Executive Summary</description><pubDate>Wed, 14 Oct 2015 7:53:45 AM CEST</pubDate></item><item><title>Estado Plurinacional de Bolivia - Monitoreo de Cultivos de Coca 2014 </title><link>http://www.unodc.org/documents/bolivia/Bolivia_Informe_Monitoreo_Coca_2014.pdf</link><guid>http://www.unodc.org/documents/bolivia/Bolivia_Informe_Monitoreo_Coca_2014.pdf</guid><description></description><pubDate>Tue, 18 Aug 2015 11:04:00 AM CEST</pubDate></item><item><title>Peru - Informe Monitoreo de Cultivos de Coca 2014 (Summary in English included)</title><link>http://www.unodc.org/documents/crop-monitoring/Peru/Peru_Informe_monitoreo_coca_2014_web.pdf</link><guid>http://www.unodc.org/documents/crop-monitoring/Peru/Peru_Informe_monitoreo_coca_2014_web.pdf</guid><description></description><pubDate>Wed, 15 Jul 2015 6:51:02 PM CEST</pubDate></item></channel></rss>

published_parsed is correct though:

>>> parser['entries'][0].get('published')
u'2016-05-24 14:17:57.02016-05-24 14:17:57.0'
>>> parser['entries'][0].get('published_parsed')
time.struct_time(tm_year=2016, tm_mon=5, tm_mday=24, tm_hour=14, tm_min=17, tm_sec=57, tm_wday=1, tm_yday=145, tm_isdst=0)
>>>

This is on Python 2.7.11 using version 5.2.1 from PyPI and the following virtualenv:

BeautifulSoup==3.2.1
boto3==1.2.2
botocore==1.3.30
cssselect==0.9.1
docutils==0.12
feedparser==5.1.3
futures==2.2.0
goose-extractor==1.0.25
jieba==0.38
jmespath==0.9.0
lambda-uploader==0.5.1
lxml==3.6.0
nltk==3.2.1
Pillow==3.2.0
piprot==0.9.6
python-dateutil==2.5.3
python-lambda-local==0.1.2
requests==2.3.0
requests-futures==0.9.7
simplejson==3.8.2
six==1.10.0
virtualenv==15.0.1

adamn avatar Jun 07 '16 18:06 adamn