edx-downloader KeyError: u'href

Hi, I kept encountering the following KeyError when running then standalone windows version. Not sure what's wrong there?

Traceback (most recent call last): File "", line 650, in File "", line 417, in main File "D:\PyInstaller-2.1\edx-dl\build\edx-dl\out00-PYZ.pyz\bs4.element", line 905, in getitem KeyError: u'href'

.

Apr 24 '15 04:04 kcw-022

same problem

Apr 30 '15 10:04 ghost

@kevw22 @Hasset Sorry for the late reply. Could you tell me which course? I can't replicate this problem.

May 10 '15 06:05 coiby

Hi @coiby , the error encountered in initiation.

May 10 '15 11:05 ghost

@Hasset Thank you!

According to the info you provide, there's something wrong with parsing out article[class='course']. Normally, COURSE should look like this:

<article class="course honor">
<section class="details">
<div aria-hidden="true" class="wrapper-course-image">
<a class="cover" href="/courses/McGillX/Body101x/1T2015/info">
<img alt="Body101x The Body Matters Home Page" class="course-image" src="/c4x/McGillX/Body101x/asset/Body101x_thumbnail.jpeg"/>
</a>
...
</div>
</section>
...
</article>

Could you print the content in COURSE?

for COURSE in COURSES:
        c_name = COURSE.h3.text.strip()
        print(COURSE) #add this before line 417
        c_link = BASE_URL + COURSE.a['href']
        if c_link.endswith('info') or c_link.endswith('info/'):
            state = 'Started'
        else:
            state = 'Not yet'
        courses.append((c_name, c_link, state))
    numOfCourses = len(courses)

May 10 '15 13:05 coiby

Hi @coiby ,

May 10 '15 13:05 ghost

@Hasset I'm sorry, but what I mean is that you add debugging code and run the program again to print COURSE.

May 10 '15 14:05 coiby

Hi @coiby , I don't know how to run the debugging code.

May 10 '15 14:05 ghost

Hi @coiby , I got it.

May 11 '15 09:05 ghost

@Hasset Thanks for your feedback. I've confirmed this bug. It's because there's no hyperlink, i.e., no href attribute for some courses which haven't started yet. A temporarily solution is to unenroll out of that kind of courses. But I'll fix this bug before tomorrow night.

May 12 '15 00:05 coiby

@coiby That's a very good news! Hurry up! :D

May 12 '15 03:05 ghost

@Hasset I've adopted a solution from iemejia/edx-downloader. But only edx-dl.py has been updated. Standalone packages will be updated a few days later.

May 13 '15 12:05 coiby

@coiby I've tested. It runs smoothly. Thanks!

May 13 '15 12:05 ghost

edx-downloader edx-downloader copied to clipboard

KeyError: u'href

edx-downloader
edx-downloader copied to clipboard