scrape-linkedin-selenium
Added jobs and people in company_scraper
Requires some company_scraper testing with temporary LinkedIn accounts. Command: scrapeli --company=delltechnologies -o output.json
Some parts of the code are language-dependent (the regular expressions, for example), so it won't work for languages other than English. It works as expected for English, though.
The click_on_tab() function doesn't seem to work for hidden tabs in the expander. It can easily be simplified to handle that case:
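def click_on_tab(self, tab_name):
    # Navigate straight to the tab's URL instead of clicking it,
    # so tabs hidden in the expander are reachable too.
    main_url = "https://www.linkedin.com/company/{}/".format(self.company)
    try:
        self.driver.get(main_url + tab_name)
    except Exception:
        print("Tab cannot be found.")
        return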
You also have to add self.company = company to the constructor of the CompanyScraper class.
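For reference, here is a minimal sketch of how the two changes fit together. The constructor signature, the RecordingDriver stand-in and the "jobs" tab name are assumptions made for illustration; the real CompanyScraper takes a Selenium driver and may be constructed differently.

```python
class CompanyScraper:
    """Stripped-down, hypothetical stand-in for the real scraper class."""

    def __init__(self, company, driver):
        # Assumed signature: the real constructor may take other arguments.
        self.company = company  # stored so click_on_tab() can build the URL
        self.driver = driver

    def click_on_tab(self, tab_name):
        # Go directly to the tab's URL rather than clicking a page element.
        main_url = "https://www.linkedin.com/company/{}/".format(self.company)
        try:
            self.driver.get(main_url + tab_name)
        except Exception:
            print("Tab cannot be found.")
            return


class RecordingDriver:
    """Fake driver that records requested URLs instead of opening a browser."""

    def __init__(self):
        self.visited = []

    def get(self, url):
        self.visited.append(url)


driver = RecordingDriver()
scraper = CompanyScraper("delltechnologies", driver)
scraper.click_on_tab("jobs")  # "jobs" is an assumed tab name
print(driver.visited)
```

Because the method only builds a URL, it does not depend on the tab being visible in the page, which is why it sidesteps the hidden-tab problem.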
Thanks a lot, I overlooked this issue. Added your code.
Added the code and verified it with a different language as well. Also fixed the JSON output, where Cyrillic characters were showing up as \u escape sequences.
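The thread doesn't show how the JSON output was fixed. One common way to get readable Cyrillic in Python, sketched here as an assumption rather than the actual change, is to pass ensure_ascii=False to the json module and write the file as UTF-8. The write_results helper and the sample record are hypothetical.

```python
import json
import os
import tempfile


def write_results(data, path):
    """Hypothetical helper: write scraped data as human-readable UTF-8 JSON."""
    with open(path, "w", encoding="utf-8") as f:
        # ensure_ascii=False keeps non-ASCII characters as-is
        # instead of converting them to \uXXXX escapes.
        json.dump(data, f, ensure_ascii=False, indent=2)


record = {"name": "Пример", "industry": "Технологии"}  # made-up sample data

escaped = json.dumps(record)                       # default: \u escapes
readable = json.dumps(record, ensure_ascii=False)  # Cyrillic kept as-is

out_path = os.path.join(tempfile.mkdtemp(), "output.json")
write_results(record, out_path)
with open(out_path, encoding="utf-8") as f:
    written = f.read()
```

Both forms decode to the same data; the difference is only in how the file reads to a human or to tools that don't unescape JSON strings.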