google-play-scraper

Genres/categories incorrect or missing

Open archon810 opened this issue 2 years ago • 11 comments

  • Operating System: OpenSUSE 15.3
  • Node version: v12.20.0
  • google-play-scraper version: git+https://github.com/facundoolano/google-play-scraper#pull/545/head

Description:

Let's take https://play.google.com/store/apps/details?id=mytown.home as an example. In both the JSON and the HTML, we see multiple categories for this game: https://play.google.com/store/apps/category/GAME_SIMULATION and https://play.google.com/store/apps/category/GAME_CASUAL

However, google-play-scraper returns only "genre":"Educational","genreId":"GAME_EDUCATIONAL" which isn't even visible on the page. Both GAME_SIMULATION and GAME_CASUAL are missing from the returned data.

image

  1. Can support be added for more than one genre/category per app?
  2. The correct categories should be returned. I suppose, in this case, it should be both of the above plus GAME_EDUCATIONAL?

Another example: https://play.google.com/store/apps/details?id=com.mojang.minecraftpe. This should have https://play.google.com/store/apps/category/GAME_SIMULATION, but we get back "genre":"Arcade","genreId":"GAME_ARCADE". image

And another: https://play.google.com/store/apps/details?id=com.nianticlabs.pokemongo. This should have https://play.google.com/store/apps/category/GAME_ACTION and https://play.google.com/store/apps/category/GAME_CASUAL, but we get "genre":"Adventure","genreId":"GAME_ADVENTURE". image

I don't understand why the data mismatches so much. Is the wrong data being looked at?
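To make the mismatch easier to check, here is a minimal sketch that pulls every category ID out of a page's HTML and compares it with what the scraper returned. It assumes the categories appear in the markup as `/store/apps/category/<ID>` links, like the URLs above. The helper names `extractCategoryIds` and `missingCategories` and the sample HTML are made up for illustration and are not part of google-play-scraper.

```javascript
// Hypothetical helper (not part of google-play-scraper): collects every
// category ID linked from a details page by scanning for
// /store/apps/category/<ID> URLs. Duplicates are dropped, order is kept.
function extractCategoryIds(html) {
  const pattern = /\/store\/apps\/category\/([A-Z0-9_]+)/g;
  const seen = new Set();
  for (const match of html.matchAll(pattern)) {
    seen.add(match[1]);
  }
  return [...seen];
}

// Hypothetical helper: lists the categories found on the page that differ
// from the single genreId the scraper returned.
function missingCategories(html, scraped) {
  return extractCategoryIds(html).filter((id) => id !== scraped.genreId);
}

// Made-up HTML fragment standing in for the real page markup.
const sampleHtml = `
  <a href="https://play.google.com/store/apps/category/GAME_SIMULATION">Simulation</a>
  <a href="/store/apps/category/GAME_CASUAL">Casual</a>
  <a href="/store/apps/category/GAME_SIMULATION">Simulation</a>
`;

// The values reported above for mytown.home.
const scraped = { genre: 'Educational', genreId: 'GAME_EDUCATIONAL' };

console.log(extractCategoryIds(sampleHtml));
console.log(missingCategories(sampleHtml, scraped));
```

Running the same two helpers against the real page HTML would show whether the scraper's `genreId` appears among the linked categories at all.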

archon810 • May 31 '22 22:05

https://play.google.com/store/apps/details?id=com.google.android.googlequicksearchbox doesn't even show a category in HTML? I can't find it. Yet the scraper returns "genre":"Tools","genreId":"TOOLS". Google, why are you like this?

archon810 • May 31 '22 23:05

@archon810 I think the scraper gets its return values from the JavaScript data.

baguse • Jun 01 '22 05:06

> @archon810 I think the scraper gets its return values from the JavaScript data.

This needs looking at, because the data is there in the JSON too, I think.

archon810 • Jun 01 '22 05:06

PSA: Google released a new version of the Google Play UI. Genres are among the fields that have changed.

The change started rolling out May 12th per this article: https://chromeunboxed.com/google-play-store-web-app-redesign-begins-rollout-us

You may not have seen the change because Google was doing A/B testing.

As of this week, it looks like they are 100% on the new UI.

borgmonitoringzoo • Jun 02 '22 17:06

@facundoolano Is this one on your radar as well?

archon810 • Jun 13 '22 17:06

I would not help @archon810, as they use open source software like this one, but refuse any information on their own software.

89z • Jun 14 '22 05:06

> I would not help @archon810, as they use open source software like this one, but refuse any information on their own software.

I'm blocking you because you seem to have some weird vendetta and complete lack of understanding of licenses, like MIT.

archon810 • Jun 14 '22 06:06

Can you check if this still happens with the latest version?

facundoolano • Jun 23 '22 16:06

@facundoolano the issue still persists when using git+https://github.com/facundoolano/google-play-scraper#v9.0.0

maciejmackowiak • Jun 28 '22 13:06

@facundoolano any updates here? Were you able to look into this?

maciejmackowiak • Jul 11 '22 09:07

This issue still persists. "APPLICATION" is the only category that is being returned.

srikanthlogic • Aug 16 '22 20:08