INDIA_SOI_2022 icon indicating copy to clipboard operation
INDIA_SOI_2022 copied to clipboard

Build a single subdistrict boundary file

Open planemad opened this issue 2 years ago • 13 comments

Cleaning the data upto the subdistrict level will be useful to work with the village shapes and perform other data joins. There are several gaps even at the subdistrict level which probably need to be addressed first.

Missing coverage

  • Arunachal
  • Dadra and Nagar Haveli and Daman and Diu
  • Gujarat (can be built from villages)
  • Jammu and Kashmir
  • Ladakh
  • Lakshadweep
  • Maharashtra (can be built from villages)
  • Meghalaya
  • Sikkim
  • Tripura
Screen Shot 2022-09-20 at 12 44 52 PM

Outdated coverage

Cursory glance shows that these subdistricts maybe not be accurate or incomplete

  • Bihar
  • Odisha
  • West Bengal
Screen Shot 2022-09-20 at 12 47 28 PM

planemad avatar Sep 20 '22 16:09 planemad

So we should merge sub districts together from sub district files. Clean state name, district name, and sub district name. Does that sound like a plan? Should we merge these from the subdistricts or the villages? I'm thinking subdistricts but see value in doing from villages up.

justinelliotmeyers avatar Sep 20 '22 19:09 justinelliotmeyers

And maybe also map it to LGD. Let me know how I can help.

upperwal avatar Sep 21 '22 07:09 upperwal

Merged file in Readme: https://drive.google.com/drive/folders/1MOlQQC41_q1i0nHjf9p2XYLWNOkGQ_o4 I started work on this. Merged all subdistricts together for 25 states. Added the following fields: CS - Clean State CD - Clean District CT - Clean Tehsil CSLGD - Clean State LGD RANDOM_ID - ID placeholder until we get SOI IDs or complete all Subdistrict LGDs

Added all state names and LGD codes. Had to fix Chhattisgarh and Uttar Pradesh to correct spelling.

justinelliotmeyers avatar Sep 21 '22 13:09 justinelliotmeyers

Just added subdistricts metadata to gsheets here, if it's easier to open it up for others to comment.

pratapvardhan avatar Sep 21 '22 14:09 pratapvardhan

So far I only cleaned the CS state name. The districts and subdistricts all need qc/ cleaning. I think a good way to confirm them being cleaned is by populating the LGD codes. So if a districts LGD field is populated, then we know the name is stable and finished. Same with Tehsil

justinelliotmeyers avatar Sep 21 '22 14:09 justinelliotmeyers

Should then first do this exercise on a single districts file and move here?

pratapvardhan avatar Sep 21 '22 14:09 pratapvardhan

@pratapvardhan lets finish this file first. Then we can either do all districts where we dont have subdistrict data or bring in dissolved villages to substitute the missing subdistricts (Maharashtra / Gujarat)

justinelliotmeyers avatar Sep 21 '22 14:09 justinelliotmeyers

If we can connect with the SOI and find out if they plan to release admins for missing states, that would be helpful, but not necessary right away. If not we may be able to bring in block data from the PMGSY project. But lets try to stick with one source for now.

justinelliotmeyers avatar Sep 21 '22 14:09 justinelliotmeyers

@justinelliotmeyers I meant, isn't adding LGD codes in a downstream way more efficient. First fix district codes, then it's easier to match subdistricts within the district and then move to village codes? or am I missing something?

pratapvardhan avatar Sep 21 '22 14:09 pratapvardhan

yeah, whatever way you want to work it or do it, jump in! Sometimes I do things backwards!>!? I did states first, and yes was planning on districts, then subdistricts. Wherever you can help and clean is appreciated!

justinelliotmeyers avatar Sep 21 '22 14:09 justinelliotmeyers

Have added a sheet with the subdistricts from Gujarat and Maharashtra using the village shapes https://docs.google.com/spreadsheets/d/1zu0-GMtpFuZ7XkxdfjQClWfIOGnj0GCiF-W2lL0rMsE/edit#gid=499730654

planemad avatar Sep 22 '22 19:09 planemad

@planemad @justinelliotmeyers I have recently cleaned the GJ village files, will request a pull in a while, you guys can use those files for GJ

abhasdudeja avatar Oct 27 '22 19:10 abhasdudeja

Cleaning the data upto the subdistrict level will be useful to work with the village shapes and perform other data joins. There are several gaps even at the subdistrict level which probably need to be addressed first.

Missing coverage

  • Arunachal
  • Dadra and Nagar Haveli and Daman and Diu
  • Gujarat (can be built from villages)
  • Jammu and Kashmir
  • Ladakh
  • Lakshadweep
  • Maharashtra (can be built from villages)
  • Meghalaya
  • Sikkim
  • Tripura

Regarding the Missing UT of DNH&DD. One can import these PDFs to CAD and export the boundaries to Shapefiles -> Edit & Spatially Adjust.

Village names are also given in the PDFs

DNH Plan Detailed Plan Part-2 Detailed Plan Part-1

abhasdudeja avatar Oct 28 '22 22:10 abhasdudeja