Russia: Public bodies from government contract system #67

Open
antontarasenko opened this issue Apr 28, 2015 · 6 comments
Labels
Data Data sources and ingestion automation

Comments

@antontarasenko

The website http://clearspending.ru/opendata/ collects data from the Russian government procurement system. The latest dataset, http://clearspending.ru/download/opendata/customers-20150219.json.zip, contains about 274K organizations controlled by the government (SOEs and government units).

Example:

{
  "kpp": "614901001",
  "fax": "7-86385-30387",
  "contracts223Count": 3,
  "phone": "7-86385-30387",
  "shortName": "МУП \"Редакция газеты \"Наш край\"",
  "contracts223YearStats": {
    "2015": {
      "contractsCount": 0,
      "contractsSum": 0
    },
    "2014": {
      "contractsCount": 3,
      "contractsSum": 1145205.92
    },
    "2013": {
      "contractsCount": 0,
      "contractsSum": 0
    },
    "2012": {
      "contractsCount": 0,
      "contractsSum": 0
    }
  },
  "ogrn": "1026102198176",
  "okato": "60232501000",
  "regionCode": "61",
  "inn": "6149004988",
  "allNames": [
    "Муниципальное унитарное предприятие \"Редакция газеты \"Наш край\" Миллеровского района"
  ],
  "contracts223Sum": 1145205.92,
  "postalAddress": "346130, Ростовская, Миллерово, 20 лет РККА, дом 41",
  "legalAddress": "346130, Ростовская, Миллерово, 20 лет РККА, дом 41",
  "fullName": "Муниципальное унитарное предприятие \"Редакция газеты \"Наш край\" Миллеровского района"
}

Would anyone like to add it here?
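As a rough starting point for ingestion, the sketch below reduces a record like the example above to a handful of fields. The output keys (`name`, `abbreviation`, `address`, `jurisdiction_code`, `identifiers`) and the function name `to_public_body` are hypothetical and are not taken from the publicbodies.org schema; the input is a trimmed copy of the example record, since the layout of the full dataset file is not described here.

```python
import json

# Trimmed copy of the example record from this issue (raw string keeps the JSON escapes intact).
RECORD_JSON = r'''
{
  "kpp": "614901001",
  "ogrn": "1026102198176",
  "okato": "60232501000",
  "regionCode": "61",
  "inn": "6149004988",
  "shortName": "МУП \"Редакция газеты \"Наш край\"",
  "fullName": "Муниципальное унитарное предприятие \"Редакция газеты \"Наш край\" Миллеровского района",
  "legalAddress": "346130, Ростовская, Миллерово, 20 лет РККА, дом 41"
}
'''


def to_public_body(record: dict) -> dict:
    """Map one clearspending customer record to a flat row.

    The output field names are illustrative assumptions, not a confirmed schema.
    """
    return {
        "name": record["fullName"],
        "abbreviation": record.get("shortName", ""),
        "address": record.get("legalAddress", ""),
        "jurisdiction_code": "RU",
        # Keep every official identifier that is present, so an id can be chosen later.
        "identifiers": {
            key: record[key] for key in ("inn", "ogrn", "kpp") if key in record
        },
    }


row = to_public_body(json.loads(RECORD_JSON))
print(row["identifiers"])
```

Contract statistics such as `contracts223YearStats` are left out on purpose, since they describe procurement activity rather than the body itself.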

@rufuspollock
Member

@danfowler would you be interested in taking a look here? PublicBodies is another nice DB that it would be good to find an ongoing maintainer for :-)

@antontarasenko
Author

@ivbeg Can we use this data?

@ivbeg

ivbeg commented Apr 28, 2015

Sure, you can use it. A list of government suppliers is also available at http://clearspending.ru/download/opendata/suppliers-20150219.json.zip, but that data is often outdated.

We also scraped all non-profit organizations: http://data.prongo.ru/dataset/nko-register-20140716. It's 1 year old but usable, and covers about 300k organizations.

Before using the data, though, it's important to know how it's organized. Russian public entities have multiple unique identifiers: INN, OGRN, OKPO, and a special identifier for non-profit organizations.

@augusto-herrmann added the Data (Data sources and ingestion automation) label on Jan 19, 2016
@augusto-herrmann
Collaborator

@ivbeg, could you elaborate a little bit more about the multiple unique identifiers available for Russian public entities?

What do you suggest we use for ids? Most countries in publicbodies.org use a slug generated from the full public entity name, some use an acronym and Italy uses a number. If we use something based on the name or acronym, should the slug be based on the original Cyrillic or a Roman alphabet transcription (as many Russian sites do when generating slugs for URLs)? For comparison, the Greek data uses the original Greek alphabet for ids here.
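To make the two slug options concrete, here is a minimal sketch that builds both a Cyrillic slug and a romanized slug from the same name. The transliteration table is a simplified, hypothetical one (not GOST, ISO 9, or any other official scheme), and the function names are illustrative only.

```python
import re

# Simplified, hypothetical Cyrillic-to-Latin table; not an official transliteration standard.
TRANSLIT = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "е": "e", "ё": "yo",
    "ж": "zh", "з": "z", "и": "i", "й": "y", "к": "k", "л": "l", "м": "m",
    "н": "n", "о": "o", "п": "p", "р": "r", "с": "s", "т": "t", "у": "u",
    "ф": "f", "х": "kh", "ц": "ts", "ч": "ch", "ш": "sh", "щ": "shch",
    "ъ": "", "ы": "y", "ь": "", "э": "e", "ю": "yu", "я": "ya",
}


def slug_cyrillic(name: str) -> str:
    """Lowercase the name and join its word runs with hyphens, keeping Cyrillic letters."""
    return "-".join(re.findall(r"\w+", name.lower()))


def slug_latin(name: str) -> str:
    """Transliterate to Latin first, then keep only ASCII letters and digits."""
    latin = "".join(TRANSLIT.get(ch, ch) for ch in name.lower())
    return "-".join(re.findall(r"[a-z0-9]+", latin))


name = 'МУП "Редакция газеты "Наш край"'
print(slug_cyrillic(name))
print(slug_latin(name))
```

Either variant can collide when two bodies share a name, which is one argument for an identifier-based id instead.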

As a side note, we could really use some kind of standardization of ids in this project. I think I'll open another issue to discuss that.

@ivbeg

ivbeg commented Sep 11, 2017

@augusto-herrmann sure.

There have been major updates since then.

  1. Each Russian public body is part of the budget process, so each has a unique budget code ("record num") such as "463Ц5852", which can include Cyrillic letters. That makes it a poor fit for generating unique URLs.
  2. It's better to use the INN, a unique code for each organization in Russia. It is a 10-digit code with 9 meaningful digits and 1 check digit. It is a well-known identifier in Russia and works well as part of a URL slug.
  3. We have a much more detailed and cleaner list of all public bodies, compiled from a combination of Russian Ministry of Finance registries. It is part of our commercial project http://apicrafter.ru (we haven't found a way to do it non-commercially, and we couldn't get any grant from foreign countries in Russia). It covers about 200,000 public bodies and government/municipal entities.
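On point 2, a check like the following could catch mistyped INNs before they are used as ids. This is a sketch only: the weights are the ones commonly published for 10-digit INNs of legal entities and are an assumption here, not something stated in this thread, and the function name `is_valid_inn10` is hypothetical.

```python
# Commonly published weights for the first 9 digits of a 10-digit INN (assumption).
INN10_WEIGHTS = (2, 4, 10, 3, 5, 9, 4, 6, 8)


def is_valid_inn10(inn: str) -> bool:
    """Return True if `inn` is 10 digits and its last digit matches the checksum."""
    if len(inn) != 10 or not inn.isdigit():
        return False
    digits = [int(ch) for ch in inn]
    # Weighted sum of the 9 meaningful digits, reduced mod 11 and then mod 10.
    checksum = sum(w * d for w, d in zip(INN10_WEIGHTS, digits)) % 11 % 10
    return checksum == digits[9]


# INN taken from the example record at the top of this issue.
print(is_valid_inn10("6149004988"))
```

Keeping the INN as a string rather than an integer preserves leading zeros, which matters if it ends up in a URL slug.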

We could provide the data for publicbodies.org; we would only ask that you mention our APICrafter project if you would like to use the whole dataset.

@rufuspollock
Member

@ivbeg happy to credit APICrafter and we'd love to get this dataset integrated.
