iD name suggestion index - asking non-English-speaking mappers to review

classic Classic list List threaded Threaded
13 messages Options
Reply | Threaded
Open this post in threaded view
|

iD name suggestion index - asking non-English-speaking mappers to review

Michał Brzozowski
https://github.com/osmlab/name-suggestion-index

So this index is used in iD and it's supposed to suggest names of
shops/amenities that are part of some chain. (like: McDonald's, Aldi,
...). But due to how it works (counting occurrences) there are generic
nouns that end up here, which is bad tagging (like shop=bakery
name=Bakery [1]) and could form a feedback loop that proliferates them
even more.

I ask non-English speakers to find anything they are sure it's a noun
and not a proper name. name-suggestions.json specifies name
suggestions and filter.json specifies what "non-names" should be
filtered.

Some examples I found that need confirmation:

Аптека
Apotheke
Boulangerie
Пекарня
Зоомагазин
Обувь
Стройматериалы
Салон красоты


[1] For the sake of example, as "Bakery" is filtered already

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk
Reply | Threaded
Open this post in threaded view
|

Re: iD name suggestion index - asking non-English-speaking mappers to review

nebulon42
Apotheke is the German noun for pharmacy.
Best, nebulon42

Am 2015-05-16 um 20:07 schrieb Michał Brzozowski:

> https://github.com/osmlab/name-suggestion-index
>
> So this index is used in iD and it's supposed to suggest names of
> shops/amenities that are part of some chain. (like: McDonald's, Aldi,
> ...). But due to how it works (counting occurrences) there are generic
> nouns that end up here, which is bad tagging (like shop=bakery
> name=Bakery [1]) and could form a feedback loop that proliferates them
> even more.
>
> I ask non-English speakers to find anything they are sure it's a noun
> and not a proper name. name-suggestions.json specifies name
> suggestions and filter.json specifies what "non-names" should be
> filtered.
>
> Some examples I found that need confirmation:
>
> Аптека
> Apotheke
> Boulangerie
> Пекарня
> Зоомагазин
> Обувь
> Стройматериалы
> Салон красоты
>
>
> [1] For the sake of example, as "Bakery" is filtered already
>
> _______________________________________________
> talk mailing list
> [hidden email]
> https://lists.openstreetmap.org/listinfo/talk
>

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk
Reply | Threaded
Open this post in threaded view
|

Re: iD name suggestion index - asking non-English-speaking mappers to review

Marc Gemis
In reply to this post by Michał Brzozowski
Boulangerie is the french word for bakery

m.

On Sat, May 16, 2015 at 8:07 PM, Michał Brzozowski <[hidden email]> wrote:
https://github.com/osmlab/name-suggestion-index

So this index is used in iD and it's supposed to suggest names of
shops/amenities that are part of some chain. (like: McDonald's, Aldi,
...). But due to how it works (counting occurrences) there are generic
nouns that end up here, which is bad tagging (like shop=bakery
name=Bakery [1]) and could form a feedback loop that proliferates them
even more.

I ask non-English speakers to find anything they are sure it's a noun
and not a proper name. name-suggestions.json specifies name
suggestions and filter.json specifies what "non-names" should be
filtered.

Some examples I found that need confirmation:

Аптека
Apotheke
Boulangerie
Пекарня
Зоомагазин
Обувь
Стройматериалы
Салон красоты


[1] For the sake of example, as "Bakery" is filtered already

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk


_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk
Reply | Threaded
Open this post in threaded view
|

Re: iD name suggestion index - asking non-English-speaking mappers to review

nebulon42
In reply to this post by Michał Brzozowski
Another thing. Is this still active? There hasn't been much activity
lately and it has also open pull requests that could have been merged
already. I'm just asking because I spotted some things to correct, but I
want to make sure that I do not waste my time because a pull request
might not get merged.

Still, quite useful project!

nebulon42

Am 2015-05-16 um 20:07 schrieb Michał Brzozowski:

> https://github.com/osmlab/name-suggestion-index
>
> So this index is used in iD and it's supposed to suggest names of
> shops/amenities that are part of some chain. (like: McDonald's, Aldi,
> ...). But due to how it works (counting occurrences) there are generic
> nouns that end up here, which is bad tagging (like shop=bakery
> name=Bakery [1]) and could form a feedback loop that proliferates them
> even more.
>
> I ask non-English speakers to find anything they are sure it's a noun
> and not a proper name. name-suggestions.json specifies name
> suggestions and filter.json specifies what "non-names" should be
> filtered.
>
> Some examples I found that need confirmation:
>
> Аптека
> Apotheke
> Boulangerie
> Пекарня
> Зоомагазин
> Обувь
> Стройматериалы
> Салон красоты
>
>
> [1] For the sake of example, as "Bakery" is filtered already
>
> _______________________________________________
> talk mailing list
> [hidden email]
> https://lists.openstreetmap.org/listinfo/talk
>

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk
Reply | Threaded
Open this post in threaded view
|

Re: iD name suggestion index - asking non-English-speaking mappers to review

SimonPoole
In reply to this post by Michał Brzozowski

Th other question is: do we actually want to support (as in suggest
spelling) non-chain names or not? Currently the majority of at least the
restaurant names do not belong to chains, they are simply frequently
used names.

Simon

Am 16.05.2015 um 20:07 schrieb Michał Brzozowski:

> https://github.com/osmlab/name-suggestion-index
>
> So this index is used in iD and it's supposed to suggest names of
> shops/amenities that are part of some chain. (like: McDonald's, Aldi,
> ...). But due to how it works (counting occurrences) there are generic
> nouns that end up here, which is bad tagging (like shop=bakery
> name=Bakery [1]) and could form a feedback loop that proliferates them
> even more.
>
> I ask non-English speakers to find anything they are sure it's a noun
> and not a proper name. name-suggestions.json specifies name
> suggestions and filter.json specifies what "non-names" should be
> filtered.
>
> Some examples I found that need confirmation:
>
> Аптека
> Apotheke
> Boulangerie
> Пекарня
> Зоомагазин
> Обувь
> Стройматериалы
> Салон красоты
>
>
> [1] For the sake of example, as "Bakery" is filtered already
>
> _______________________________________________
> talk mailing list
> [hidden email]
> https://lists.openstreetmap.org/listinfo/talk
>

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk

signature.asc (499 bytes) Download Attachment
Reply | Threaded
Open this post in threaded view
|

Re: iD name suggestion index - asking non-English-speaking mappers to review

Mateusz Konieczny-2
In reply to this post by Michał Brzozowski
On Sat, 16 May 2015 20:07:00 +0200
Michał Brzozowski <[hidden email]> wrote:

> I ask non-English speakers to find anything they are sure it's a noun
> and not a proper name. name-suggestions.json specifies name
> suggestions and filter.json specifies what "non-names" should be
> filtered.

Sklep spożywczy (Polish for grocery store)
Apteka (Polish for pharmacy)

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk
Reply | Threaded
Open this post in threaded view
|

Re: iD name suggestion index - asking non-English-speaking mappers to review

Stefan Baebler

I see "Petrol" is categorized under discardedNames, but is is in fact a valid full name of company with chain of gas pumps in Slovenia (www.petrol.si).

Are there many other similar cases that would make it worthwhile to make this functionality location-aware, dependant on regions (countries, continents, languages)?

Best regards,
Štefan

16. maj 2015 11.08 pop. je oseba "Mateusz Konieczny" <[hidden email]> napisala:
On Sat, 16 May 2015 20:07:00 +0200
Michał Brzozowski <[hidden email]> wrote:

> I ask non-English speakers to find anything they are sure it's a noun
> and not a proper name. name-suggestions.json specifies name
> suggestions and filter.json specifies what "non-names" should be
> filtered.

Sklep spożywczy (Polish for grocery store)
Apteka (Polish for pharmacy)

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk
Reply | Threaded
Open this post in threaded view
|

Re: iD name suggestion index - asking non-English-speaking mappers to review

Warin
On 17/05/2015 3:57 PM, Stefan Baebler wrote:

I see "Petrol" is categorized under discardedNames, but is is in fact a valid full name of company with chain of gas pumps in Slovenia (www.petrol.si).

Are there many other similar cases that would make it worthwhile to make this functionality location-aware, dependant on regions (countries, continents, languages)?

Petrol is in common use in Australia... an 'English' speaking country.

16. maj 2015 11.08 pop. je oseba "Mateusz Konieczny" <[hidden email]> napisala:
On Sat, 16 May 2015 20:07:00 +0200
Michał Brzozowski <[hidden email]> wrote:

> I ask non-English speakers to find anything they are sure it's a noun
> and not a proper name. name-suggestions.json specifies name
> suggestions and filter.json specifies what "non-names" should be
> filtered.

Sklep spożywczy (Polish for grocery store)
Apteka (Polish for pharmacy)

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk


_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk


_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk
Reply | Threaded
Open this post in threaded view
|

Re: iD name suggestion index - asking non-English-speaking mappers to review

John F. Eldredge
Is there a company in Australia named Petrol, or is it simply the term for the fuel? As I understand it, the question is whether Petrol is valid as a company name (apparently

On May 17, 2015 10:53:07 PM CDT, Warin <[hidden email]> wrote:
On 17/05/2015 3:57 PM, Stefan Baebler wrote:

I see "Petrol" is categorized under discardedNames, but is is in fact a valid full name of company with chain of gas pumps in Slovenia (www.petrol.si).

Are there many other similar cases that would make it worthwhile to make this functionality location-aware, dependant on regions (countries, continents, languages)?

Petrol is in common use in Australia... an 'English' speaking country.

16. maj 2015 11.08 pop. je oseba "Mateusz Konieczny" <[hidden email]> napisala:
On Sat, 16 May 2015 20:07:00 +0200
Michał Brzozowski <[hidden email]> wrote:

> I ask non-English speakers to find anything they are sure it's a noun
> and not a proper name. name-suggestions.json specifies name
> suggestions and filter.json specifies what "non-names" should be
> filtered.

Sklep spożywczy (Polish for grocery store)
Apteka (Polish for pharmacy)

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk


_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk



talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk

--
John F. Eldredge -- [hidden email] (615) 299-6451
"Darkness cannot drive out darkness; only light can do that. Hate cannot drive out hate; only love can do that." -- Martin Luther King, Jr.
_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk
Reply | Threaded
Open this post in threaded view
|

Re: iD name suggestion index - asking non-English-speaking mappers to review

John F. Eldredge
In reply to this post by Warin
Is there a company in Australia named Petrol, or is it simply the term for the fuel? As I understand it, the question is whether Petrol is valid as a company name.


On May 17, 2015 10:53:07 PM CDT, Warin <[hidden email]> wrote:
On 17/05/2015 3:57 PM, Stefan Baebler wrote:

I see "Petrol" is categorized under discardedNames, but is is in fact a valid full name of company with chain of gas pumps in Slovenia (www.petrol.si).

Are there many other similar cases that would make it worthwhile to make this functionality location-aware, dependant on regions (countries, continents, languages)?

Petrol is in common use in Australia... an 'English' speaking country.

16. maj 2015 11.08 pop. je oseba "Mateusz Konieczny" <[hidden email]> napisala:
On Sat, 16 May 2015 20:07:00 +0200
Michał Brzozowski <[hidden email]> wrote:

> I ask non-English speakers to find anything they are sure it's a noun
> and not a proper name. name-suggestions.json specifies name
> suggestions and filter.json specifies what "non-names" should be
> filtered.

Sklep spożywczy (Polish for grocery store)
Apteka (Polish for pharmacy)

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk


_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk



talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk

--
John F. Eldredge -- [hidden email] (615) 299-6451
"Darkness cannot drive out darkness; only light can do that. Hate cannot drive out hate; only love can do that." -- Martin Luther King, Jr.
--
John F. Eldredge -- [hidden email] (615) 299-6451
"Darkness cannot drive out darkness; only light can do that. Hate cannot drive out hate; only love can do that." -- Martin Luther King, Jr.
_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk
Reply | Threaded
Open this post in threaded view
|

Re: iD name suggestion index - asking non-English-speaking mappers to review

Warin
In reply to this post by John F. Eldredge
Opps... Your correct John....
It is a term used in common speech for fuel in Australia (American = gas).

On 19/05/2015 1:30 AM, John F. Eldredge wrote:
Is there a company in Australia named Petrol, or is it simply the term for the fuel? As I understand it, the question is whether Petrol is valid as a company name (apparently

On May 17, 2015 10:53:07 PM CDT, Warin [hidden email] wrote:
On 17/05/2015 3:57 PM, Stefan Baebler wrote:

I see "Petrol" is categorized under discardedNames, but is is in fact a valid full name of company with chain of gas pumps in Slovenia (www.petrol.si).

Are there many other similar cases that would make it worthwhile to make this functionality location-aware, dependant on regions (countries, continents, languages)?

Petrol is in common use in Australia... an 'English' speaking country.

16. maj 2015 11.08 pop. je oseba "Mateusz Konieczny" <[hidden email]> napisala:
On Sat, 16 May 2015 20:07:00 +0200
Michał Brzozowski <[hidden email]> wrote:

> I ask non-English speakers to find anything they are sure it's a noun
> and not a proper name. name-suggestions.json specifies name
> suggestions and filter.json specifies what "non-names" should be
> filtered.

Sklep spożywczy (Polish for grocery store)
Apteka (Polish for pharmacy)

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk


_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk


talk mailing list [hidden email] https://lists.openstreetmap.org/listinfo/talk

--
John F. Eldredge -- [hidden email] (615) 299-6451
"Darkness cannot drive out darkness; only light can do that. Hate cannot drive out hate; only love can do that." -- Martin Luther King, Jr.

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk


_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk
Reply | Threaded
Open this post in threaded view
|

Re: iD name suggestion index - asking non-English-speaking mappers to review

Paul Hartmann-2
In reply to this post by nebulon42
On 16.05.2015 20:51, nebulon42 wrote:
> Another thing. Is this still active? There hasn't been much activity
> lately and it has also open pull requests that could have been merged
> already. I'm just asking because I spotted some things to correct, but I
> want to make sure that I do not waste my time because a pull request
> might not get merged.

I second that. We could use this index for JOSM, but unfortunately the
reported issues don't get fixed and the overall quality is not good
enough at the moment.

It would be nice if the original authors or someone else would maintain
this project.

Paul



_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk
Reply | Threaded
Open this post in threaded view
|

Re: iD name suggestion index - asking non-English-speaking mappers to review

John Firebaugh
Over the weekend I merged all outstanding pull requests. These changes will get picked up automatically in the next release of iD.

Thanks for the contributions, please keep them coming.

_______________________________________________
talk mailing list
[hidden email]
https://lists.openstreetmap.org/listinfo/talk