unicodia icon indicating copy to clipboard operation
unicodia copied to clipboard

Flag emojis are unsearchable

Open curya opened this issue 1 year ago • 29 comments

Searching for any flag emoji only provides results for the "regional indicator symbol" letters.

Example: Searching for 🇺🇸 only returns 🇺 and 🇸.

curya avatar Sep 16 '23 23:09 curya

This is both a problem and a feature.

  1. Emoji are unsearchable, but how to search them? Could you suggest?
  2. This is a feature, split into characters. And 🇺🇸 is actually 🇺+🇸. image

Mercury13 avatar Sep 18 '23 06:09 Mercury13

P.S. When I deleted characters between U and S, they ligated into USA flag.

Mercury13 avatar Sep 18 '23 06:09 Mercury13

@curya , maybe you have some suggestions?

Mercury13 avatar Sep 18 '23 10:09 Mercury13

P.S. When I deleted characters between U and S, they ligated into USA flag.

Deleted what???

@curya , maybe you have some suggestions?

I'm just a lowly end user so I'm probably not the right person to ask. But it seems to me that if I type "us flag", it should bring up the correct emoji. (The normal U and S, not the regional indicator symbols.) This is also the way emojipedia.org handles it. https://emojipedia.org/search?q=us%20flag

curya avatar Sep 18 '23 15:09 curya

@curya You have written “🇺 and 🇸”. When I delete “␣and␣”, I instantly get 🇺🇸. Right now emoji search is not implemented, and what problems I have:

  1. How to tell multi-character emoji from a normal single character.
  2. How to return to search list if we moved to wrong place.
  3. How to treat “us flag” if I have “flag: United States”

This is not just emoji library, it’s the entire Unicode.

Mercury13 avatar Sep 18 '23 18:09 Mercury13

I instantly get 🇺🇸 in Firefox+W10, it has Twemoji on board. If you have Chromelike+W10, you won’t get, as W10 has no flag emoji, and Chromelikes completely rely on system emoji. They just display some strange U + some strange S.

Mercury13 avatar Sep 18 '23 18:09 Mercury13

image This is what I get when typing 🇺🇸 after deleting " and ".

And like I said, I'm probably the wrong person to ask about how to solve this problem. I'm not a programmer. Maybe you can contact the people at emojipedia and ask how they did it?

Maybe a separate search box for only emojis?

curya avatar Sep 18 '23 20:09 curya

Not from programmer’s perspective, but from user’s. I see W10 + really high DPI (glad that hiDPI works somehow, it was ha-ard).

W10 does not have US emoji, so US in strange font is everything it can. And there’s no conceptual difference between blonde 👱🏻‍♀️ (which W10 can) and US flag 🇺🇸 (which W10 cannot).

So what’s the difference? My scope is the entire Unicode, they cover emoji sequences only. I can insert some temporary feature (e.g. US flag) when you paste THE EMOJI ITSELF. Not “US flag” text, but emoji itself. But it breaks a bit the feature “break into codepoints”, so I need some interface justification, e.g. image

Another interface breakup. When you click that US flag, you go somewhere, and there’s hard to return to search results. What to do? These are purely interface troubles, I can program somehow of course, but they will be incomplete.

Mercury13 avatar Sep 18 '23 21:09 Mercury13

It shouldn't matter what OS I'm on. Unicodia should display the emoji correctly. In this case, the flags. They display just fine in the Library tab: image

Pasting the emoji itself is a step in the right direction. But I feel like the program should also return the correct emoji when searching for "us flag" or "uk flag" or whatever. (Not regional indicator letters).

Just a suggestion. Maybe move the emoji search to the Library tab?

curya avatar Sep 18 '23 21:09 curya

UA, US and so on are actually two codepoints and do not fit straightly into my “break into codepoints” feature. That’s why I’m asking to devise a solution. Right now I’m thinking about data structures that will find finished multi-character emoji inside text, but that’s another thing.

Mercury13 avatar Sep 19 '23 05:09 Mercury13

@curya Just by tags you see, I took your request seriously, and thinking what to do and how to help you. I know that Unicodia has these problems.

Mercury13 avatar Sep 19 '23 12:09 Mercury13

I appreciate it! Unfortunately, I'm not sure I can help you. Maybe there should be a separate "emoji" tab so we can search flags (🇺🇸) and all variations of emojis (like 👱🏻‍♀️)?

curya avatar Sep 19 '23 18:09 curya

@curya Recently I’ve read a well-known story about Pontiac that started when the owner bought chocolate ice-cream, and refused when he bought vanilla ice-cream. It was hard to translate from owner’s language to engineer’s one. Vanilla ice-cream (default in the USA, thus term “vanilla” for original unmodded software) was in a separate fridge, and buying it was quicker. Because of carburettor deficiencies, the car failed to start in a specific time range after stopping. (VAZ owners will come: cover it with a wet rag.)

Mercury13 avatar Sep 21 '23 08:09 Mercury13

image @curya Is it OK?

Mercury13 avatar Sep 28 '23 11:09 Mercury13

I will not release until Noto runs through all their fonts. And I’ll write more.

Mercury13 avatar Sep 28 '23 11:09 Mercury13

@curya Recently I’ve read a well-known story about Pontiac that started when the owner bought chocolate ice-cream, and refused when he bought vanilla ice-cream. It was hard to translate from owner’s language to engineer’s one. Vanilla ice-cream (default in the USA, thus term “vanilla” for original unmodded software) was in a separate fridge, and buying it was quicker. Because of carburettor deficiencies, the car failed to start in a specific time range after stopping. (VAZ owners will come: cover it with a wet rag.)

I think something is lost in translation here...

@curya Is it OK?

Looks good to me. I'll have to test it when the new version is released.

curya avatar Sep 29 '23 11:09 curya

@curya Google “pontiac vanilla ice cream” About VAZ (Volga Automobile Plant): to own a car/moped in the USSR, you needed to do most of maintenance for yourself, so Soviet car-owners knew lots of lifehacks.

Noto did nothing today, waiting for monthly release.

Mercury13 avatar Sep 29 '23 15:09 Mercury13

Does this support the other multi-character emojis as well? 👱🏻‍♀️ 🧑‍🧑‍🧒 🐦‍🔥 🍋‍🟩

curya avatar Sep 29 '23 17:09 curya

image

Mercury13 avatar Sep 29 '23 18:09 Mercury13

Everything is decodable except single-character and VS16.

Mercury13 avatar Sep 29 '23 18:09 Mercury13

@curya One more search rule image

Mercury13 avatar Sep 30 '23 11:09 Mercury13

What if I type "ua flag"?

curya avatar Sep 30 '23 11:09 curya

Right now nothing. I’ll try of course.

Mercury13 avatar Sep 30 '23 11:09 Mercury13

@curya Done image

Mercury13 avatar Sep 30 '23 21:09 Mercury13

Words “flag” and “of” are flag-related keywords. “Of” is dubious (the last counts if several), and “flag” is just a keyword. So “of flag ua” and “ua flag” do the same. More than 3 words, digits → bad string, no search

Mercury13 avatar Sep 30 '23 21:09 Mercury13

Nice.

Does it also work with "ukraine flag" or "ukraine"?

curya avatar Oct 01 '23 05:10 curya

@curya Normal emoji search isn’t done yet, mostly because of special words like “person” and “flag”. But test what I made.

Mercury13 avatar Oct 01 '23 20:10 Mercury13

@curya Making an emergency fix right now, I’ll think later about true emoji search.

Mercury13 avatar Oct 03 '23 05:10 Mercury13

It seems to work very well. Thanks for making this work! :)

curya avatar Oct 04 '23 16:10 curya

@curya @MAZ06 This isn’t exactly your task, but did today…

  • You can find a character by its emoji name: e.g. both “savoring” (emoji) and “savouring” (character)

Mercury13 avatar Jun 28 '24 14:06 Mercury13