slurp icon indicating copy to clipboard operation
slurp copied to clipboard

Seek add search on `salary`

Open jbampton opened this issue 2 years ago • 7 comments

https://github.com/slurpcode/slurp/tree/main/seek

https://slurp.readthedocs.io/en/latest/seek.html

jbampton avatar Sep 30 '23 12:09 jbampton

still open ?

Dxuian avatar Oct 04 '24 18:10 Dxuian

image the salary is not a number based field so while regex and possible i would like to discuss the approach needed here
even with regex some are ranges so while we can simply present their mean i would still like to put this out @jbampton

Dxuian avatar Oct 05 '24 04:10 Dxuian

Thinking...

// We consider all units are $ AUD.

is_hour = False
if "per hour" in salary:
   is_hour = True

filter_salary = salary.replace(non digit, non `-` characters)

BaseMax avatar May 18 '25 21:05 BaseMax

All Characters Similar to Hyphen (-)

1. Standard Hyphen / Dash Characters

Character Name Unicode Use Case
- Hyphen-minus U+002D Standard hyphen or minus sign in ASCII
Hyphen U+2010 Pure hyphen (non-breaking)
Non-breaking hyphen U+2011 Used to prevent line breaks at hyphen
En dash U+2013 Ranges (e.g., 10–20), substitute for a hyphen
Em dash U+2014 Sentence breaks or parenthetical elements
Horizontal bar U+2015 Similar to em dash (Japanese typography)

2. Minus / Mathematical Symbols

Character Name Unicode Use Case
Minus sign U+2212 Used in mathematics, more precise than -
Small minus sign U+FE63 Full-width typography (Asian scripts)
Full-width hyphen-minus U+FF0D Full-width version of - (Asian typography)

3. Other Similar or Confusable Characters

Character Name Unicode Use Case
˗ Modifier letter minus sign U+02D7 Phonetic usage
Superscript minus U+207B Superscript notation
Subscript minus U+208B Subscript notation
Figure dash U+2012 Aligns figures in tabular data
Overline (short) U+23AF Mathematical overbar
Two-em dash U+2E3A Rare—used in some style guides
Three-em dash U+2E3B Very rare—archaic usage

BaseMax avatar May 18 '25 21:05 BaseMax

Here’s a single list of all the characters, separated by spaces:

- ‐ ‑ – — ― − ﹣ - ˗ ⁻ ₋ ‒ ⎯ ⸺ ⸻

BaseMax avatar May 18 '25 21:05 BaseMax

@BaseMax i have returned is this still a issue ?

Dxuian avatar Oct 01 '25 07:10 Dxuian

@Dxuian I am not working in this repo currently

jbampton avatar Oct 01 '25 15:10 jbampton