Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create lexical statistics from ESCO skills #51

Open
ioggstream opened this issue Jul 28, 2024 · 0 comments · May be fixed by #60
Open

Create lexical statistics from ESCO skills #51

ioggstream opened this issue Jul 28, 2024 · 0 comments · May be fixed by #60
Assignees
Labels
documentation Improvements or additions to documentation good first issue Good for newcomers

Comments

@ioggstream
Copy link
Contributor

ioggstream commented Jul 28, 2024

I expect

To have a lexical summary of ESCO skills labels (both label and altLabel) that includes:

  • average number of words for each skill
  • number of unique lemmas for each skill
  • number of total lemmas

Example

labels =  ['assure customer satisfaction',
  'customer satisfaction guarantee',
  'ensure customer satisfaction',
  'guarantee customer satisfaction',
  'guaranteeing customer satisfaction',
  'promise customer satisfaction',
  'provide customer satisfaction',
  'to guarantee customer satisfaction']
  • average number of words for each skill: 3.125
  • number of unique lemmas for each skill: 8
  • lemmas histogram for each skill:
[('customer', 8),
 ('satisfaction', 8),
 ('guarantee', 4),
 ('promise', 1),
 ('to', 1),
 ('ensure', 1),
 ('provide', 1),
 ('assure', 1)]
@ioggstream ioggstream added documentation Improvements or additions to documentation good first issue Good for newcomers labels Jul 28, 2024
@ioggstream ioggstream moved this to In Progress in esco-playground Jul 28, 2024
masciotta02 added a commit to masciotta02/esco-playground that referenced this issue Jul 30, 2024
masciotta02 added a commit to masciotta02/esco-playground that referenced this issue Jul 31, 2024
masciotta02 added a commit to masciotta02/esco-playground that referenced this issue Aug 3, 2024
masciotta02 added a commit to masciotta02/esco-playground that referenced this issue Aug 7, 2024
masciotta02 added a commit to masciotta02/esco-playground that referenced this issue Aug 8, 2024
masciotta02 added a commit to masciotta02/esco-playground that referenced this issue Aug 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation good first issue Good for newcomers
Projects
Status: In Progress
Development

Successfully merging a pull request may close this issue.

2 participants