Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

데이터 품질 관리 #42

Open
2 tasks done
heehehe opened this issue Feb 17, 2024 · 0 comments
Open
2 tasks done

데이터 품질 관리 #42

heehehe opened this issue Feb 17, 2024 · 0 comments
Assignees
Labels

Comments

@heehehe
Copy link
Owner

heehehe commented Feb 17, 2024

  • 이미 추출된 채용공고 (중복되지 않도록)
  • 여러 사이트에 올라온 동일 공고 (중복되지 않도록)
    • 생각보다 많이는 없는듯한..?
  • 마감공고 지난 케이스 - 데이터에서는 그대로 두고, TODAY 기준으로 대응되도록..?
  • job name 통합 (rule-based로?)
    • 빅데이터 엔지니어 & 데이터 엔지니어
  • tech stack 통합
    • Python & python, Aws & AWS - 모두 lowercase로 dbt에서
    • "C / c++" -> C, C++ 분리시키기
    • 파이썬 & Python
@heehehe heehehe added the data label Feb 17, 2024
@heehehe heehehe self-assigned this Feb 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: Todo
Development

No branches or pull requests

1 participant