Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Table extraction with tabula #14

Open
wants to merge 6 commits into
base: main
Choose a base branch
from
Open

Table extraction with tabula #14

wants to merge 6 commits into from

Conversation

haiyenvu96
Copy link
Collaborator

@haiyenvu96 haiyenvu96 commented Jun 2, 2024

  • ollama-extract.py
    • Extract table with tabula-py package
    • Save the extracted table in the file 'tables' with the format '.csv'
    • If 'tables' is not available, make directory

=============================================================

  • notebooks/debug-extract-texts-tables.ipynb
    • To play debug
    • Prove that the package camilot cannot extract any tables
    • Package tabula found 2 tables (exact in reality)

=============================================================

  • 2 examples of table extracted ('output_0.csv', 'output_1.csv')

@thinhngo-x thinhngo-x linked an issue Jun 12, 2024 that may be closed by this pull request
@thinhngo-x
Copy link
Collaborator

This could be added to the PR #21 after being merged.

Copy link

gitguardian bot commented Aug 6, 2024

⚠️ GitGuardian has uncovered 1 secret following the scan of your pull request.

Please consider investigating the findings and remediating the incidents. Failure to do so may lead to compromising the associated services or software components.

🔎 Detected hardcoded secret in your pull request
GitGuardian id GitGuardian status Secret Commit Filename
13015975 Triggered Generic Password cd7495f export_neo4j/upload_neo4j_from_json.py View secret
🛠 Guidelines to remediate hardcoded secrets
  1. Understand the implications of revoking this secret by investigating where it is used in your code.
  2. Replace and store your secret safely. Learn here the best practices.
  3. Revoke and rotate this secret.
  4. If possible, rewrite git history. Rewriting git history is not a trivial act. You might completely break other contributing developers' workflow and you risk accidentally deleting legitimate data.

To avoid such incidents in the future consider


🦉 GitGuardian detects secrets in your source code to help developers and security teams secure the modern development process. You are seeing this because you or someone else with access to this repository has authorized GitGuardian to scan your pull request.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Extract information
2 participants