AttributeError: module 'pdfminer.high_level' has no attribute 'extract_text' #297

subhrajit-mohanty · 2025-01-20T08:55:05Z

I get the following error when trying to extract the attached PDF.

FileConversionException: Could not convert 'Interstallar.pdf' to Markdown. File type was recognized as ['.pdf', '.pdf', '.fdf']. While converting the file, the following error was encountered:

Traceback (most recent call last):
  File "/opt/miniconda/envs/py311/lib/python3.11/site-packages/markitdown/_markitdown.py", line 1239, in _convert
    res = converter.convert(local_path, **_kwargs)
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/miniconda/envs/py311/lib/python3.11/site-packages/markitdown/_markitdown.py", line 490, in convert
    text_content=pdfminer.high_level.extract_text(local_path),
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: module 'pdfminer.high_level' has no attribute 'extract_text'
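To narrow this down, it may help to check which pdfminer distribution the environment actually imports. The sketch below is only a diagnostic, not a fix: it assumes (this is not confirmed anywhere in the report) that the error comes from an installed `pdfminer` package that does not provide `extract_text`, for example an outdated release or a different distribution shadowing `pdfminer.six`. The function name `diagnose_pdfminer` is hypothetical and not part of markitdown.

```python
import importlib
from importlib import metadata


def diagnose_pdfminer():
    """Report installed pdfminer distributions and whether
    pdfminer.high_level exposes extract_text (hypothetical helper)."""
    report = {"distributions": {}, "has_extract_text": False, "module_file": None}

    # Both distributions install a top-level package named "pdfminer",
    # so look up each distribution name separately.
    for dist in ("pdfminer.six", "pdfminer"):
        try:
            report["distributions"][dist] = metadata.version(dist)
        except metadata.PackageNotFoundError:
            report["distributions"][dist] = None

    try:
        high_level = importlib.import_module("pdfminer.high_level")
    except ImportError:
        # No importable pdfminer.high_level in this environment.
        return report

    # The file path shows which installation is being picked up.
    report["module_file"] = getattr(high_level, "__file__", None)
    report["has_extract_text"] = hasattr(high_level, "extract_text")
    return report


if __name__ == "__main__":
    print(diagnose_pdfminer())
```

If `has_extract_text` comes back `False`, the reported versions and `module_file` path should indicate which installation is being imported in the `py311` environment.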
