Logo Logo Logo Logo Logo
0
  • No products in the cart.
Cart Total:$0.00
  • HME
  • ABT
  • BLG
  • SHP
  • JRNL
  • CNTCT
  • HME
  • ABT
  • BLG
  • SHP
  • JRNL
  • CNTCT

09 Dec Install Poppler on Databricks cluster

Posted at 02:11h in coding, databricks, ocr, python by iambdot 0 Comments

I'm working on a project where I have to use Optical Character Recognition (OCR) to extract and analyze data from scanned PDF documents. This ETL process will be running on a Databricks cluster. To accomplish this I am using the following Python libraries pdf2image and easyocr....

Read More
  • Install Poppler on Databricks cluster
    09 December, 2023
  • PySpark function chaining
    09 March, 2023
  • Python string to integer
    05 March, 2023
Archives
  • December 2023
  • March 2023
Categories
  • coding
  • databricks
  • ocr
  • pyspark
  • python

    Get in Touch

    Philadelphia, PA

    © copyright 2025 bdot
    proudly made for you by Okike Box