Saif Mahin

@saifmahin

Vetted Pro

4.9 (149)

Level 2

Python Developer: AI Data Extraction and Web Scraping

Bangladesh
English, Bengali
Vetted by the Fiverr Pro team

The Fiverr Pro team selected Saif Mahin for his expertise.

Vetted for

  • Data extraction

About me
Python developer with 259+ orders and 10+ years of freelancing experience, specializing in web scraping, AI-powered data extraction, and automation. I build production-grade pipelines that extract structured data from complex sources: large-scale websites, e-commerce platforms, scanned invoices, messy PDFs, and image-based documents. Tools: Python, Selenium, BeautifulSoup, Scrapy, OCR, OpenAI, LangChain, AWS, Snowflake, FastAPI, Pandas. From scraping 100K+ product listings to extracting data from 12M+ row invoice datasets, I deliver clean, ready-to-use data, every time.

USD 20/hour
Offline
Average response time: 1 hour

Check out my services

Data extraction
I will build a web scraper, data scraper, and handle web scraping
5.0 (84)
Data extraction
I will design your python data scraper for perfect extraction
5.0 (34)



Work experience

SupplyCopia

Python Developer

SupplyCopia • Full-time

Dec 2022 - Present • 3 yrs 5 mos

As a Python Developer at SupplyCopia, I build scalable data pipelines, AI-powered document processing systems, and automation frameworks that handle large-scale unstructured data with high accuracy.

What I've built and delivered:

Document & Invoice Processing:
  • Designed end-to-end invoice extraction pipelines processing 100K+ documents monthly, transforming unstructured PDFs into clean, structured datasets (Excel, CSV, Parquet).
  • Built AI-assisted parsing using OpenAI APIs and LangChain to resolve field ambiguities and boost extraction accuracy.
  • Created automated QA frameworks to catch mismatches in amounts, vendors, and invoice numbers at scale.

AI & Intelligent Systems:
  • Integrated embedding models and re-rankers (BGE) for schema mapping and intelligent column matching.
  • Contributed to AI chatbot development, connecting LLMs with structured data and knowledge bases.
  • Led automation initiatives using AWS Lambda, reducing manual effort and improving processing speed.

Web Scraping & Automation:
  • Engineered high-performance scraping systems with concurrency, retry logic, proxy rotation, and anti-bot strategies for large-scale data collection.
  • Built and deployed REST APIs using FastAPI and Flask for internal tools and data workflows.
  • Designed S3-based orchestration workflows for storing and processing structured outputs.

Data Engineering & Analytics:
  • Developed Snowflake-based data pipelines with monthly partitioned tables and consolidated reporting layers.
  • Built data reconciliation systems using fuzzy matching (RapidFuzz), normalization, and rule-based + AI logic.
  • Implemented parallel processing (ThreadPoolExecutor, batching, checkpointing) to handle thousands of vendors efficiently.

I work closely with cross-functional teams to deliver reliable, production-ready solutions that drive data accuracy, automation, and business efficiency.

149 Reviews
4.9

5 stars (143)
4 stars (6)
3 stars (0)
2 stars (0)
1 star (0)
Rating breakdown
  • Seller communication level
    5
  • Quality of delivery
    5
  • Value of delivery
    4.9
1-5 of 149 reviews

    martijnp17 • Repeat client • Netherlands (NL)
    Rating: 5
    Happy with the work Saif delivers! We've placed 18 orders at this moment of time.
    Price: up to USD 50 • Duration: 4 days • Gig: Data extraction

    garricklau • United States (US)
    Rating: 5
    he took the time to understand exactly what I needed and produced and documentation that proved his skill
    Price: USD 100-USD 200 • Duration: 1 day • Gig: Data extraction

    vindavis1 • Australia (AU)
    Rating: 5
    Saif was really good. He knows what we are after. Great communication and we got what we promised.
    Price: USD 50-USD 100 • Duration: 6 days • Gig: Data extraction

    p_dmdr • Repeat client • Netherlands (NL)
    Rating: 5
    Excellent work, just like last year. Will come back next year.
    Price: USD 200-USD 400 • Duration: 10 days • Gig: Data extraction

    leonardodurso • Azerbaijan (AZ)
    Rating: 5
    He did a great job
    Price: USD 400-USD 600 • Duration: 2 weeks • Gig: Data extraction