You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Cato P.CP

Cato P.

Freelance Data Engineer | Python, Spark

€350/day
Barcelona, ES
3-7 years

Average response time: 1 hour

About Cato

My Toolkit & Experience

  • Python & PySpark: Extensive use of PySpark for building distributed data pipelines. Solid understanding of Spark’s lazy evaluation model and performance best practices (e.g., avoiding premature collect/write operations).
  • Spark: Knowledgeable in Spark's core concepts and optimization strategies. While recent experience is Python-based.
  • Big Data & Data Lakes: Hands-on experience with HDFS, Hive (structured queries), and managing silver/golden layer transformations in large-scale systems.
  • Cloud Platforms: Worked with Google Cloud (BigQuery, Cloud Storage) and Azure (Data Factory, Postgres) for data integration and reporting pipelines.
  • Infrastructure & CI/CD: Practical knowledge of GitLab CI/CD, Docker, containerized PostgreSQL, and Django applications. Applied version control strategies, branching policies, and regression testing workflows.
  • Monitoring & Debugging: Experienced in pipeline reliability, error detection, and data quality checks (e.g., malformed data parsing, separator mismatch, and alerting via logs).
  • Security: Follows the principle of least privilege for role-based access management. Advocates for cost-aware, secure deployments.

Knowledge Sharing & community

I run a trilingual blog (EN/ES/FR) focused on Data Engineering and AI, hosted on AWS. I use it to demystify complex topics for a broader audience and sharpen my own understanding by teaching others.


What I Offer
  • ETL/ELT Pipelines: Design, implement, and monitor data pipelines from ingestion to delivery, with quality and performance in mind.

  • Reliable Systems: Strong focus on scalable architecture, fault isolation, and production-ready code.
  • Clear Communication: Transparent updates and collaborative approach across stakeholders and teams.


  • French

    Fluent

  • Spanish

    Native or bilingual

  • English

    Fluent

Can work on-site
Barcelona (up to 50km)

Experience

  • Catobyte
    Freelance Data Engineer & NLP enthusiast
    DIGITAL AND IT
    March 2025 - Today (1 year and 3 months)
    I help teams build practical data workflows using Python, Spark, and cloud tools. I have hands-on experience with batch data pipelines, file format conversion (CSV/Excel to Parquet/ORC), and cloud storage systems like AWS S3 and BigQuery. I've worked with both on-premise clusters and cloud environments to prepare data for analysis and reporting.

    My strength lies in simplifying complex problems and delivering clean, efficient solutions. I’m also diving into NLP and deep learning, currently exploring real-world applications using tools like HuggingFace and PyTorch.

    I'm looking for freelance projects—especially ones with an NLP or AI angle where I can contribute while continuing to learn.
  • Sabbatical leave
    Independent projects
    DIGITAL AND IT
    January 2024 - March 2025 (1 year and 2 months)
    Design and Development of a Technology Blog: Created and managed a blog focused on the topics of data engineering, artificial intelligence, and software development. Wrote in-depth articles for a non-technical audience on advanced concepts in data engineering and AI python HTML / CSS / AWS (Route 53 & S3)
  • Corum l'Épargne
    Data Engineer
    January 2023 - January 2024 (1 year)
    Paris, France
    Développement et optimisation de scripts Azure Data Factory pour créer et maintenir des flux de données .J'ai travaillé avec SQL et des bases de données relationnelles pour la maintenance et le développement de l'entrepôt de données de l'entreprise, en me concentrant sur la vérification et la transformation des données financières. Azure Data Factory / Azure Data Factory / Python

Recommendations

Be the first to recommend Cato

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Master
    Université Paris-Saclay Télécom ParisTech
    2017
    Master
  • M1 Ingénierie Logicielle
    Université de Rennes 1
    2014
    M1 Ingénierie Logicielle

Skill set

Categories