Data Engineer Intern
Location: Fully Remote
Duration: 10–12 weeks (Summer/Fall 2026, flexible to align with academic calendar)
Hours: 15–20 per week, accommodating your class schedule
Casa Carlini is a fiscally sponsored, mission-driven independent publisher through Fractured Atlas, based in NYC, and dedicated to amplifying literature through digital innovation. Build our royalty platform! We’re seeking a Data Engineer Intern to develop an Author Royalty Platform that aggregates sales data from Amazon, IngramSpark, PublishDrive, Audible, and others to compute real-time royalties with graphs, charts, and reports for authors.
This unpaid educational internship follows U.S. Department of Labor FLSA guidelines, prioritizing your learning as the primary beneficiary with hands-on training tied to your formal education (academic credit required).
Design ETL pipelines to ingest sales data from multiple distributors (CSV/API), clean/normalize data, and compute royalties per title/author.
Receive structured training in publishing data engineering (sales aggregation, royalty formulas, dashboard visualization) with iterative feedback like classroom critiques.
Enrolled in computer science, data engineering/science, or related program; must be able to receive academic credit.
Proficiency in Python/SQL, data processing (Pandas), APIs/CSV parsing; experience with visualization tools, cloud (Google Sheets/AWS basics) a plus.
Passion for books/data, problem-solving mindset, open to feedback for remote Zoom/Trello collaboration.
Portfolio-ready royalty platform prototype credited on casacarlini.com.
Stipend-eligible via school programs (e.g., Barnard BBIP, Duke Funding, CUNY Magner)—include details in applications.
Mentorship from publishing pros, free books, references; flexible remote hours in a book-loving team. No paid role entitlement post-internship.
How to Apply: Email resume, code sample (GitHub/data project), and a brief note on your approach to royalty data to jobs@casacarlini.com. Subject: “Data Engineer Intern – [Your School/Program]”. Rolling review; start May/June 2026.
We can’t wait to see your platform crunch our book data—let’s build your data engineering skills together!