Software Engineer II (Backend + Data pipelines) Job at Scribd (San Francisco)

Job Description

We’re seeking a Software Engineer II with strong backend development experience and a passion for solving complex data challenges at scale. In this role, you’ll design, build, and optimize distributed systems that extract, enrich, and process metadata for a wide range of content. You’ll work closely with ML engineers, product managers, and cross-functional partners to integrate machine learning models and LLM-based services into production pipelines and deliver impactful, high-performance solutions. This role offers the opportunity to work on cutting-edge generative AI and metadata enrichment problems at a truly global scale.

Job Responsibility

Design and build scalable systems to extract, enrich, and process metadata from millions of documents, images, and audio content
Leverage LLMs to integrate capabilities like summarization, classification, extraction, and enrichment into metadata pipelines
Collaborate with cross-functional teams, including ML engineers and product managers, to deliver scalable, efficient, and reliable metadata solutions
Optimize and refactor existing systems for performance, scalability, and reliability
Ensure data accuracy, integrity, and quality through automated validation and monitoring
Participate in code reviews, ensuring best practices are followed and maintaining high-quality standards in the codebase
Manage and maintain data pipelines, security and infrastructure

Requirements

5+ years of professional software engineering experience
Proficiency in Python, Scala, Ruby, or similar languages
Experience designing and building distributed systems at scale
Hands-on experience building, deploying, and optimizing solutions using ECS, EKS, or AWS Lambda
Experience with infrastructure-as-code tools like Terraform (or similar)
Experience working with a public cloud provider (AWS, Azure, or Google Cloud)
Familiarity with data processing frameworks like Spark or Databricks for large-scale workloads
Proven ability to test, profile, and optimize systems for performance, scalability, and reliability
Bachelor’s degree in Computer Science or equivalent professional experience

Nice to have

Experience working with LLMs or integrating ML models into production systems

What we offer

Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees
12 weeks paid parental leave
Short-term/long-term disability plans
401k/RSP matching
Onboarding stipend for home office peripherals + accessories
Learning & Development allowance
Learning & Development programs
Quarterly stipend for Wellness, WiFi, etc.
Mental Health support & resources
Free subscription to the Scribd Inc. suite of products
Referral Bonuses
Book Benefit
Sabbaticals
Company-wide events
Team engagement budgets
Vacation & Personal Days
Paid Holidays (+ winter break)
Flexible Sick Time
Volunteer Day
Company-wide Employee Resource Groups and programs that foster an inclusive and diverse workplace
Access to AI Tools: We provide free access to best-in-class AI tools, empowering you to boost productivity, streamline workflows, and accelerate bold innovation

Scribd - All Job Offers

Select Country

Software Engineer II (Backend + Data pipelines)

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?

Software Engineer II (Backend + Data pipelines)

Software Engineer II - Data

Senior Software Engineer II - Backend - AI Search

Software Engineer II - Backend - Search

Software Engineer II - Finance Data & Experiences

Software Engineer II

Data Engineer II

Software Engineer II

Software Engineer II - Full Stack

Our AI answers in your language