Scribd, Inc. Logo

Scribd, Inc.

Software Engineer (Backend, Python) - Content Understanding

Posted 4 Days Ago
In-Office
Austin, TX, USA
104K-196K Annually
Mid level
In-Office
Austin, TX, USA
104K-196K Annually
Mid level
Build and optimize backend distributed systems to extract, enrich, and process metadata at scale. Integrate LLMs and ML models into production pipelines, ensure data quality and monitoring, collaborate with ML engineers and product, and maintain infrastructure, security, and performance of metadata systems.
The summary above was generated by AI

Scribd, Inc. is on a mission to advance human understanding. Our four products — Scribd®, Slideshare®, Everand™, and Fable — help billions of people across the globe move beyond access and into insight, application, and expertise.

Culture at Scribd, Inc.

We support a culture where our employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.

We believe the best work happens when individual flexibility is balanced with meaningful community connection. Scribd Flex empowers employees to choose the workstyle and location that support their best performance, while committing to intentional in-person moments that strengthen collaboration and culture. Occasional in-person attendance is required for all Scribd, Inc. employees, regardless of location.

So what are we looking for in new team members? At Scribd, Inc., we hire for “GRIT.” Traditionally defined as the intersection of passion and perseverance toward long-term goals, GRIT reflects the mindset we expect from every employee. For us, it also serves as a practical framework for how we work: setting and achieving Goals, delivering Results within your role, contributing Innovative ideas and solutions, and strengthening the broader Team through collaboration and attitude.

This posting reflects an approved, open position within the organization.

About the team:

The ML Content Understanding team powers metadata extraction, enrichment, and content understanding across all Scribd brands. We process hundreds of millions of documents, billions of images, and deliver high-quality metadata to enable content discovery and trust for millions of users worldwide.

Our systems operate at massive scale, supporting diverse datasets like user-generated content (UGC), ebooks, audiobooks, and more. We work at the intersection of machine learning, data engineering, and distributed systems, collaborating closely with applied research and product teams to deploy scalable ML and LLM-powered solutions in production.

Role Overview:

We’re seeking a Software Engineer II with strong backend development experience and a passion for solving complex data challenges at scale. In this role, you’ll design, build, and optimize distributed systems that extract, enrich, and process metadata for a wide range of content. You’ll work closely with ML engineers, product managers, and cross-functional partners to integrate machine learning models and LLM-based services into production pipelines and deliver impactful, high-performance solutions. This role offers the opportunity to work on cutting-edge generative AI and metadata enrichment problems at a truly global scale.

Tech Stack:

Our team uses various technologies. The following are the ones that we use on a regular basis: Python, Scala, Ruby on Rails, Airflow, Databricks, Spark, HTTP APIs, AWS (Lambda, ECS, SQS, ElastiCache, Sagemaker, Cloudwatch, Datadog) and Terraform.

Key Responsibilities:

  • Design and build scalable systems to extract, enrich, and process metadata from millions of documents, images, and audio content.

  • Leverage LLMs to integrate capabilities like summarization, classification, extraction, and enrichment into metadata pipelines.

  • Collaborate with cross-functional teams, including ML engineers and product managers, to deliver scalable, efficient, and reliable metadata solutions.

  • Optimize and refactor existing systems for performance, scalability, and reliability.

  • Ensure data accuracy, integrity, and quality through automated validation and monitoring.

  • Participate in code reviews, ensuring best practices are followed and maintaining high-quality standards in the codebase.

  • Manage and maintain data pipelines, security and infrastructure

Requirements:

  • 4+ years of professional software engineering experience

  • Proficiency in Python, Scala, Ruby, or similar languages

  • Experience designing and building distributed systems at scale

  • Hands-on experience building, deploying, and optimizing solutions using ECS, EKS, or AWS Lambda

  • Experience with infrastructure-as-code tools like Terraform (or similar)

  • Experience working with a public cloud provider (AWS, Azure, or Google Cloud)

  • Familiarity with data processing frameworks like Spark or Databricks for large-scale workloads

  • Proven ability to test, profile, and optimize systems for performance, scalability, and reliability

  • Bachelor’s degree in Computer Science or equivalent professional experience

  • Bonus: Experience working with LLMs or integrating ML models into production systems

At Scribd, Inc., your base pay is one part of your total compensation package and is determined within a range. Our pay ranges are based on the local cost of labor benchmarks for each specific role, level, and geographic location. San Francisco is our highest geographic market in the United States.

In the state of California, the reasonably expected salary range is between $126,000 [minimum salary in our lowest geographic market within California] to $196,000 [maximum salary in our highest geographic market within California].

In the United States, outside of California, the reasonably expected salary range is between $103,500 [minimum salary in our lowest US geographic market outside of California] to $186,500 [maximum salary in our highest US geographic market outside of California].

In Canada, the reasonably expected salary range is between $131,500 CAD[minimum salary in our lowest geographic market] to $174,500 CAD[maximum salary in our highest geographic market].

 

We carefully consider a wide range of factors when determining compensation, including but not limited to experience; job-related skill sets; relevant education or training; and other business and organizational needs. The salary range listed is for the level at which this job has been scoped. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for a competitive equity ownership, and a comprehensive and generous benefits package.

Working at Scribd, Inc.

Are you currently based in a location where Scribd, Inc. can employ you?
Employees must have their primary residence in or near one of the following cities. This includes surrounding metro areas or locations within a typical commuting distance:


United States:

Atlanta | Austin | Boston | Dallas | Denver | Chicago | Houston | Jacksonville | Los Angeles | Miami | New York City | Phoenix | Portland | Sacramento | Salt Lake City | San Diego | San Francisco | Seattle | Washington D.C.

Canada:

Ottawa | Toronto | Vancouver

Mexico:

Mexico City

Benefits at Scribd, Inc.

  • Scribd Flex (flexible work model)

  • Comprehensive health, dental, and vision coverage

  • Mental health support and disability coverage

  • Generous paid time off, including vacation, sick time, holidays, winter break, volunteer time, and sabbaticals

  • Paid parental leave and family support benefits

  • Retirement matching and employee equity

  • Learning and development programs and professional growth opportunities

  • Wellness and home office stipends

  • Complimentary access to the Scribd, Inc. suite of products

  • Enterprise access to leading AI tools

Get to Know Scribd, Inc.
About Scribd, Inc.
Life at Scribd, Inc.

We want our interview process to be accessible to everyone. You can inform us of any reasonable adjustments we can make to better accommodate your needs by emailing [email protected] about the need for adjustments at any point in the interview process.

If you apply for a job with Scribd or otherwise engage with us in connection with employment (including as an employee, contractor, or other personnel), the personal information we process in that context is subject to our Employee and Applicant Privacy Policy, which is available here.

Scribd, Inc. is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. We encourage people of all backgrounds to apply, and believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful.

Similar Jobs

6 Minutes Ago
Easy Apply
Remote or Hybrid
US
Easy Apply
80K-105K Annually
Mid level
80K-105K Annually
Mid level
Enterprise Web • Hardware • Internet of Things • Software
The Partner Manager will manage indirect sales through partners, develop Go-to-Market plans, support account executives, recruit new partners, and collaborate with internal teams.
Top Skills: Linkedin Sales NavigatorOutreachSalesforceZoominfo
6 Minutes Ago
Easy Apply
Remote or Hybrid
US
Easy Apply
95K-140K Annually
Senior level
95K-140K Annually
Senior level
Enterprise Web • Hardware • Internet of Things • Software
Manage enterprise sales for medical devices, including building relationships, navigating complex sales cycles, and closing deals while collaborating with stakeholders like Sales Engineering and Customer Success.
Top Skills: Manufacturing Execution System (Mes)Medical Device Sales
7 Minutes Ago
Easy Apply
Remote or Hybrid
14 Locations
Easy Apply
130K-180K Annually
Senior level
130K-180K Annually
Senior level
Automotive • Big Data • Insurance • Software • Transportation
The Senior Manager, GRC leads cybersecurity policies, audits, compliance frameworks, and risk governance. Collaborates with teams to enhance security integrity and compliance.
Top Skills: Compliance AutomationCybersecurityGenerative AiGrc FrameworksIso 27001Pci-DssSoc2Tisax

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account