Clio is the general leader in legal AI technology, empowering legal professionals and law firms of every size to work smarter, faster, and more securely. We are transforming the legal experience for all by bettering the lives of legal professionals while increasing access to justice.
Summary
Clio is seeking a Content Developer Engineer in JavaScript to help build and maintain reliable data-collection pipelines that gather information from the public web and partner sources. This role is essential to growing the company’s legal-content moat by ensuring acquisition systems are accurate, scalable, up to date, and resilient to change. In this role, you will design and operate crawlers and scrapers, handling diverse content types—including HTML, JSON, XML, PDFs, and images—to transform extracted data into structured formats used downstream by our systems. You will collaborate closely with engineering and product stakeholders to deliver high-quality content pipelines that power Clio’s research features.
Pipeline, Infrastructure & Technology Stewardship
- Build, maintain, and improve web crawlers and scrapers using modern Node.js tooling such as Puppeteer and Playwright.
- Implement robust scraping strategies for both static and dynamic websites, utilizing browser automation where required.
- Work confidently with HTTP and FTP protocols, handling requests/responses, authentication, headers, caching, and rate limiting.
- Ensure content pipelines are resilient by implementing structured error handling, retries, and schema validation.
Data Normalization & Transformation
- Parse and transform inputs from multiple formats, including HTML/XHTML, XML, JSON, PDFs, Word documents, and images.
- Convert extracted content into pre-defined schemas with consistent validation to ensure data integrity for downstream systems.
- Work with various data storage systems, including both relational and NoSQL databases, to manage acquired datasets.
Quality Assurance & Documentation
- Write unit and integration tests for crawlers and parsers to maintain quality standards and ensure deterministic runs.
- Produce clear technical documentation, including setup instructions, runbooks, and troubleshooting guides for edge cases.
- Contribute to peer code reviews and maintain test fixtures to make systems easier to operate and extend.
Collaboration & Project Stewardship
- Partner with engineering and product leads to clarify requirements, share progress, and raise risks early in the development cycle.
- Use project management tools like DevOps or Linear to plan, track, and communicate work effectively.
- Demonstrate the ability to work autonomously and turn high-level requirements into executable tasks.
Experience
- 1–3 years of hands‑on experience (or equivalent through degree/bootcamp) in JavaScript, TypeScript, and Node.js.
- Practical experience using web scraping libraries such as Puppeteer or Playwright and HTTP clients like fetch or axios.
- Proven experience parsing and transforming structured or semi‑structured data (JSON, XML, HTML).
- Excellent communication and documentation abilities in English.
What you will find here
Compensation is one of the main components of Clio’s Total Rewards Program. We have developed a series of programs and processes to ensure we are creating fair and competitive pay practices that form the foundation of our human and high-performing culture.
- Competitive, equitable salary with top-tier health benefits and dental coverage.
- Hybrid work environment, with the expectation for local Clions (Barcelona, London, Vancouver, Calgary, Toronto, Dublin, and Sydney) to be in the office at least twice per week.
Diversity, Inclusion, Belonging, and Equity (DIBE) & Accessibility
Our team shows up as their authentic selves, and is united by our mission. We are dedicated to diversity, equity, and inclusion. We pride ourselves on building and fostering an environment where our teams feel included, valued, and enabled to do the best work of their careers, wherever they choose to log in from. We believe that diverse perspectives, skills, backgrounds, and experiences lead to higher‑performing teams and better innovation. We are committed to equal employment, and encourage candidates from all backgrounds to apply.
Clio provides accessibility accommodations during the recruitment process. Should you require any accommodation, please let us know, and we will work with you to meet your needs.
Learn more about our /careers. We are a Human and High‑Performing AI company, meaning we use artificial intelligence to improve all of our operations. In recruitment, AI helps us streamline the process for greater efficiency. However, we have built our systems to ensure that a human always reviews AI‑generated output, and we never make automated hiring decisions.
Disclaimer: We only communicate with candidates through official @clio.com email addresses.
#J-18808-Ljbffr