Site Reliability Engineer II - United States

April 10, 2023
Offered Salary: $126,000
Working address:N/A
Contract Type:Other
Working Time: Negotiable
Working type:N/A
Ref info:N/A

We are driven by the belief that Artificial Intelligence is mankind's greatest invention. It is the key to building a safer, more vibrant, transparent, and empowered society. We are determined to be an active contributor to shaping our future for the better. We care about the ethical implications of AI and the prosperity and well-being of all individuals, as well as the growth and continued successes of our employees, customers, and partners.

Veritone's mission today is more important than ever. We're here to democratize AI and empower every organization and every person with it. What started in 2014 as the idea of providing unified access to hundreds of cognitive engines through one common software infrastructure has evolved into the world's first AI operating system, aiWARE, which orchestrates a diverse ecosystem of cognitive engines to power intelligent automation for both commercial and government organizations. As we progress, we will continue to move humans from “in” to “on” to “out of the loop” to help them accelerate workflows, save time and costs, and uncover new insights and opportunities.

What You'll Do
  • Deploy and maintain a resilient, secure, and efficient SaaS application platform to meet established SLAs.
  • Automate monitoring, management, and incident response to achieve an auto-remediation system.
  • Monitor site stability and performance and troubleshoot site issues.
  • Scale infrastructure to meet rapidly increasing demand.
  • Manage cross-functional requirements working with Engineering, Product, Services, and other departments.
  • Collaborate with developers to bring new features and services into production.
  • Independently design and develop tools to aid in operations and automation as well as work jointly with other team members to deliver innovative solutions to complex business and technical challenges.
  • Provide deployment and operations support for multi-tiered distributed software applications.
  • Estimate engineering effort, plan implementation, and roll out system changes that meet requirements for functionality, performance, scalability, reliability, and adherence to development goals and principles.
  • Collaborate in a fast-paced environment with multiple teams (software development, release management, build and release, etc.) in a dynamic, entrepreneurial organization.
  • Defining how the behavior of large-scale systems can be achieved
  • Measuring and achieving reliability through engineering and operations work
  • Monitoring and alert development, documentation and management with the goal of creating an auto-remediation system
  • Adapting security controls to product not typically native to GA releases
  • Developing automation methods to extend standard deployment pipelines for bespoke implementations
  • Patching, policy enforcement, and audit of production systems
  • Driving the Disaster Recovery process

What You'll Need
  • Expertise with Terraform and/or Ansible.
  • Knowledge of JavaScript, Go, or other programming languages
  • 5+ years of professional Linux systems and software management experience
  • Expertise with Infrastructure-as-Code including Ansible and Terraform
  • Knowledge of programming languages including Go, Node.js, and Java
  • Experience with managing infrastructure within Azure, GCP and AWS
  • Expertise with monitoring and alerting systems including Prometheus, Grafana
  • Strong scripting skills for systems and data-driven solutions
  • JIRA experience for project/task management
  • Extensive experience in troubleshooting large-scale distributed systems.
  • Strong background working in AWS, GCP, Azure and general Linux environments.
  • Comprehensive background in monitoring and alerting systems within auto-remediation systems.
  • Proven examples of standardizing security controls across large-scale systems
  • Comfort working within project/task management platforms

Systems and Tools
  • Cloud platforms including: Azure, GCP and AWS
  • Infrastructure coding languages: Terraform, CloudFormation, Ansible, Puppet
  • CI/CD: experience working with and supporting build and deploy pipelines and tools: Jenkins, GitHub Actions, Rundeck
  • Datastore management and query skills: Postgres, MySQL, Mongo, Elasticsearch, Solr
  • Container orchestration platforms: Docker, Kubernetes, EKS, AKS
  • Familiarity with coding languages including: Go, Node.js, Java, Python
  • Monitoring/Alerting Tools: Prometheus, Grafana, VividCortex, Runscope, CloudWatch, Monitor, VictorOps
  • OS and Container Hardening: STIG, CIS, SELinux, IPTables, FIPS 140-2
  • JSON data structures and database schemas
  • API query languages: REST, GQL (GraphQL)

Bonus Points If
  • Bachelor's degree in Computer Science or related field
  • Have worked in regulated or public sector environments through development and assessment of cloud-based solutions
  • Worked with, developed, or supported continuous integration/continuous deployment systems
  • Have concrete examples ready to present for creating auto-remediation systems
  • A competitive compensation package
  • Equity Grant(s)
  • Employee Stock Purchase Plan (ESPP)
  • Remote first + Hybrid workplace
  • VERI Communities (Affinity Groups) & Belonging
  • Empowerment to build your career journey at Veritone
  • Flexible (Paid) Time Off
  • Benefits Program: medical, dental, vision, 401K matching, and more!
  • Mental health awareness and support
  • An opportunity to be a part of the next big thing in artificial intelligence!
  • Loves learning & continuous growth; stays current on marketing trends
  • Can juggle multiple projects, priorities, and deadlines with a positive attitude
  • Comfortable in a fast-paced, small company environment
  • Collaborative and always contributing value
  • Driven to win as a team
  • Remote first workplace

Check us out!

Veritone is a leading provider of artificial intelligence (AI) technology and solutions. The company's proprietary operating system, aiWARE, orchestrates an expanding ecosystem of machine learning models to transform audio, video and other data sources into actionable intelligence. We love to continuously grow while staying ahead of trends and creating structure in an unstructured world.

If you've made it this far and align with our goals, we look forward to reviewing your qualifications!

Our company provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability or genetics.

(Colorado & California Only): Minimum annual salary of $126,000.00. This base pay is for illustrative purposes only and will be determined based on skills and experience comparable to the job requirements. This position may be eligible for additional compensation and benefits including but not limited to: incentive compensation; health benefits; retirement benefits; life insurance; paid time off; parental leave and benefits; and other employee perks and benefits.

Note: Disclosure as required by SB19-085 (8-5-20) of the minimum salary compensation for this role when being hired in Colorado.



