Shoreline.io (Acquired by NVIDIA)

Software Development

Redwood City, CA 3,244 followers

Real-time automation and control for cloud operations.

View all 30 employees

About us

Shoreline provides real-time automation and control for cloud operations. Operations teams are under pressure to deliver higher and higher standards of availability, but this is impossible to achieve by fixing incidents manually. Achieving high availability requires automated remediation, but today automating fixes can take months. There’s got to be a better way. Shoreline makes it easy for operators to create automated remediations for well-known issues. You define the behavior for a single host, and Shoreline scales it out across your fleet, dealing with network faults, propagating configs, handling failures, and auditing execution. Anything you can type at the Linux command prompt, including calls to your cloud provider, Kubernetes CLI, or your own shell scripts, you can orchestrate with Shoreline. Think Splunk without lag and with the ability to take action on your system. Shoreline’s approach is built on the experience of operators who built automations to ensure the reliability of millions of instances at major cloud providers.

Website: https://shoreline.io
External link for Shoreline.io (Acquired by NVIDIA)
Industry: Software Development
Company size: 51-200 employees
Headquarters: Redwood City, CA
Type: Privately Held
Founded: 2019
Specialties: SRE, sitereliability, and devops

Products

Shoreline.io (Acquired by NVIDIA)

Incident Management Software

Shoreline is the Cloud Reliability platform — the only platform that lets DevOps engineers build automations in an afternoon, and fix issues forever. Shoreline reduces on-call complexity by running across clouds, Kubernetes clusters, and VMs allowing operators to manage their entire fleet as if it were a single box. Debugging and repairing issues is easy with advanced tooling for your best SREs, automated runbooks for the broader team, and a platform that makes building automations 30X faster. Shoreline does the heavy lifting, setting up monitors and building repair scripts, so that customers only need to configure them for their environment. Shoreline’s modern “Operations at the Edge” architecture runs efficient agents in the background of all monitored hosts. Agents run as a DaemonSet on Kubernetes or an installed package on VMs (apt, yum). The Shoreline backend is hosted by Shoreline in AWS, or deployed in your AWS virtual private cloud.

Locations

Primary

Redwood City, CA 94063, US

Get directions

Employees at Shoreline.io (Acquired by NVIDIA)

See all employees

Updates

Shoreline.io (Acquired by NVIDIA)

3,244 followers
3mo Edited
Report this post
"Shoreline.io is helping us proactively eliminate issues before they lead to downtime. Self-healing repairs are also freeing my team so they can work on new features instead of fixing the same things over and over." - Knowde

Like Comment Share
Shoreline.io (Acquired by NVIDIA) reposted this

AI & Big Data Expo World Series

6,485 followers
3mo
Report this post
We are pleased to confirm that Shoreline.io will be Bronze sponsors at the AI and Big Data Expo. Shoreline.io is a cloud reliability company focused on real-time automation and control for cloud operations teams. Shoreline enables real-time debugging and automated remediation across your fleet, significantly improving availability by reducing mean time to resolution. Shoreline.io’s CEO & Founder Anurag Gupta will present ‘DevOps is Broken’ at 3.20pm on Day 2 of the Expo. In this talk, he will describe how generative AI can effectively help teams that are struggling under the weight of over-reliance on operations consoles, manual interventions that take too long and phishing attack vulnerabilities. Register to join the expo now: https://lnkd.in/ducmhs6h
Like Comment Share
Shoreline.io (Acquired by NVIDIA)

3,244 followers
3mo
Report this post
Dive into the complexities of on-call operations. We've gathered expert insights to shed light on the challenges and costs that teams face, and offer transformative strategies for improvement. How do you currently navigate the challenges of on-call operations in your organization?

The Cost, The Challenges, and The Conquering of On-Call Operations

Shoreline.io (Acquired by NVIDIA) on LinkedIn

Like Comment Share
Shoreline.io (Acquired by NVIDIA)

3,244 followers
4mo
Report this post
The state-of-the-art way to manage access to your production databases nowadays is to give people real-time credentials. But that doesn't go far enough for database security. With real-time credentials, an operator can only log in with the credential when they need to make a change to a database, either to query it, change it, figure out what's going wrong, etc. But it makes no sense for database security. Let me tell you why. If you're giving somebody SYSDBA (if you're on Oracle or equivalent), they can do whatever they want. They can make mistakes, or perhaps they are a bad actor. What you need to do instead is: Control the capabilities that you allow people to do when they access that production database. There are only 10-15 things they need to do: - Check to see what queries are running long. - Check to see whether there's a deadlock through locking. - Check to see whether the query plan has changed for a long-running query, etc. These are not the biggest things in the world. So figure them out and give people access to just those things. Have you ever seen someone in your production ops team accidentally type “<” instead of “>” when doing the DELETE statement? I have. It doesn't make sense to allow these things to be done. You wouldn't let people do that for your applications; you have parameterized queries rather than freeform queries. Similarly, you shouldn't allow it for ad hoc access from your operators. It's the same problem. And real-time credentials do not fix that problem. Real-time credentials just mean that people have some other tool that gets in the way when they have to access production. You can do a lot better. That's my view anyway. Let me know in the comments if you agree or disagree! #cloud #cybersecurity #devops

Like Comment Share
Shoreline.io (Acquired by NVIDIA)

3,244 followers
4mo
Report this post
Our most recent blog takes a look at how to manage zombie processes in containers. Learn how poor container management can lead to orphan processes, draining resources and risking operational integrity.

Managing Zombie Processes in Containers | Shoreline.io

shoreline.io

Like Comment Share
Shoreline.io (Acquired by NVIDIA)

3,244 followers
4mo
Report this post
The biggest security vulnerability for an enterprise is its people. It doesn't matter how good an alarm system might be at my house or how many locks I have on the door if my wife loses her keys at the coffee shop. The challenge is that the Blackhat security guys innovate faster than anyone else. Gone are the days when phishing attacks were easily spotted due to poor grammar or broad targeting. Now, these attacks are hyper-personalized to the individual, using perfect grammar and relevant details that can easily fool even the most vigilant among us. Imagine I am at AWS Reinvent. And an attacker sends a text to one of my employees saying, "Hey, this is Anurag. I need you to buy 25 Starbucks cards because I'm sitting at AWS Reinvent and ran out of gifts. Please just get them and send me a link at this address." Natural language processing and GenAI can be used to mimic my language style and send this request to the right person in the company, making it look far more legitimate. It’s difficult to address this because it’s not being fronted by AI – a human is reading the messages. That's one of the security risks enterprises may face from AI. What are you doing to prevent such attacks at your company? #cybersecurity #phishingattacks #ai

Like Comment Share
Shoreline.io (Acquired by NVIDIA)

3,244 followers
4mo
Report this post
Let's talk about service graph debugging. Many people have moved to Kubernetes and microservices, which makes sense. It allows them to innovate faster, rapidly deploying smaller chunks of independent functionality with stable interfaces to other components. However, it makes production operations stupidly harder. You get spikes in latency at the top of your service. Examples: At one of my customers, a large gaming company, they've got 50 services underneath. This means 50-100 people on a call could be pointing fingers at each other, with valid reasons to state: “It’s not me. My thing is fine.” A half-hour in wasted time. It's even harder for Razorpay, a large payment provider we work with. The time taken to diagnose leads to a backlog of payments that are not getting processed or accepted. This is a situation that has to be manually rectified for every minute of outage. So what do you do? This is where service graph debugging comes in. A lot of observability tools have something in this area. They look at elements of the service graph and figure out what's going on. But the challenge is that it isn't deep. They look at metrics or logs but don't diagnose deeply to figure out where or why something's wrong. This is where Shoreline comes into play. You can build a runbook that checks each component for issues and integrate it into systems like PagerDuty or Opsgenie before paging anyone. Then, you only page the people involved with the specific microservice having the issue. Generally speaking, 90% of the time in an incident is spent on diagnostics, not repair. The repairs tend to be straightforward. Razorpay, for instance, went from hour-long calls with 50 people to maybe a five-minute call for diagnosis. Of course, there's no magic to it. If one of their banks is down, that still needs to be addressed. But at least they're not spending all the time just finding out what’s wrong. That’s why you need to automate the deep diagnostics that your developers or SREs would have done. Because they're doing that work again, and again, and again. And it's completely safe to have a machine do it. #SRE #devops #cloud

Like Comment Share
Shoreline.io (Acquired by NVIDIA) reposted this

Anurag Gupta

Founder/CEO, Shoreline.io - Solving Resiliency Challenges in Production Operations
4mo
Report this post
Let's talk about service graph debugging. Many people have moved to Kubernetes and microservices, which makes sense. It allows them to innovate faster, rapidly deploying smaller chunks of independent functionality with stable interfaces to other components. However, it makes production operations stupidly harder. You get spikes in latency at the top of your service. Examples: At one of my customers, a large gaming company, they've got 50 services underneath. This means 50-100 people on a call could be pointing fingers at each other, with valid reasons to state: “It’s not me. My thing is fine.” A half-hour in wasted time. It's even harder for Razorpay, a large payment provider we work with. The time taken to diagnose leads to a backlog of payments that are not getting processed or accepted. This is a situation that has to be manually rectified for every minute of outage. So what do you do? This is where service graph debugging comes in. A lot of observability tools have something in this area. They look at elements of the service graph and figure out what's going on. But the challenge is that it isn't deep. They look at metrics or logs but don't diagnose deeply to figure out where or why something's wrong. This is where @Shoreline comes into play. You can build a runbook that checks each component for issues and integrate it into systems like PagerDuty or Opsgenie before paging anyone. Then, you only page the people involved with the specific microservice having the issue. Generally speaking, 90% of the time in an incident is spent on diagnostics, not repair. The repairs tend to be straightforward. Razorpay, for instance, went from hour-long calls with 50 people to maybe a five-minute call for diagnosis. Of course, there's no magic to it. If one of their banks is down, that still needs to be addressed. But at least they're not spending all the time just finding out what’s wrong. That’s why you need to automate the deep diagnostics that your developers or SREs would have done. Because they're doing that work again, and again, and again. And it's completely safe to have a machine do it. #SRE #devops #cloud

1 Comment

Like Comment Share
Shoreline.io (Acquired by NVIDIA) reposted this

Mark Porter

Tech geek/Exec/Husband/Father. Passionate about great culture, deep tech, fast experiments, safe deployments. BoD member, Advisor. Eager to help women succeed in Tech. A soulmate and 5 wonderful kids. marklovestech.com
5mo
Report this post
Anurag Gupta (and his company Shoreline.io ) are likely some of the industry’s foremost experts on keeping your fleet up and running - have a good listen!

Anurag Gupta

Founder/CEO, Shoreline.io - Solving Resiliency Challenges in Production Operations
5mo

I shared some valuable insights on leadership, reliability, and team dynamics in this episode of the Unlearn Podcast. Thank you to Barry O’Reilly for having me on!

Building Reliable and Resilient Systems with Anurag Gupta

https://www.youtube.com/

1 Comment

Like Comment Share
Shoreline.io (Acquired by NVIDIA)

3,244 followers
4mo
Report this post
Observability tools rank as the 3rd-highest cost for engineering teams after their people and cloud infrastructure. That’s insane! What's even more insane? You hardly ever use the collected data. Learn more on our blog:

Anurag Gupta on Data Collection Expense | Shoreline.io

Like Comment Share

Browse jobs

Funding

Shoreline.io (Acquired by NVIDIA) 2 total rounds

Last Round

Series B Apr 28, 2022

US$ 35.0M

Investors

Insight Partners

See more info on crunchbase

Shoreline.io (Acquired by NVIDIA)

Software Development

Redwood City, CA 3,244 followers

Real-time automation and control for cloud operations.

About us

Products

Shoreline.io (Acquired by NVIDIA)

Incident Management Software

Locations

Employees at Shoreline.io (Acquired by NVIDIA)

Anurag Gupta

Founder/CEO, Shoreline.io - Solving Resiliency Challenges in Production Operations

Kazi Zaman

Chief Development Officer at Shoreline.io

Vinod Balakrishnan

Vab Goel

Venture Capitalist & Entrepreneur

Updates

Building Reliable and Resilient Systems with Anurag Gupta

https://www.youtube.com/

Join now to see what you are missing

Similar pages

Shoreline AI

NVIDIA

Dawn Capital

Insight Partners

Omi

GMetriXR

Kiavi

Checkstep

YScope

CBTW Product Design & Growth

Browse jobs

Engineer jobs

Senior Software Specialist jobs

Intern jobs

Geographic Information System Specialist jobs

Dotnet Developer jobs

Data Engineer jobs

Site Reliability Engineer jobs

Full Stack Engineer jobs

Quality Assurance Intern jobs

Senior Director of Marketing jobs

Software Engineer jobs

Marketing Director jobs

Developer jobs

Junior Architect jobs

Vice President Marketing jobs

PHP Developer jobs

Java Software Engineer jobs

Designer jobs

Vice President jobs

Vice President Communications jobs

Funding