vector icon indicating copy to clipboard operation
vector copied to clipboard

Who's using Vector in production?

Open binarylogic opened this issue 5 years ago • 45 comments

Using Vector in production? Let us know in the comments!

PSA

We're looking for companies to work closely with to ensure Vector solves production use cases. If interested please email us at [email protected].

binarylogic avatar Aug 23 '19 16:08 binarylogic

At Checkbox (https://checkbox.ai) we use Vector to ship our system and container logs to S3 and Datadog Logs.

zcapper avatar Oct 17 '19 06:10 zcapper

At Kalvad, we use Vector in production for 3 customers already, sending it to OpenDistro.

Very happy with it

Edit: we switched it to some HTTP logs system based on Elixir and warp10

Edit 2 (2023/04/04): We moved to quickwit with Kafka, it is very stable, and amazingly cheap

wowi42 avatar Nov 11 '19 14:11 wowi42

At Comcast we are using vector in production for 4 teams with one team handling close to 8TB of ingest/day using vector. We are currently shipping all our logs to Elasticsearch

raghu999 avatar Aug 19 '20 13:08 raghu999

At NOS we'll be shipping all our logs from home devices to Kafka using Vector. Project's due to get into production next week.

suvl avatar Sep 17 '20 18:09 suvl

At Fundamentei—a site focused on providing Stock market financial information for Brazilian investors—we'll be sending system and container logs to Papertrail/S3.

andreyvital avatar Sep 18 '20 02:09 andreyvital

At Skiley — a platform that provides an improved experience to users of music streaming services —, I started using Vector (replacing Logstash) to forward logs from journald, gathered from multiple services, to Elasticsearch and S3. It has been a joy, and congratulations for the excellent docs!

JoaaoVerona avatar Sep 22 '20 22:09 JoaaoVerona

At BlockFi - BlockFi’s vision is to bridge the worlds of traditional finance and blockchain technology to bring financial empowerment to clients on a global scale. - we use Vector in production to ship logs generated by the host (file, journald, etc.) as well as from within containers. We plan to use it for CloudWatch and CloudTrail logs as well and maybe someday for metrics (we use Telegraf today). We ship to Humio for log/metric aggregation/storage/search/dashboards/alerting/etc. This setup replaced Papertail and Prometheus+Grafana.

gburd avatar Oct 09 '20 13:10 gburd

At Douban - Douban is a Chinese social networking service website that allows registered users to record information and create content related to film, books, music, recent events, and activities in Chinese cities.

We use Vector in production to collect Terabytes of logs(weblog, MySQL logs, etc) per day and forward them to Kafka and ElasticSearch. And we're also using Vector now to send some web server metrics to Statsd. Vector has proved to be robust and efficient in many cases 👍

windmgc avatar Nov 06 '20 03:11 windmgc

Fly.io - App hosting platform running firecracker VMs at the edge.

We use Vector in production to:

  • Transform and send our journald logs to our elasticsearch cluster
  • Capture and transform customer's apps' logs via a unix socket sink (another program sends logs there since Vector doesn't work with named pipes)

We'd love to use it even more! We're looking to replace telegraf and be able to tail from named pipes.

jeromegn avatar Dec 18 '20 18:12 jeromegn

Just here to drop a note that the next version of the Dokku OSS PaaS will include a Vector integration for log shipping.

We chose Vector over other tools for a few reasons:

  • There are a number of integrations available to end users, and as we don't control where Dokku is installed, assuming an installation of a particular solution wasn't going to cut it. We require a flexible solution that continues on with the "batteries included but removable" idea that Dokku was built on, and Vector does this quite handily.
  • Configuring global and app-specific sinks in json is fairly easy, and we managed to distill it to more or less a DSN value. Would be great to have this direct in the core, but the code to support this is easy enough for us to maintain.
  • We needed a tool that integrates directly with Docker. While Dokku supports alternative schedulers such as Kubernetes, most users of alternative schedulers will have other tools to manage logs in their system, and thus we focused on the 80% use case of Docker Local scheduling. In our initial research, this excluded tools such as Filebeat that don't have easy ways to target sinks at specific docker container labels.
  • logspout - the frontrunner from gliderlabs - is fairly unmaintained for a variety of reasons (mostly time). It has some neat features that separate it from vector, but it is better for us to hitch ourselves to well-maintained solutions vs stick our heads in the ground and pretend everything under gliderlabs is the be-all and end-all solution.
  • @binarylogic once sat through an entire dinner with me - and paid for it! - while I berated him about how log shipping was a hard problem and he couldn't do it, so now here I am eating my hat.

Usage docs are here for anyone interesed: http://dokku.viewdocs.io/dokku/deployment/logs/#vector-logging-shipping

josegonzalez avatar Jan 05 '21 04:01 josegonzalez

Sematext now makes use of Vector in Logs Discovery.

otisg avatar Jan 18 '21 21:01 otisg

Our team within Atlassian began using Vector in production a week or so ago. We saw some reductions in CPU/Memory usage compared to our old logging agent (fluentd), which is nice. The main thing we like is being able to perform unit tests on our configuration. Looking forward to a good WASM interface; we'd like to replace some lua with Rust if possible (Update; VRL suits all our needs especially with Vector 0.22!).

As for volumes, I can't give an exact number (Update: around 16 TB per day), but we're processing most of the traffic at the edge of the Atlassian cloud network... So it's a fair bit. Easily billions of events per hour.
If you use an Atlassian product and the response has a Server header with a value globaledge-envoy, it was logged by Vector 🥳

cetanu avatar Feb 02 '21 04:02 cetanu

Clever Cloud is running Vector on each VM for logs and metrics collection.

KannarFr avatar Feb 09 '21 17:02 KannarFr

SIB is using Vector in production for public schools in France. Each school has it's own vector instance for gathering logs then it sends to a central Vector which ships to Elasticsearch and Ceph S3. We were using flowgger before and are really happy with Vector

jothoma1 avatar Jun 21 '21 11:06 jothoma1

@jothoma1 am curious why the switch away from flowgger

tshepang avatar Jun 21 '21 15:06 tshepang

Robinhood is using Vector in many ways!

  • EC2 application logs -> kafka (replaced filebeat)
  • Kubernetes pods logs -> kafka (replaced fluentd)
  • Kafka -> Loki

We've had a great experience with Vector so far as it plays a larger and large role in our observability stack :)

jpdstan avatar Jun 21 '21 17:06 jpdstan

hupu.com is using Vector.

filebeat and syslog -> kafka -> vector -> kafka -> ES/clickhouse

image

kong62 avatar Jun 28 '21 03:06 kong62

@kong62 which interface do you use for view logs from ClickHouse?

Slach avatar Jun 28 '21 03:06 Slach

@kong62 which interface do you use for view logs from ClickHouse?

web develop ourselves, called Pietro and Atel

kong62 avatar Jun 28 '21 08:06 kong62

We (https://github.com/moia-dev/) just rolled it out to our Kubernetes cluster to replace fluentD as log shipper and we're super happy. One of the biggest benefits is the possibility to test the configuration.

guusvw avatar Jul 07 '21 09:07 guusvw

On adidas we're using vector to ingest logs from our CDN creating at the same time metrics from these logs https://medium.com/adidoescode/improving-your-observability-creating-metrics-from-your-logs-9ae8de9299f4

cgimeno avatar Jul 09 '21 11:07 cgimeno

@jpdstan Hello, do you use vector in kafka->loki?

wgb1990 avatar Aug 19 '21 06:08 wgb1990

Geberit is using Vector.

I started seriously testing Vector 2021-03. Before that month, essential features were missing. As I had built a log collection pipeline with Logstash for a decent number of log types, I had some ideas how to design this with Vector. So after a POC, I designed a framework for Vector config. I waited before posting here until I had published the framework. This day is today, finally ;-)

You might find the event-processing-framework useful. From what I can tell, it is the first of a kind for Vector. Note that it is somewhat opinionated. I make heavy use of the Elastic Common Schema (ECS) and YAML instead of TOML.

Vector is awesome, keep up the work!

Cc: @aswath86

ypid-geberit avatar Oct 04 '21 17:10 ypid-geberit

We are at FINAL use Vector A LOT, in several ways:

  • Vector -> Kafka
  • Kafka -> Vector
  • Vector -> Loki
  • Vector -> Main storage

We'll be happy to collaborate with you in order to solve production issues.

haimhm avatar Nov 16 '21 05:11 haimhm

At BedrockStreaming we are using it in production. As we freshly use it, we use it only to send logs to S3 for now. I am happy to help fix production issue

arthurzinck avatar Apr 07 '22 07:04 arthurzinck

At ProtonMail and ProtonVPN we are using Vector to connect Kafka->ClickHouse for one of the anomaly-detection systems.

pm5 avatar Apr 21 '22 03:04 pm5

Railway is using Vector! We use it to send our 20k+ deployment logs to both GCP Storage (for querying) and the filesystem (for streaming). I wrote a post about our general architecture when we first adopted it: https://blog.railway.app/p/building-logs-v2

gschier avatar Apr 25 '22 23:04 gschier

LINE Corp is using vector in production to deliver tons of logs and metrics !

darjeeling avatar May 26 '22 02:05 darjeeling

DataStax has just rolled out Vector as part of our production logging stack for our Astra DBaaS.

jblang avatar Jun 24 '22 15:06 jblang

UWG has been using Vector to ingest network switch logs for our 3 campuses for over a year. Vector works in tandem with Grafana Loki & Mimir as well as a MinIO cluster, all running on Docker Swarm Mode, to monitor 160 network switches and over 20,000 networked devices.

Vector, in particular, has been absolutely invaluable to us as a sort of "glue" for patching together frustratingly non-standard or otherwise proprietary syslog outputs and formatting them for aggregation and long-term indexed storage.

While our students may not see the work Vector is doing for our public institution on the backend, I'm very proud of what it's allowed us to accomplish on a a budget!

mari-arondeus avatar Jul 22 '22 23:07 mari-arondeus