Real-Time Data Analytics for Business

Founded by former Spotify data engineers in 2014, GetInData consists of a team of experienced and passionate Big Data veterans with proven track of records.

Our mission is to help data-oriented organizations to succeed using open-source and cloud technologies such as Flink, Kafka, Spark, Hadoop, Google Cloud Platfrom by providing outsourcing, consulting and training services. Our main speciality is real-time stream processing.

We’ve been already working for tens of companies ranging from fast-growing European startups to global corporations in pharmacy, FMCG, banking and media sectors. We trully focus to help our customers achieve true ROI from their data processing.

We also share our knowledge and experience by writting blog posts, speaking at internatinal conferences and local meetups and contributing back code to open-source projects such as Apache Flink or Apache Beam.

 
$5,000+
 
$50 - $99 / hr
 
10 - 49
 Founded
2014
Show all +
Warsaw, Poland
headquarters
  • Grottgera 15/1
    Warsaw, MZ 00-785
    Poland

Portfolio

Key clients: 

Data-driven companies:

  • Spotify, Truecaller, GoEuro, Freshmail, Play (the largest Polish telco), Synerise and more.
  • Undisclosed customers from telco, pharmacy, FMCG and media sectors.

Reviews

Sort by

Production Environment Design for Software Firm

"They’re quick to respond, easy to communicate with, and able to accurately understand our needs." 

Quality: 
5.0
Schedule: 
5.0
Cost: 
5.0
Willing to refer: 
5.0
The Project
 
$10,000 to $49,999
 
Sept. 2018 - Ongoing
Project summary: 

GetInData designed an AWS production environment to support a Hadoop solution. They’ve also contributed to an HDP platform development and acceptance tests.

The Reviewer
 
1,001-5,000 Employees
 
Stockholm, Sweden
Wilson Yu Cao
Development Team Manager, CSG
 
Verified
The Review
Feedback summary: 

Their consistent communication and responsiveness enable GetInData to drive the project forward. They possess comprehensive knowledge of relevant technologies and have an intuitive understanding of business needs and requirements. Customers can expect a partner that’s open to feedback.

BACKGROUND

Please describe your company and your position there.

I’m a senior architect and development team manager at CSG International, a computer software company.

OPPORTUNITY / CHALLENGE

For what projects/services did your company hire GetInData?

We needed to design a production environment to run the product in AWS.

What were your goals for this project?

We need to build a fraud detection product using Hadoop technologies on a tight deadline, but we didn’t have an experienced Hadoop expert and were struggling to find a strong consultant.

SOLUTION

How did you select this vendor?

A consultant company we worked with and trusted recommended them to us.

Describe the project in detail.

We began by discussing our strategy and explaining our business requirements. These sessions helped us develop both trust and understanding. They moved into the design part, which was split by several checkpoints where we could verify their work and adjust their focus. After, they helped us build the secured HDP (Hortonworks Data Platform) in AWS and complete acceptance tests.

RESULTS & FEEDBACK

Can you share any outcomes from the project that demonstrate progress or success?

They’ve met our expectation and goals and pushed the project forward.

How effective was the workflow between your team and theirs?

Their communication is very good. We had spot checkpoints in the design phase and daily standups during implementation and testing. Everything has gone smoothly.

What did you find most impressive about this company?

Their extensive knowledge regarding everything related to Hadoop operation tasks is outstanding. They’re quick to respond, easy to communicate with, and able to accurately understand our needs.

Are there any areas for improvement?

No, not that I can think of now.

5.0
Overall Score
  • 5.0 Scheduling
    ON TIME / DEADLINES
  • 5.0 Cost
    Value / within estimates
  • 5.0 Quality
    Service & deliverables
  • 5.0 NPS
    Willing to refer

BI & Analytics for e-Commerce Product Search Engine

“GetinData is very professional and highly skilled from a technology perspective.”

Quality: 
4.5
Schedule: 
4.5
Cost: 
4.5
Willing to refer: 
5.0
The Project
 
Confidential
 
Oct. 2017 - Mar. 2018
Project summary: 

Using Apache Kafka, Flink, and Cassandra, GetinData built a proof of concept for an e-commerce business to aggregate millions of products from sites and retail stores. They focused on speed and scalability. 

The Reviewer
 
1-10 Employees
 
Mumbai, India
Anirudha Khopade
Product Manager, E-Commerce Search Engine
 
Verified
The Review
Feedback summary: 

GetinData’s team of subject matter experts leveraged sophisticated technology. They thoughtfully articulated problem statements and found the right solutions. The client looks forward to future engagements. Maintaining a smooth workflow, the team communicated effectively. 
 

BACKGROUND

Introduce your business and what you do there.

I'm a product manager at tanglr.com. Our e-commerce business aggregates products from more than 150 websites and more than 500–1000 offline stores. We process products and display them on the B2C side for shoppers to search from this entire variety of 10 million products. 

OPPORTUNITY / CHALLENGE

What challenge were you trying to address with GetinData?

Our project required expertise with very specific technology. We hired GetinData based on their specialized technical skills. 

SOLUTION

What was the scope of their involvement?

GetinData leveraged their technical expertise to use Apache Kafka and Apache Flink technology for our solution’s pipeline process and microprocessing. We began with a small proof of concept or initial prototype project to assess how adept their knowledge was. Once we felt comfortable working with them, we decided to start a more significant project together. 

All the data is stored in Apache Cassandra. They helped us fine-tune the ecosystem to enhance the speed and scalability. They definitely imparted a substantial amount of knowledge to our in-house team.

What is the team composition? 

We typically worked with 2–3 people from GetinData.

How did you come to work with GetinData?

We found GetinData through a reference. They work with technologies that overlap significantly with the ones we use. We got in touch with them and had initial discussions about how they could use different technology to help us out. 

What is the status of this engagement?

Our project started around October 2017 and lasted approximately 3–6 months.

RESULTS & FEEDBACK

What evidence can you share that demonstrates the impact of the engagement?

I can confidently say that GetinData is a team of subject experts for Apache Kafka and Flint. We found them to be very skilled at using those technologies. They define the right problem statements for our project and found solutions. From a collaboration standpoint, we didn’t have any problems with communication. If we get a chance in the future, we will definitely engage with GetinData again.

How did GetinData perform from a project management standpoint?

We preferred to call and text them, and they cooperated with us from that regard. I didn’t find any issues with the project management. They were always online. GetinData defined the cadence of meetings perfectly and organized the project methodically.

What did you find most impressive about them?

Our project with GetinData was very focused on our core problem statement. We had a very clear bullseye to target. The team worked on a regular basis without needing too much involvement from our side.  

Are there any areas they could improve?

There’s nothing that I can think of. GetinData is very professional and highly skilled from a technology perspective. The services they provided for us were very good. 

4.5
Overall Score
  • 4.5 Scheduling
    ON TIME / DEADLINES
  • 4.5 Cost
    Value / within estimates
  • 4.5 Quality
    Service & deliverables
  • 5.0 NPS
    Willing to refer

Data Cluster for Email Services Company

"Their profound understanding of all aspects of our project amazed me." 

Quality: 
5.0
Schedule: 
5.0
Cost: 
5.0
Willing to refer: 
5.0
The Project
 
$10,000 to $49,999
 
Jan. - July 2018
Project summary: 

GetInData designed and built a complex, scalable data platform that facilitates enhanced analytics functionalities, integrating it with existing data programs and conducting thorough follow-on team training. 

The Reviewer
 
51-200 Employees
 
Kraków, Poland
Wojtek Ptak
CTO, FreshMail
 
Verified
The Review
Feedback summary: 

Completed in half the estimated time and with a fivefold improvement on data collection goals, the robust product has exponentially increased processing capabilities. GetInData’s in-depth engagement, reliability, and broad industry knowledge enabled seamless project execution and implementation.

BACKGROUND

Please describe your company and your position there.

I am the CTO at FreshMail, an email marketing and services company based in Krakow, Poland. Our employees serve thousands of primarily Eastern European business customers of all sizes. 

OPPORTUNITY / CHALLENGE

For what projects/services did your company hire GetInData?

Our core product focuses on providing an easy and friendly email marketing platform through which almost 1.5 billion messages are sent every month, requiring us to deal with a vast amount of data. Because of this, our company decided to invest in a number of large scale data projects. We hired GetInData to help us shape the right data environment and train our internal team. The projects included:

—Implementing a machine learning anti-abuse development environment (e.g. anti-spam, anti-phishing, etc.) that allows us to defend against new types of attacks and better protect our marketing and transactional email clients.

—Developing a new invoicing model to consolidate business rules and simplify usage calculation into a single action.

—Enabling streaming analytics-based marketing features for our customers, including deeper live integration with their platforms.

—Building a long-awaited data lake to support ad-hoc advanced analytics, the skills for which we have heavily invested in through training on analytical tools.

What were your goals for this project?

Our overarching goal was to design, build, and optimize a new data platform, then train our team on it and some additional topics related to the final solution. 

SOLUTION

How did you select this vendor?

With an impressive portfolio of projects implementing both open-source platforms and cloud solutions, GetInData had positioned themselves as one of the most experienced Big Data consulting companies. They also had in-depth, hands-on experience with all of the technologies under our consideration. We connected with them through a series of conversations between several of their key leadership and our technical and business stakeholders. 

Describe the project in detail.

The data cluster needed to meet a number of requirements, including: data ingestion via Apache Kafka, our standard publish-subscribe pattern; data analytics using our established tools, such as Apache Spark; analytics streaming with between 2,000 and 3,000 events per second; and an initial capacity of around 200 terabytes scalable for steady future growth.

We began by defining the appropriate requirements and identifying possible solutions, then determining the best avenue of approach. We then designed the new data platform, built the technical specifications, and acquired the necessary equipment before installing and configuring the final product. We concluded with four days dedicated to intensive internal team training.

What was the team composition?

Our system administration and data engineering teams worked with various members from GetInData, who supervised the overall concept and execution. 

RESULTS & FEEDBACK

Can you share any outcomes from the project that demonstrate progress or success?

The new infrastructure now enables the collection of almost 10,000 different events per second with real-time analytics and large-scale batch data processing. We successfully designed and built the platform within our desired timeline, which would not have been possible without GetInData. Working with them cut our installation and configuration time in half and allowed us to address advanced issues related to data security and governance that we couldn’t solve ourselves. They were also instrumental during negotiations with our hardware vendor, reducing costs and ensuring we received the most appropriate products. 

How effective was the workflow between your team and theirs?

We essentially merged into one team with GetInData, using Slack and Skype for daily communication. This helped us move through the project as smoothly as possible, even during the most intensive phase.

What did you find most impressive about this company?

They offered a tailored approach, extensive experience, and direct, friendly interactions, as well as a professional and dedicated work ethic. Their profound understanding of all aspects of our project amazed me. In addition, not only does GetInData have practical experience with the stack of our choice (Hortonworks’ Apache Hadoop distribution) but they actively participate in the open-source community by sharing their knowledge during conferences and workshops.

Are there any areas for improvement?

There are number of things we could do differently at the moment, but they mostly consist of better preparation on our end. These types of projects are already incredibly sophisticated, but we consolidated nearly all of our sources into one data lake. We could benefit from GetInData's experience by involving them more deeply into our business cases. 

5.0
Overall Score They were a pleasure to work with and I hope to do more with them!
  • 5.0 Scheduling
    ON TIME / DEADLINES
    We could always count on them.
  • 5.0 Cost
    Value / within estimates
    We completed the project within the agreed budget.
  • 5.0 Quality
    Service & deliverables
    All of our goals were achieved flawlessly.
  • 5.0 NPS
    Willing to refer
    I have already referred them!

Big Data Development for Mobile Operator

"GetInData is a relatively small agency with experienced professionals that enjoy and perform their job exceptionally well."

Quality: 
5.0
Schedule: 
5.0
Cost: 
5.0
Willing to refer: 
5.0
The Project
 
Confidential
 
Sept. 2017 - Ongoing
Project summary: 

GetInData assisted with the development of a customized platform using Big Data technologies. The platform has been implemented.

The Reviewer
 
1,001-5,000 Employees
 
Almaty, Kazakhstan
Alexey Brodovshuk
Software Development Supervisor, Kcell
 
Verified
The Review
Feedback summary: 

The platform has dramatically benefitted business and increased efficiency for subscribers. GetInData’s collaborative approach was seamless. Their attention to detail and expert code quality are noteworthy.

BACKGROUND

Please describe your company and your position there.

I’m the software development supervisor for Kcell, a large mobile operator in Kazakhstan.

OPPORTUNITY / CHALLENGE

For what projects/services did your company hire GetInData?

We were looking for a team with experience in Big Data technologies primarily in the Apache suite.

What were your goals for this project?

My business needed a custom platform that could store and process billions of events generated by subscribers.

SOLUTION

How did you select this vendor?

The creators of Apache Flink recommended GetInData.

Describe the project and the services they provided in detail.

Our teams built the solution from scratch. It’s customized to run business based on event patterns.

What was the team composition?

We worked with developers from both of our teams.

RESULTS & FEEDBACK

Can you share any information that demonstrates the impact that this project has had on your business?

During project implementation, our business gained a substantial amount of knowledge regarding the Big Data tech world. The tool works fantastically for millions of subscribers.

How was project management arranged and how effective was it?

Our teams worked collaboratively but remotely. We used Jira, Skype, Confluence, Mattermost, and GitLab. We documented as much as we could, meeting daily and providing status updates. There were a few site visits.

What did you find most impressive about this company?

GetInData is a relatively small agency with experienced professionals that enjoy and perform their job exceptionally well. Their attentiveness and code quality are impressive.

Are there any areas for improvement?

There are none that I can think of.

5.0
Overall Score
  • 5.0 Scheduling
    ON TIME / DEADLINES
  • 5.0 Cost
    Value / within estimates
  • 5.0 Quality
    Service & deliverables
  • 5.0 NPS
    Willing to refer

Feature Dev for Data Stream Processing Platform

“Their involvement allowed us to add a feature to our product.”

Quality: 
5.0
Schedule: 
5.0
Cost: 
4.5
Willing to refer: 
5.0
The Project
 
$10,000 to $49,999
 
Jan. - Mar. 2018
Project summary: 

GetInData designed and built a custom feature that allows the attachment of artifacts in a data stream processing app.

The Reviewer
 
11-50 Employees
 
Berlin, Germany
Stephan Ewen
CTO, Data Artisans
 
Verified
The Review
Feedback summary: 

GetInData delivered a robust mechanism that satisfied requirements.

BACKGROUND

Please describe your company and your position there.

I am a co-founder and the CTO of data Artisans. We provide cutting-edge data stream processing technology in real time.

OPPORTUNITY / CHALLENGE

For what projects/services did your company hire GetInData?

We needed to develop a feature for future versions of our stream processing platform.

What were your goals for this project?

Our goal was to get a robust, easy-to-maintain feature developed promptly.

SOLUTION

How did you select this vendor?

We knew GetInData from former collaborations with users of Apache Flink and stream processing technology.

Can you go into detail about the services they provided and the scope of the project?

The team designed and developed the mechanism with which artifacts such as external third-party libraries can be attached to an Apache Flink stream processing application. The artifacts then get reliably distributed as part of the dynamic scheduling of the applications work tasks—and recovered across failures—without any additional dependency to Apache Flink. The developed feature was eventually contributed back to another open-source project.

What was the team composition?

We worked with two developers, partially on site and partially remote.

RESULTS & FEEDBACK

Can you share any information that demonstrates the impact that this project has had on your business?

Their involvement allowed us to add a feature to our product, despite not having the required developer capacity in-house.

5.0
Overall Score
  • 5.0 Scheduling
    ON TIME / DEADLINES
  • 4.5 Cost
    Value / within estimates
  • 5.0 Quality
    Service & deliverables
  • 5.0 NPS
    Willing to refer