May 19, 2020

Cleansing the complex

Oleg Rogynskyy
Oleg Rogynskyy
4 min
Cleansing the complex

Cleansing CRM data doesn’t have to be an overwhelming task - it can be an easy, manageable and efficient process, as Oleg Rogynskyy, CEO of explains.

The origin of Customer Relationship Management (CRM) can be traced back to the 1990s, when companies such as Siebel helped gradually drive the evolution of contact management software towards CRM systems. Previously, CRMs were built on hierarchical databases, but these have since been wiped out by SQL (Structured Query Language) CRMs. Since then, the likes of SalesForce have moved SQL CRM into the cloud, but the problems that inhibited the platforms 20 years ago, such as inaccurate, incomplete and untrustworthy data, still exist today. 

This is a problem that limits the true potential of CRM software. The technology was built for static data while today’s business data is, in fact, very dynamic. Information is constantly developing and so can quickly become outdated. The current use of CRM is like using flipbooks to try to watch a movie: the method has become obsolete and overtaken by newer, more efficient forms of technology. 

The main issue is that modern CRM platforms, despite their sophistication, focus primarily on processing and consuming data instead of collecting and keeping it accurate. According to Ben Horowitz, we have witnessed the demise of systems of record from the rise of AI. CRMs were built in the point-in-time sales world, meaning that they were built in the days of one-time sales, where activity data and the dynamic nature of contacts didn’t matter. Since then the world has transitioned into a continuous sales world, leading companies like Zuora and Gainsight to try to fix the point-in-time nature of CRM and successfully address data inaccuracy and duplication. 

Specialised tools

A ‘CRM Scan’ can quickly identify data quality metrics and incorporate them into an overall metric called the ‘CRM Health Score’, revealing where efforts need to be focused. This assessment sheds light on CRM fitness and, when combined with a strong understanding of how sales and marketing teams are using the activity data, elevates confidence in prioritising efforts to improve the CRM system. 

Within this process, it is paramount to focus on three primary dimensions of CRM data quality to establish the baseline: 

  1. Is the activity data complete?

  2. Is there a single representation of the activity data?

  3. Does the activity data correctly represent the real world?

Although it is possible to create the metrics internally, this would take several weeks. Not only does this discourage teams who are investing significant time in this work, but it also paralyses them as they often don’t know where to start or whether their efforts are making a difference. 

Important first steps

Identifying data duplication is another hurdle that can undermine productivity. Duplication is typically due to a lack of standard and unique identifiers for companies and the people that work for them. Despite the use of common proxies, including web domain and email addresses, these are often not unique, as the names of companies and people can change or have variations. To tackle duplicates, businesses need to: 

  1. Define duplicates 

The first step is to define what is considered a duplicate. For instance, in contacts and leads this can be email address matches, identical name matches and account associations. 

  1. Set up preventative dedupe rules in the CRM

Businesses should then use features established by Salesforce to block and prevent the creation of duplicate records. 

  1. Identify and clean existing data duplicates

The ‘CRM Scan’ can be used to identify duplicates and clean them up. This requires some planning based on the CRM system in use. There are specialised tools that make this process easier, but in some cases it can be a good step to reinforce the process by taking it offline to use spreadsheet analysis. 

  1. Implement ongoing monitoring for new duplicates

Once data duplicates have been identified and cleaned, it is important to set up preventative de-duplication rules in the CRM platform to monitor and repair duplicates. 

Quick, visible results 

Specialised scan tools, custom reports and dashboards are used to identify, clean and enrich data. This focuses on finding invalid data, such as digits or special characters in contact names, email addresses, web domains and incomplete mailing addresses. This can be done by combining spreadsheets and simple scripts to build update files for a CRM loader, as well as using a database built for this purpose. 

The timescale of this process varies depending on data quantities, the number of duplicates and the amount of data that needs cleansing. With the right tools, reliable measurement and on-going commitment, results can be visible almost immediately. 

In order to achieve this, organisations need to set targets that are tied to business priorities. This will enable businesses to communicate results, rebuild trust in the data and celebrate milestones to keep the momentum going. Benefiting from CRM data doesn’t have to be overwhelming, impossible or disheartening. It can be relatively easy, straightforward and more than satisfying. 

Share article

Jun 18, 2021

GfK and VMware: Innovating together on hybrid cloud

3 min
VMware has been walking GfK along its path through digital transformation to the cloud for over a decade.

GfK has been the global leader in data and analytics for more than 85 years, supplying its clients with optimised decision inputs.  

In its capacity as a strategic and technical partner, VMware has been walking GfK along its digital transformation path for over a decade. 

“We are a demanding and singularly dynamic customer, which is why a close partnership with VMware is integral to the success of everyone involved,” said Joerg Hesselink, Global Head of Infrastructure, GfK IT Services.

Four years ago, the Nuremberg-based researcher expanded its on-premises infrastructure by introducing VMware vRealize Automation. In doing so, it laid a solid foundation, resulting in a self-service hybrid-cloud environment.

By expanding on the basis of VMware Cloud on AWS and VMware Cloud Foundation with vRealize Cloud Management, GfK has given itself a secure infrastructure and reliable operations by efficiently operating processes, policies, people and tools in both private and public cloud environments.

One important step for GfK involved migrating from multiple cloud providers to just a single one. The team chose VMware.

“VMware is the market leader for on-premises virtualisation and hybrid-cloud solutions, so it was only logical to tackle the next project for the future together,” says Hesselink.

Migration to the VMware-based environment was integrated into existing hardware simply and smoothly in April 2020. Going forward, GfK’s new hybrid cloud model will establish a harmonised core system complete with VMware Cloud on AWS, VMware Cloud Foundation with vRealize Cloud Management and a volume rising from an initial 500 VMs to a total of 4,000 VMs. 

“We are modernising, protecting and scaling our applications with the world’s leading hybrid cloud solution: VMware Cloud on AWS, following VMware on Google Cloud Platform,” adds Hesselink.

The hybrid cloud-based infrastructure also empowers GfK to respond to new and future projects with astonishing agility: Resources can now be shifted quickly and easily from the private to the public cloud – without modifying the nature of interaction with the environment. 

The gfknewron project is a good example – the company’s latest AI-powered product is based exclusively on public cloud technology. The consistency guaranteed by VMware Cloud on AWS eases the burden on both regular staff and the IT team. Better still, since the teams are already familiar with the VMware environment, the learning curve for upskilling is short.

One very important factor for the GfK was that VMware Cloud on AWS constituted an investment in future-proof technology that will stay relevant.

“The new cloud-based infrastructure comprising VMware Cloud on AWS and VMware Cloud Foundation forges a successful link between on-premises and cloud-based solutions,” says Hesselink. “That in turn enables GfK to efficiently develop its own modern applications and solutions.

“In market research, everything is data-driven. So, we need the best technological basis to efficiently process large volumes of data and consistently distill them into logical insights that genuinely benefit the client. 

“We transform data and information into actionable knowledge that serves as a sustainable driver of business growth. VMware Cloud on AWS is an investment in a platform that helps us be well prepared for whatever the future may hold.”

Share article