Use cases

Sensika’s 360 intelligence monitors 10x the media with 1/10th the human labour

This case study was originally published on the Rosette.com. Rosette® helps us to unlock the value of the text with multilingual text analysis. The platform is a product of Basis Technology.

EXECUTIVE SUMMARY

Sensika, content volume per day

Sensika is a “media seismometer” for its clients, aiming to detect the subtle “tremors” that most media monitoring tools might miss. Whether it’s knowing a company is in trouble two weeks before it files for bankruptcy,1 or surfacing pricing complaints two hours after a product launch,2 clients rely on Sensika to find relevant media mentions in the markets that matter to their business.

Based on past experiences, Sensika’s founders knew that taking a manual approach to media monitoring was a dead end. No number of humans could review the volume of content that clients require to be processed in an accurate, efficient, and timely manner.

“Big companies navigate like submarines. Using sonar, they send out a signal and listen for the feedback,” Christoff said. “Correct data that comes late is useless for them. They need correct, useful, and timely data.”

By taking an AI approach, Sensika can review more media, with higher accuracy, in a time window that is useful to their clients. However, the proposition of developing a full AI stack from scratch creates a high barrier to entry. So they turned to a vendor to help deliver the quality of results that they need, in the languages that matter to their clients. Rosette, with its complete pipeline of text analytics in 30+ languages, offered an ideal foundation for Sensika to build their advanced media monitoring algorithms.

JUST WHAT MARKETING NEEDS: SENSIKA

Content Volume per media channel

Prior to founding Sensika, the founders were involved in a project working with a team of people providing social media and 360 intelligence to two large clients, a petrochemical conglomerate and a pharmaceutical giant. The clients needed any negative signals classified and reported in a timely fashion. As the volume of data exploded with the rise of the Internet, the team—hundreds of people strong—could just barely cover a few thousand source websites.

“To aggregate, normalize, and unify this data, we were always late because of the pre-processing and then because people had to analyze the information,” Christoff said. “We had to be picky about the volume of content we let into processing, so we missed a lot and became more and more irrelevant.”

The financial means were available to try every social listening and media intelligence monitoring platform on the market, and Christoff did.

“Big companies navigate like submarines. Using sonar, they send out a signal and listen for the feedback. Correct data that comes late is useless for them. They need correct, useful, and timely data.”

From that experience, Christoff and his co-founders started Sensika in 2012, which today monitors 900,000+ websites, social media, 2,500+ TV, radio and print in near real-time with a team of only 50. Harvesting data from these multiple sources daily, Sensika provides a wide range of metrics and alerts, including product intelligence, topic analysis, digital channel performance management, early crises alerting, campaign ROI, R&D intelligence, and Voice of the Customer analysis. Marketing and PR departments and agencies rely on the “360 intelligence” view that Sensika delivers to make decisions from what products to offer—vis-à-vis competitors—to critical pricing adjustments just days after a product launch.

THE CHALLENGE

What Christoff learned when he tried all the commercial solutions was that data and its pre-processing were key to getting the actionable signals that clients wanted. How is my product perceived? Is it affordable or expensive? What features have we seemed to nail exactly? Is there a PR crisis brewing?

Unlike other media monitoring providers, Sensika does their own data harvesting and specialized pre-processing of the source data. Part of this critical pre-processing is metadata extraction, including:

  • Location and timestamp of the news.
  • The social media user who posted.
  • Mentions of people, places, and organizations in the text.
  • Topics mentioned (e.g., Davos conference, G7 Summit, the launch of the newest iPhone, etc.).
  • Key phrases.
  • Concepts.
  • 60+ more types of metadata.

To automate the data collection, analysis, and reporting, Sensika sought:

  • Reliable entity extraction (i.e., finding mentions of people, products, places, and organizations).
  • Foundational text analytics (i.e., the ability to tokenize text into words and normalize characters).
  • Broad language support, particularly for processing complex Arabic script languages.

THE SOLUTION

In the Sensika pipeline, the entities extracted are used to filter search results and drill down to find insights. For example, a search on a new iPhone model will be displayed with filters dynamically generated based on the brands and companies appearing in the results. Thus still-unknown competitors and comparisons against the iPhone are revealed. The data is then classified and tagged with entity-level sentiment analysis. Sensika’s proprietary knowledge graph uncovers relationships between entities and is constantly updated in near real-time.

In this way, Sensika is able to uncover sometimes startling revelations. Christoff relates a period when the stock market news was relentlessly negative. But out of thousands of reports from stock market exchange news, Sensika detected one petrochemical firm that was getting positive press—but it was buried as the fourth or fifth section of a multi-topic article hidden in the back pages of search results. Sensika was able to report on this finding in 4-6 hours to the client, who then had human analysts confirm the report. “Stock exchange people want correct, precise and truthful information fast—always,” Christoff said. “This [example] creates huge credibility for our technology.”

When Sensika started looking for entity extraction, they considered open source NLP packages, but while they were good for English, support for other languages wasn’t enough for their needs. “Our clients are global and especially interested in news that’s related to their business or activity. We have commercial and government customers in Europe and the Middle East, so data harvesting shifts our language analytics requirements.” Christoff said. “ The multilingual coverage that Basis Technology provides is perfect for a company like ours who serves clients with global needs.”

“Our clients are global…so data harvesting shifts our language analytics requirements. The multilingual coverage that Basis Technology provides is perfect for a company like ours who serves clients with global needs.”

~ Konstantin Christoff, Sensika CEO & Co-Founder

“We looked at the cost of developing it ourselves, and clearly we lost and bought it from you [Basis Technology],” Christoff laughed. “We prefer to benefit from great algorithm providers like you guys, but stop at some point and decide what is really strategic to develop ourselves.”

Rosette’s foundational linguistic analysis allows for much more complex and precise insight extraction.

“These capabilities contribute substantially to distinguish us from the more lightweight providers who rely on feeds from the same source presented only with different ‘pretty UIs,’” Christoff said. “We are a tech provider rather than a UI provider.”