Aylien launches news analysis API powered by its deep learning tech

48
Source:   —  April 07, 2016, at 8:01 PM

“The idea for the News API is to give access to the news content that's out there enriched and in real-time to developers and data scientists,” says co-founder Parsa Ghaffari.

Aylien launches news analysis API powered by its deep learning tech

Text analysis startup Aylien, which uses deep learning and NLP algorithms to parse text and extract intel from documents for its customers, has launched a new tool specifically focused on analyzing written news content.

“The idea for the News API is to give access to the news content that's out there enriched and in real-time to developers and data scientists,” says co-founder Parsa Ghaffari. “It’s a very data and analytics centric approach to news.”

The Dublin-based startup says it’s utilizing core text analysis tech powering its existing text API product, which launched back in February two thousand fourteen — but this time it’s focusing exclusively on news content and also doing a small more of the analytical heavy lifting for its customers.

“We decided to oversimplify the utilize case a tiny bit by collecting and analyzing the news documents on our end, rather than giving them the tools to do that themselves. So this was born out of that,” says Ghaffari.

He adds that the text analysis API was already being used by news and media companies “to makes sense of news articles at scale” — so the team has presently stepped in with a tailored product to better serve that demand.

Ghaffari says they’re targeting the News API at developers, data scientists and “solution builders” in verticals such as publishing, PR, news aggregation, newsreader apps, hedge funds, media monitoring, and voice of the customer analysis solutions. So there will evidently be some overlap/cannibalization of existing Aylien users.

Its SaaS Text API product has nearly 20.000 subscribers at this point, with Ghaffari flagging up the likes of Sony, The World Economic Forum and Complex Media as “notable customers”.

While Ghaffari mentioned a map to launch a news API all the way back in two thousand-fourteenth, when TechCrunch latest spoke to him, he says the idea then was to construct a bare bones news ticker. Whereas the News API is a fully featured product in its own right — letting users perform granular look for queries — such as, for example, asking for news stories written about Donald Trump that have a negative sentiment and were published by news outlets based in Wisconsin.

The product also serves up automated summaries of retrieved news article; points to related stories; profiles social media performance; charts the vol of stories on a specific topic over time; shows sentiment breakdown; and details article length.

Users can look for for news by byline to track particular journalists’ output — a useful feature for PRs wanting to intelligently target pitches. (Rather than, *hint-hint*, repeat copy-pasting ‘I read your legend about X and thought you’d be interested in writing about Y’… )

I ask Ghaffari whether he used the news API tool to define which TC journalist to send his pitch to — and he confirms he did (as you’d hope). So, in this one example at least, the tech’s targeting relevance was fair. (It did also propose he approach my colleague Fred, who wrote the prior legend on Aylien.)

The tool draws content from a (human) compiled list of “thousands” of news sources — so, as with much AI tech, there’s still a key role for the human brain when it comes to filtering/sanity checking source material, although Ghaffari reckons this too could be automated in time.

“At the moment it’s a manually curated list of sources that we monitor,” he says, adding: “We are looking at ways to automatize source discovery… But for the initial launch we wanted to have really high quality content, we didn’t wish to have any noise in there. There are challenges if you expand the no of sources, you’re going to obtain a lot of duplicate content, a lot of low quality content… We shouldn’t do that unless we're confident we can allow measures for our users to filter that content.”

Aylien’s analysis engine currently supports six languages — including English, Spanish, German and French — which it’s hoping to ramp up to 15 by July.

Discussing how the core tech works, Ghaffari tells TechCrunch: “We utilize Deep Learning and NLP to realize news articles better, by extracting things such as mentions of entities (people, brands, organizations, products, etc), author’s sentiment, high level category and topical structure of each article, and so on, and we utilize this information in aggregate to train predictive models that can predict things such as best targets for a press release or most favorite topics within a niche, which are tremendously valuable to a publisher, journalist or a PR person.”

“Compared to classical Machine Learning, Deep Learning significantly reduces the necessity for manually labelled data, and makes it easier to ‘hop’ from one speech to another, or even from text to image and vice versa, without losing a lot of information,” he adds.

“Our proprietary NLP engine learns how to carry out multilingual speech processing, which then it applies to news content, just by looking at large volumes of text, and that makes it much more scalable, and capable of learning new languages (which is crucial in our today’s fragmented, globalized world) than what's available out there.”

In terms of competing products, he name checks the likes of Cision’s media monitoring suite, plus products from Kantar Media and Moreover Technologies, but argues that Aylien’s text analysis tech is “much more advanced”, given its been honing its algorithmic smarts for more than three years.

“There are similar offerings by Diffbot, by Alchemy API [and IBM Watson], but we've a lot more of a heavy focus on the fact that this is news content, not just any kind of document,” he adds. “Those guys see at news articles as just any other webpage. So that’s one key differentiator.”

Pricing for Aylien’s News API SaaS starts at $49 per month — for which the customer gets 30.000 articles. After that, if they wish to analyze more data, there’s a sliding scale price per article.

The startup, which was founded back in November two thousand twelve, has raised $1.3 million to date, most recently taking in a €580.000 circular from SOSV and Undertaking Ireland just last month.

READ ALSO
Restaurant waitlisting app NoWait rolls out mobile payments

Restaurant waitlisting app NoWait rolls out mobile payments

This helps speed up the time it takes to grab a meal, as you don’t have to waste time waiting for a table to become available. Presently the company is planning to speed up your dining outing at the finish of the meal, too.

70
With money to invest, Common Catalyst promotes four, including Niko Bonatsos

With money to invest, Common Catalyst promotes four, including Niko Bonatsos

The announcements come just weeks after the venture firm, which has offices in Cambridge, Ma.; New York;  and Palo Alto, closed on $845 million in new capital commitments from its investors.

94
Chatty fiction app Hooked gets an upgrade where readers can write stories, too

Chatty fiction app Hooked gets an upgrade where readers can write stories, too

Following up: Letting readers write their own stories. That arrives today with the launch of Hooked 2.0. Any Hooked reader can presently create characters and then write their legend one message at a time.

68
AirMap raises $15M Series A circular to expand its airspace management system for drones

AirMap raises $15M Series A circular to expand its airspace management system for drones

The circular was led by Common Catalyst Partners, with Lux Capital (which led the company’s $2.6 million seed round), Social Capital, TenOneTen Ventures, Bullpen Capital and the Pritzker Grouping also participating.

70