Multi‑language text analytics and web data platforms
eTek Systems LLC designs and builds distributed web crawling, search and NLP analytics systems for companies that rely on large-scale web and news data.
eTek Systems LLC designs and builds distributed web crawling, search and NLP analytics systems for companies that rely on large-scale web and news data.
Keyphrase, topic, sentiment and similarity analysis for large multilingual text collections.
High-volume crawling and scraping of websites and social networks with flexible extraction rules.
Sharded storage, clustered processing and hierarchical transport systems for big data workloads.
Web-based dashboards and reporting tools for analysts working with web and news data.
What we build
eTek Systems LLC grew out of a group of engineers who started in the early 2000s by developing websites, e‑commerce, logistics, CMS and CRM systems for companies in the US, Australia and the EU. Since 2004 the team has focused on long‑term, high‑load data platforms for demanding customers.
Over the years we have designed and implemented custom frameworks for distributed data mining and analysis, including multi‑host crawlers, hierarchical cluster engines and containers of NLP algorithms for batch processing of news and web texts.
Our solutions are used to process daily streams of news articles and social media messages, extract entities and topics, measure sentiment, detect trends and provide analysts with interactive visualizations and reports.
What we help you build
Linguistic analysis of texts: keyphrases, topics, characteristic chains, entities, similarity, clustering and sentiment on top of custom rating scales.
Discuss NLP projectMulti‑host crawlers and scrapers for websites and social networks, processing up to hundreds of thousands of news articles per day with flexible extraction rules.
Discuss crawling needsHierarchical cluster transport systems, sharding, load balancing and parallel processing pipelines for big data storage and computation.
Plan a data platformFull‑text and structured search, similarity evaluation, auto‑classification and ranking algorithms for content‑rich systems.
Improve search qualityWeb applications for analysts with complex reports, charts, timelines, dendrograms and relation graphs tailored to your data.
Design analytics UIArchitecture reviews, prototyping and research for projects involving large‑scale text data, search engines and distributed infrastructures.
Talk to an expertExamples of platforms we have engineered
Large-scale news database and analytics system that continuously crawls news from hundreds of online sources in multiple languages, aggregates tens of thousands of pages daily and generates scheduled PDF digests and dashboards for analysts, crawling news from over 650 online sources in different languages, with a daily extract of over 90,000 news pages.
Personalized news delivery service that builds individual feeds for readers, using custom filtering and recommendation algorithms to balance relevance, diversity of topics and to reduce the long tail and filter bubble effects.
Data mining and monitoring solution that extracts product details from e‑commerce websites, tracks availability and pricing, stores structured data and provides tracking tools and reports for business users. TagsReaper is advanced automated scraping tool we developed using top-notch technologies and out 10 yeas experience in data collecting and processing.
Core engineering group
eTek Systems LLC is built around a compact team of senior software engineers, solution architects and system administrators with backgrounds in applied mathematics, computer science and large‑scale internet systems.
Need Our Help? Contact Us
Share a short description of your data sources, volumes and analysis goals, and we will get back to you with possible approaches.