• Home
  • About Us
  • Contact Us
  • Disclaimer
  • Privacy Policy
Saturday, May 16, 2026
newsaiworld
  • Home
  • Artificial Intelligence
  • ChatGPT
  • Data Science
  • Machine Learning
  • Crypto Coins
  • Contact Us
No Result
View All Result
  • Home
  • Artificial Intelligence
  • ChatGPT
  • Data Science
  • Machine Learning
  • Crypto Coins
  • Contact Us
No Result
View All Result
Morning News
No Result
View All Result
Home Machine Learning

Detecting Anomalies in Social Media Quantity Time Sequence | by Lorenzo Mezzini | Nov, 2024

Admin by Admin
November 11, 2024
in Machine Learning
0
173b Mpuq8zeaip4lrtgfba.png
0
SHARES
3
VIEWS
Share on FacebookShare on Twitter

READ ALSO

From Knowledge Analyst to Knowledge Engineer: My 12-Month Self-Research Roadmap

Why My Coding Assistant Began Replying in Korean Once I Typed Chinese language


Analyzing a Pattern Twitter Quantity Dataset

Let’s begin by loading and visualizing a pattern Twitter quantity dataset for Apple:

Quantity and log-Quantity noticed for AAPL Twitter volumes
Picture by Creator

From this plot, we will see that there are a number of spikes (anomalies) in our information. These spikes in volumes are those we need to establish.

Wanting on the second plot (log-scale) we will see that the Twitter quantity information reveals a transparent day by day cycle, with larger exercise throughout the day and decrease exercise at evening. This seasonal sample is widespread in social media information, because it displays the day-night exercise of customers. It additionally presents a weekly seasonality, however we’ll ignore it.

Eradicating Seasonal Developments

We need to make it possible for this cycle doesn’t intrude with our conclusions, thus we’ll take away it. To take away this seasonality, we’ll carry out a seasonal decomposition.

First, we’ll calculate the shifting common (MA) of the quantity, which is able to seize the pattern. Then, we’ll compute the ratio of the noticed quantity to the MA, which supplies us the multiplicative seasonal impact.

Multiplicative impact of time on volumes
Picture by Creator

As anticipated, the seasonal pattern follows a day/evening cycle with its peak throughout the day hours and its saddle at nighttime.

To additional proceed with the decomposition we have to calculate the anticipated worth of the quantity given the multiplicative pattern discovered earlier than.

Quantity and log-Quantity noticed and anticipated for AAPL Twitter volumes
Picture by Creator

Analyzing Residuals and Detecting Anomalies

The ultimate element of the decomposition is the error ensuing from the subtraction between the anticipated worth and the true worth. We will contemplate this measure because the de-meaned quantity accounting for seasonality:

Absolute Error and log-Error after seasonal decomposition of AAPL Twitter volumes
Picture by Creator

Curiously, the residual distribution intently follows a Pareto distribution. This property permits us to make use of the Pareto distribution to set a threshold for detecting anomalies, as we will flag any residuals that fall above a sure percentile (e.g., 0.9995) as potential anomalies.

Absolute Error and log-Error quantiles Vs Pareto quantiles
Picture by Creator

Now, I’ve to do an enormous disclaimer: this property I’m speaking about is just not “True” per se. In my expertise in social listening, I’ve noticed that holds true with most social information. Aside from some proper skewness in a dataset with many anomalies.

On this particular case, we now have effectively over 15k observations, therefore we’ll set the p-value at 0.9995. Given this threshold, roughly 5 anomalies for each 10.000 observations shall be detected (assuming an ideal Pareto distribution).

Due to this fact, if we test which commentary in our information has an error whose p-value is larger than 0.9995, we get the next alerts:

Indicators anomalies of AAPL Twitter volumes
Picture by Creator

From this graph, we see that the observations with the very best volumes are highlighted as anomalies. In fact, if we need extra or fewer alerts, we will regulate the chosen p-value, preserving in thoughts that, because it decreases, it can enhance the variety of alerts.

Tags: AnomaliesDetectingLorenzoMediaMezziniNovseriesSocialtimevolume

Related Posts

Data engineer.jpg
Machine Learning

From Knowledge Analyst to Knowledge Engineer: My 12-Month Self-Research Roadmap

May 16, 2026
Valery rabchenyuk 5i ofqb0n6g unsplash scaled 1.jpg
Machine Learning

Why My Coding Assistant Began Replying in Korean Once I Typed Chinese language

May 15, 2026
Chatgpt image may 10 2026 11 10 46 pm.jpg
Machine Learning

What’s the Greatest Approach to Brainwash an LLM?

May 14, 2026
Rag article 3.jpg
Machine Learning

Hybrid Search and Re-Rating in Manufacturing RAG

May 13, 2026
Chatgpt image 5 mai 2026 02 58 40.jpg
Machine Learning

Studying Phrase Vectors for Sentiment Evaluation: A Python Copy

May 12, 2026
Batch vs stream main 1308x480 1 copy.jpg
Machine Learning

Batch or Stream? The Everlasting Information Processing Dilemma

May 10, 2026
Next Post
Fashion And Color Psychology 1024x574 1.jpg

Quantum Computing and Its Implications for Future Knowledge Infrastructure

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

POPULAR NEWS

Gemini 2.0 Fash Vs Gpt 4o.webp.webp

Gemini 2.0 Flash vs GPT 4o: Which is Higher?

January 19, 2025
Chainlink Link And Cardano Ada Dominate The Crypto Coin Development Chart.jpg

Chainlink’s Run to $20 Beneficial properties Steam Amid LINK Taking the Helm because the High Creating DeFi Challenge ⋆ ZyCrypto

May 17, 2025
Image 100 1024x683.png

Easy methods to Use LLMs for Highly effective Computerized Evaluations

August 13, 2025
Blog.png

XMN is accessible for buying and selling!

October 10, 2025
0 3.png

College endowments be a part of crypto rush, boosting meme cash like Meme Index

February 10, 2025

EDITOR'S PICK

0 6f4yz6fmmrhnfgte.jpg

Understanding the Generative AI Consumer | In direction of Knowledge Science

December 20, 2025
The20raid20on20the20fraudulent20crypto20investment20operation20by20the20european20authorities id 2eb631c2 40fa 450d 8a2d b788d0c29a7b size900.jpg

Europe Busts EUR 700 Million Crypto Fraud Community that Used Deep Faux Adverts

December 8, 2025
Kraken id 4d337104 0e27 49e1 a7d5 9c41caa4cec8 size900.jpg

Kraken Relocates Headquarters to Wyoming Following Launch of Prime Platform

June 22, 2025
Gemini generated image stpvlkstpvlkstpv scaled 1.jpg

A Sensible Information to Reminiscence for Autonomous LLM Brokers

April 17, 2026

About Us

Welcome to News AI World, your go-to source for the latest in artificial intelligence news and developments. Our mission is to deliver comprehensive and insightful coverage of the rapidly evolving AI landscape, keeping you informed about breakthroughs, trends, and the transformative impact of AI technologies across industries.

Categories

  • Artificial Intelligence
  • ChatGPT
  • Crypto Coins
  • Data Science
  • Machine Learning

Recent Posts

  • From Knowledge Analyst to Knowledge Engineer: My 12-Month Self-Research Roadmap
  • TurboQuant: Is the Compression and Efficiency Well worth the Hype?
  • How I Regularly Enhance My Claude Code
  • Home
  • About Us
  • Contact Us
  • Disclaimer
  • Privacy Policy

© 2024 Newsaiworld.com. All rights reserved.

No Result
View All Result
  • Home
  • Artificial Intelligence
  • ChatGPT
  • Data Science
  • Machine Learning
  • Crypto Coins
  • Contact Us

© 2024 Newsaiworld.com. All rights reserved.

Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?