• Skip to main content
  • Skip to secondary menu
  • Skip to primary sidebar
  • Home
  • Contact Us

iHash

News and How to's

  • The 2024 Complete Presentation & Public Speaking Bundle for $24

    The 2024 Complete Presentation & Public Speaking Bundle for $24
  • Apple iPhone XS Max (A1921) 64GB – Gold (Grade A+ Refurbished: Wi-Fi + Unlocked) for $349

    Apple iPhone XS Max (A1921) 64GB – Gold (Grade A+ Refurbished: Wi-Fi + Unlocked)  for $349
  • Apple iPhone XR (A1984) 256GB – White (Grade A+ Refurbished: Wi-Fi + Unlocked) for $329

    Apple iPhone XR (A1984) 256GB  – White (Grade A+ Refurbished: Wi-Fi + Unlocked) for $329
  • The 2024 Google Sheets Formulas & Automation Bundle for $39

    The 2024 Google Sheets Formulas & Automation Bundle for $39
  • MEAZOR 3D Laser Measurer for $299

    MEAZOR 3D Laser Measurer  for $299
  • News
    • Rumor
    • Design
    • Concept
    • WWDC
    • Security
    • BigData
  • Apps
    • Free Apps
    • OS X
    • iOS
    • iTunes
      • Music
      • Movie
      • Books
  • How to
    • OS X
      • OS X Mavericks
      • OS X Yosemite
      • Where Download OS X 10.9 Mavericks
    • iOS
      • iOS 7
      • iOS 8
      • iPhone Firmware
      • iPad Firmware
      • iPod touch
      • AppleTV Firmware
      • Where Download iOS 7 Beta
      • Jailbreak News
      • iOS 8 Beta/GM Download Links (mega links) and How to Upgrade
      • iPhone Recovery Mode
      • iPhone DFU Mode
      • How to Upgrade iOS 6 to iOS 7
      • How To Downgrade From iOS 7 Beta to iOS 6
    • Other
      • Disable Apple Remote Control
      • Pair Apple Remote Control
      • Unpair Apple Remote Control
  • Special Offers
  • Contact us

Synthetic Data: The Cure to Data Drift?

Jun 17, 2023 by iHash Leave a Comment

Recent advancements in AI and computer vision capabilities have massively increased the scale and demand for training data. While real world data continues to dominate AI training, it is often becoming out of date in as short as six months. This is an area of concern as constantly evolving trends and the need for businesses to stay agile, leave little to no room for error in decision making.

It’s critical that organisations have available reliable, accurate training data more than ever before. Yet we recently found that almost two-thirds of organisations suffer from data drift in their training data.

Data drift is a discrepancy between the actual data processed by the deployed system and the training data used to train, validate and test the AI model that processes that real world input. This can arise as a result of various factors, including seasonal variations, climate change and even changes in fashion. Regularly monitoring the performance of a computer vision model is essential to successful deployment. If data drift is not identified in time, it can have serious implications on model performance leading to incorrect business decisions being made.

This phenomenon can be manageable if dealt with appropriately, usually requiring retraining of the model on new data but the effort needed will vary depending on the extent of the issue. This can be disruptive, causing ongoing problems for organisations and be a costly problem to solve. Therefore, detecting data drift should be a key part of the machine learning lifecycle. Ideally this should be an automated process supported by careful action. 

What actions can be taken?

Methods of dealing with data drift are often not mutually exclusive, meaning multiple strategies can and may need to be employed. An effective solution to minimising potential data drift has emerged in the form of synthetic training data. It is artificially generated from computer systems and provides the opportunity to produce greater volumes of accurate training data quickly and more cost-effectively than acquiring real world training data. But, beyond this, it can enhance the robustness of AI models by delivering training data for edge-cases that may be difficult or dangerous to repeat in the real world.

Systems that create synthetic training data allow users to generate training data on demand as opposed to waiting for real-world occurrences, enabling greater control over the training process and providing an opportunity to act before data becomes obsolete. 85% of organisations are already making use of synthetic data to train computer vision systems and of those who don’t, almost a third (29%) anticipate their organization will start using it in 2023.

How can synthetic data ensure data drift is a thing of the past?

Synthetic data offers a plethora of advantages. It’s fast to create, easy to update and cost effective when compared to acquiring real world training data. In particular annotation of real-world training data is labour intensive, time consuming, expensive and less accurate than annotation of synthetic data which is an automated and pixel accurate process. Synthetic training data can also be intelligently created in greater volumes, which is particularly beneficial in building more robust AI models. By filling in gaps and supplementing real-world data, the use of synthetic training data can alleviate the fundamental issues leading to data drift.

Another key advantage of synthetic data is the opportunity to optimise training efficiency. Large volumes of synthetic data can be generated much more rapidly than the alternative of collating real-world data. Users are therefore able to quickly gather training data for cases where new data is needed immediately.

For example, at the height of the pandemic, the mandate of face masks and social distancing meant that some AI systems were outdated, and needed to be retrained to recognise someone wearing a face covering. Another example is the deployment of electric scooters, which has also harnessed machine vision for harm detection and aids in preventing accidents. In addition to updating datasets to prevent data drift, data that is no longer relevant should be removed too. This can be done efficiently with the help of synthetic data training.

Training datasets containing private data present a risk of violating privacy regulations when used to train models. Synthetic data avoids this risk as it does not contain information traceable to individuals. Ensuring privacy compliance is essential to protecting individuals and businesses from legal and financial consequences, as well as aiding in building trust in AI. 

Overall, synthetic data provides robust and versatile datasets for AI training purposes. It does not rely on manual efforts and so, is quicker, comprehensive and more cost-effective to gather. With technological advancement and innovation, synthetic data is becoming richer, more diverse, and closely aligned to real world data. It can help to maintain user privacy and keep enterprises compliant, all of which furthers its ability to overcome the potential of data drift.

About the Author

Steve Harris, CEO of Mindtech, has over 30 years of experience in the technology market sector and holds a masters in Microprocessor Engineering from Manchester University. He has previously been instrumental in creating several European start-up organisations, with a proven track record of success in building strategic relationships and strong revenue streams with tier one companies worldwide. Prior to his current role, he has worked in a number of senior sales and business development positions at leading technology companies, such as: Imagination Technologies, Gemstar, Liberate, and Sun Microsystems, allowing him to bring a wealth of insight and expertise to Mindtech.

Sign up for the free insideBIGDATA newsletter.

Join us on Twitter: https://twitter.com/InsideBigData1

Join us on LinkedIn: https://www.linkedin.com/company/insidebigdata/

Join us on Facebook: https://www.facebook.com/insideBIGDATANOW

Source link

Share this:

  • Facebook
  • Twitter
  • Pinterest
  • LinkedIn

Filed Under: BigData

Special Offers

  • The 2024 Complete Presentation & Public Speaking Bundle for $24

    The 2024 Complete Presentation & Public Speaking Bundle for $24
  • Apple iPhone XS Max (A1921) 64GB – Gold (Grade A+ Refurbished: Wi-Fi + Unlocked) for $349

    Apple iPhone XS Max (A1921) 64GB – Gold (Grade A+ Refurbished: Wi-Fi + Unlocked)  for $349
  • Apple iPhone XR (A1984) 256GB – White (Grade A+ Refurbished: Wi-Fi + Unlocked) for $329

    Apple iPhone XR (A1984) 256GB  – White (Grade A+ Refurbished: Wi-Fi + Unlocked) for $329
  • The 2024 Google Sheets Formulas & Automation Bundle for $39

    The 2024 Google Sheets Formulas & Automation Bundle for $39
  • MEAZOR 3D Laser Measurer for $299

    MEAZOR 3D Laser Measurer  for $299

Reader Interactions

Leave a ReplyCancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Primary Sidebar

  • Facebook
  • GitHub
  • Instagram
  • Pinterest
  • Twitter
  • YouTube

More to See

Apple introduces the advanced new Apple Watch Series 9

Sep 24, 2023 By iHash

New Apple Zero-Days Exploited to Target Egyptian ex-MP with Predator Spyware

Sep 23, 2023 By iHash

Tags

* Apple attacks Cisco computer security cyber attacks cyber crime cyber news cybersecurity Cyber Security cyber security news cyber security news today cyber security updates cyber threats cyber updates data data breach data breaches google hacker hacker news Hackers hacking hacking news how to hack incident response information security iOS 7 iOS 8 iPhone Malware microsoft network security ransomware ransomware malware risk management security security breaches security vulnerabilities software vulnerability the hacker news Threat update video web applications

Latest

Secure your Elastic Cloud deployment with AWS PrivateLink traffic filter

Secure your Elastic Cloud deployment with AWS PrivateLink traffic filter

Traffic filters consist of rule(s) that specify the source of traffic, such as IP/CIDR or AWS VPC endpoint, and rule sets, which are a set of traffic filter rules. Rule sets are then associated with the deployment and can restrict access to the deployment based on those rules. By default, customers connect to deployment over […]

Apple expands the power of iCloud with new iCloud+ plans

September 18, 2023 UPDATE Apple expands the power of iCloud with new iCloud+ plans Beginning today, Apple users will have the option to choose from two additional iCloud+ plans: 6TB for $29.99 per month and 12TB for $59.99 per month. The new plans are a perfect complement to the powerful 48MP Main cameras on the […]

New Advanced Backdoor with Distinctive Malware Tactics

Sep 23, 2023THNCyber Espionage / Malware Cybersecurity researchers have discovered a previously undocumented advanced backdoor dubbed Deadglyph employed by a threat actor known as Stealth Falcon as part of a cyber espionage campaign. “Deadglyph’s architecture is unusual as it consists of cooperating components – one a native x64 binary, the other a .NET assembly,” ESET […]

The 2024 Complete Presentation & Public Speaking Bundle for $24

Expires September 23, 2123 07:59 PST Buy now and get 90% off The Complete Presentation & Public Speaking/Speech Course KEY FEATURES Become a master of public speaking and presentation with the complete Presentation and Public Speaking/Speech course. This course offers the most comprehensive and enjoyable training available on the market, with numerous exercises, examples, and […]

How to Interpret the 2023 MITRE ATT&CK Evaluation Results

Sep 22, 2023The Hacker NewsMITRE ATT&CK / Cybersecurity Thorough, independent tests are a vital resource for analyzing provider’s capabilities to guard against increasingly sophisticated threats to their organization. And perhaps no assessment is more widely trusted than the annual MITRE Engenuity ATT&CK Evaluation. This testing is critical for evaluating vendors because it’s virtually impossible to […]

insideBIGDATA AI News Briefs – 9/22/2023

Welcome insideBIGDATA AI News Briefs, our timely new feature bringing you the latest industry insights and perspectives surrounding the field of AI including deep learning, large language models, generative AI, and transformers. We’re working tirelessly to dig up the most timely and curious tidbits underlying the day’s most popular technologies. We know this field is […]

Jailbreak

Pangu Releases Updated Jailbreak of iOS 9 Pangu9 v1.2.0

Pangu has updated its jailbreak utility for iOS 9.0 to 9.0.2 with a fix for the manage storage bug and the latest version of Cydia. Change log V1.2.0 (2015-10-27) 1. Bundle latest Cydia with new Patcyh which fixed failure to open url scheme in MobileSafari 2. Fixed the bug that “preferences -> Storage&iCloud Usage -> […]

Apple Blocks Pangu Jailbreak Exploits With Release of iOS 9.1

Apple has blocked exploits used by the Pangu Jailbreak with the release of iOS 9.1. Pangu was able to jailbreak iOS 9.0 to 9.0.2; however, in Apple’s document on the security content of iOS 9.1, PanguTeam is credited with discovering two vulnerabilities that have been patched.

Pangu Releases Updated Jailbreak of iOS 9 Pangu9 v1.1.0

  Pangu has released an update to its jailbreak utility for iOS 9 that improves its reliability and success rate.   Change log V1.1.0 (2015-10-21) 1. Improve the success rate and reliability of jailbreak program for 64bit devices 2. Optimize backup process and improve jailbreak speed, and fix an issue that leads to fail to […]

Activator 1.9.6 Released With Support for iOS 9, 3D Touch

  Ryan Petrich has released Activator 1.9.6, an update to the centralized gesture, button, and shortcut manager, that brings support for iOS 9 and 3D Touch.

Copyright iHash.eu © 2023
We use cookies on this website. By using this site, you agree that we may store and access cookies on your device. Accept Read More
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Non-necessary
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
SAVE & ACCEPT