English | 简体中文 | 繁體中文 | 한국어 | 日本語
Thursday, 20 October 2016, 15:05 JST
Share:
    

Source: Fujitsu Ltd
Fujitsu Technology to Elicit New Insights from Graph Data that Expresses Ties between People and Things
Surpassing limits of conventional deep learning for machine learning technology that acquires knowledge, verifies effectiveness of in IoT, finance, and pharmaceutical fields

KAWASAKI, Japan, Oct 20, 2016 - (JCN Newswire) - Fujitsu Laboratories Ltd. today announced the development of machine learning technology that enables highly accurate analysis of graph-structured data that expresses the relationships between people and things.

Fig. 1 Data expressed in a graph structure and tensor expression

Fig.2 tensor-based uniform expression and graph-structured data classification

Fig.3 learning of neural network and optimization of a uniform expression

Fujitsu Laboratories has now developed new technology that allows existing deep learning technology, which has already achieved extremely high accuracy in image and voice recognition, to be applied to graph-structured data. Graph-structured data has a complicated structure and mixes a variety of data, such as different sizes and methods of expression, but by transforming different data to a uniform expression called a "tensor"(1), used in cutting-edge mathematics, it becomes possible to do highly accurate machine learning on graph-structured data using deep learning technology.

This technology was used for learning the structure and activities of chemical compounds, based on data from the PubChem BioAssay(2) open database of chemical compounds. It was able to learn the relationships between the structures of several hundred thousand chemical compounds, about 100 times that of previous technology, as well as their individual activities. Also, by extracting features that could not be grasped with existing technology, it achieved about 80% accuracy in predicting activity, a 10% increase compared to existing technology.

This technology will be used as part of Human Centric AI Zinrai, Fujitsu Limited's AI technology.

Fig. 1 Data expressed in a graph structure and tensor expression
http://www.acnnewswire.com/topimg/Low_FujitsuData11020.jpg

Development Background

In recent years, drug discovery and a variety of other fields utilize composition databases, such as for finance and chemical substances. These databases handle IoT log data for communication between things, or account transactions, and continue to generate an enormous amount of data that can be expressed in a graph structure to show the relationships between people and things (Fig.1). Previously, Fujitsu Laboratories had developed technology, known as "LOD"(3) to retrieve and analyze graph-structured data. It is expected that accurately categorizing and analyzing this graph-structured data will lead to the creation of new value and the opening up of new business areas.

Issues

Previously, categorization of graph-structured data was done on the basis of whether such data contained partial graphs people had previously focused on. When categorizing large volumes of graph-structured data, however, there were many yet-to-be-expressed features in the partial graphs that had been explored beforehand, so there were limits to achieving accurate categorization.

Deep learning technology can automatically extract characteristic features from data, attracting attention to such areas as image and voice recognition, but due to the complicated structure and the variety of data sizes and expressions mixed in graph-structured data, it was difficult to apply deep learning technology to the problem.

About the Technology

Fujitsu Laboratories has now developed new deep learning technology that can learn with high accuracy from a variety of graph-structured data that express the connections between people and things. Features of the technology are as follows:

1. New tensor factorization technology converts graph-structured data to a uniform expression

This technology uses a type of mathematical expression called a tensor, an extension of vectors and matrices, to express graph-structured data that has a variety of expression formats (Fig. 1). It uses a mathematical operation called tensor factorization(4), a cutting-edge data mining technology, to transform data to a uniform expression format (Fig. 2). Conventional tensor factorization could not always transform similar graph-structured data into similar tensor expressions, but now Fujitsu Laboratories has developed a technology that can perform tensor factorization in a way that maximizes the degree of similarity to an arbitrary pattern chosen as a basis.

Fig.2 tensor-based uniform expression and graph-structured data classification
http://www.acnnewswire.com/topimg/Low_FujitsuData21020.jpg

2. Technology that optimizes uniform expressions and neural network learning

By extending the scope of application of back-propagation(5), which is commonly used in the learning process for neural networks, to tensor expressions, this technology simultaneously optimizes uniform expressions to maximize the accuracy of categorization (Fig. 3). Specifically, it updates the basis pattern for tensor expressions according to the amount of the difference in the categorization error of the neural network when the basis pattern is changed.

Fig.3 learning of neural network and optimization of a uniform expression
http://www.acnnewswire.com/topimg/Low_FujitsuData31020.jpg

Effects

With this new deep learning technology, it is now possible to use data that can be expressed with a graph structure, such as the communication logs of computers or IoT devices, financial transactions, or chemical compositions, in new analyses.

In a trial in which this technology was applied to data from the PubChem BioAssay open database of the structure and activity of chemical compounds, and then to a virtual screening, which searches for candidate chemical compounds for drugs on a computer, it was able to learn the relationships between the structure and activity of several hundred thousand chemical compounds, about 100 times what was achieved with previous technology using support vector machines(6). By extracting features that could not be grasped with previous technology, it achieved an activity prediction accuracy of about 80%, an improvement of 10% compared with existing technology. It is expected that this will greatly reduce development time and cost, which are pressing issues in drug development.

In addition, Fujitsu Laboratories conducted a trial to detect illicit activity or attacks in which this technology was applied to benchmark data(7) derived from graph-structured data representing the communication relationships between hosts. The result was that false positives were successfully reduced by more than 20% compared with existing methods using support vector machines. It is expected that this will increase the efficiency of network monitoring tasks. Beyond that, by applying this technology to such data as records of transactions with digital currencies or the financing records of social lending services, such improvements as highly accurate detection of improper monetary manipulation or sophisticated judgements of suitability for lending become possible.

Future Plans

Fujitsu Laboratories will continue to further improve the accuracy of its categorization technology for graph-structured data, aiming to bring it into practical implementation as a core technology of Human Centric AI Zinrai. In addition, Fujitsu Laboratories will continue to expand the applicability of deep learning technology to more diverse data formats, providing advanced data analysis in a variety of fields from the first half of fiscal 2017.

Endorsement

As a technology that makes it possible to learn from large volumes of diverse data from the life sciences, and that is excellent at learning from large-scale data, deep learning has also been attracting attention in the pharmaceutical industry, where designing chemical-compound feature quantities suited to various predicted effects, such as drug efficacy and side effects, is a significant issue. I expect that Fujitsu's new deep learning technology, which can automatically create a number of features suited for prediction from the learning data, will have a huge impact on the pharmaceutical field.

Professor Yasushi Okuno, Department of Biomedical Data Intelligence, Graduate School of Medicine, Kyoto University

(1) Tensor

Data representing multidimensional arrays, a generalization of the concepts of vectors and matrices.

(2) PubChem BioAssay

The world's largest dataset recording data on the structure and activity of chemical compounds in tests of medicinal efficacy and toxicity.

(3) Linked Open Data (LOD)

Open data that can be mutually connected to from anywhere in the world.

(4) Tensor factorization

A technology that factors multidimensional arrays based on the sum of the correlations between multiple elements.

(5) Backpropagation

An algorithm that reduces classification error in neural networks.

(6) Support vector machines

A machine learning method that calculates hyperplanes in multidimensional space, and that can accurately separate data.

(7) Benchmark data

DARPA Intrusion Detection Data Sets.

About Fujitsu Laboratories

Founded in 1968 as a wholly owned subsidiary of Fujitsu Limited, Fujitsu Laboratories Ltd. is one of the premier research centers in the world. With a global network of laboratories in Japan, China, the United States and Europe, the organization conducts a wide range of basic and applied research in the areas of Next-generation Services, Computer Servers, Networks, Electronic Devices and Advanced Materials. For more information, please see: http://www.fujitsu.com/jp/group/labs/en/.

Contact:
Fujitsu Laboratories Ltd.
Knowledge Information Processing Laboratory
E-mail: deeptensor@ml.labs.fujitsu.com

Fujitsu Limited
Public and Investor Relations
Tel: +81-3-3215-5259
URL: www.fujitsu.com/global/news/contacts/


Topic: Press release summary
Source: Fujitsu Ltd

Sectors: Electronics, Science & Research
https://www.acnnewswire.com
From the Asia Corporate News Network


Copyright © 2024 ACN Newswire. All rights reserved. A division of Asia Corporate News Network.

 
Fujitsu Ltd Links

http://www.fujitsu.com

https://plus.google.com/+Fujitsu

https://www.facebook.com/FujitsuJapan

https://twitter.com/Fujitsu_Global

https://www.youtube.com/user/FujitsuOfficial

https://www.linkedin.com/company/fujitsu/

Fujitsu Ltd Related News
2024年12月16日 10時07分 JST
富士通が、米IDC社のレポート「IDC MarketScape: Worldwide Digital Workplace Services 2024 Vendor Assessment」でリーダーの評価を獲得
Monday, 16 December 2024, 10:20 JST
Fujitsu recognized as Leader in IDC MarketScape: Worldwide Digital Workplace Services 2024 Vendor Assessment
2024年12月12日 10時30分 JST
富士通、世界初 脆弱性や新たな脅威への事前対策を支援するマルチAIエージェントセキュリティ技術を開発
Thursday, 12 December 2024, 11:06 JST
Fujitsu develops video analytics AI agent to support safe, secure, and efficient frontline workplaces
Thursday, 12 December 2024, 10:28 JST
Fujitsu develops world's first multi-AI agent security technology to protect against vulnerabilities and new threats
More news >>
Copyright © 2024 ACN Newswire - Asia Corporate News Network
Home | About us | Services | Partners | Events | Login | Contact us | Cookies Policy | Privacy Policy | Disclaimer | Terms of Use | RSS
US: +1 214 890 4418 | China: +86 181 2376 3721 | Hong Kong: +852 8192 4922 | Singapore: +65 6549 7068 | Tokyo: +81 3 6859 8575