TTNEWS
  • entertainment
  • Sport
  • Finance
  • tec
  • Travel
  • military
  • Parenting
  • fashion
  • game
  • history
  1. home
  2. tec

The Chinese Academy of Sciences uses math research in deep learning to help understand the effectiveness of the depth of neural network

2024-11-11 21:18:53

The success of deep learning has no need to say more.Researchers have always tried to explain the effectiveness of neural networks from the perspective of mathematics.However, because the structure of the network can be regarded as a multi -compulsory compound between high -dimensional linear transformation and non -linear transformation (such as the RELU activation function), there is actually no good mathematical tool to crack such complex structures.

Therefore, theoretical research of neural networks is often limited to the approaching, optimization, generalization, and other observed phenomena of the network.

If you put aside the theoretical limit, an indisputable fact is that a wider and deeper network always has a better effect.As small as a few layers of full -connected networks and large models as large as trillion -scale, they all consistently maintain such rules.

So how to understand the facts in theory?What role does the activation function play in it?

Compared with width, it is more challenging for depth research, because the increase in the number of layers is also accompanied by the continuous compound of non -linear functions.

A typical problem is that when the width of the model is fixed, does the depth of the model be increased than the shallow model to fit more data points?

Graduate Grace Graduate Grand School of Applied Mathematics of the Chinese Academy of Sciences, Gai Kuo completed a work of generating a network algorithm design and an explanatory work of a phenomenon.Title.

Because I am a math background, I want to do some theoretical results.However, the framework of neural network theoretical research at that time was very clear, and the remaining blank problems were very difficult.

"So that I have read the existing literature for a long time, and I have not found an original entry point." He said.

After experiencing a series of unsuccessful attempts, Gai Kuo returned to the original intuitive idea: because the width of the network is easier to analyze, such as a simple linear equation

, when the size of W is increased, the number of equations between X and Y that can be solved will also increase linearly.

If the depth can be equivalent to the width, the two layers of networks are equivalent to a single layer of large matrix, then you can find the solution of this large matrix equation by the elevation methodThis corresponds to the solution of the two layers of neural networks, which also shows that increasing the depth of the network is as effective as increasing width.

However, there are almost no tools to help calculations for the composite between the elemental non -linear activation function and the matrix multiplication, and it does not have a good optimization nature.

For example, for equations

Assumptions

p>

is the RELU or SIGMOID function, so it is difficult to solve this equation.

Because it is not a problem, even if the optimized method is used, it will not guarantee that the answer will be obtained.However, solving such an equation is an important step in his ideas.

Although it has not been further promoted, the specific form of the problem is relatively clear.Gekuo said that if the range of the activation function is widened, this equation can be found (for example, replacing the activation function with a matrix index).

The advantage of doing this is that when the two matrices are exchanged, after the matrix index function is activated, the matrix obtained is also exchanged.

In order to make the specific matrix have a exchanging properties, an additional layer of network parameters need to be added.With the exchanging nature, it is easy to solve the above equation, so you can do eliminate element in an equivalent large matrix and find a set of solutions for the three layers of functions.

In this way, he realized the original idea under this special activation function.

Specifically, after discussing the discussion of Dr. Gai Kuo and Dr. Zhang Shihua, if you can find a simple and direct example, it can explain that the network deepen a layer when there is activation functions.After that can fit more data points, this result may be more meaningful.

To this end, they extend the network parameters to the complex domain, and the activation function of the element is replaced by the element to the matrix index activation function, so that the three layers of neural networks:

Find a set of parsing solutions:

All matrices are DDimine square matrix, which shows the effectiveness of the network depth.Because if there is only one layer of network, you can only satisfy one set

In general, they have found a better example in theory, which can help people better better wayUnderstand the depth of neural network and the effectiveness of the non -linear activation function.

In the experiment, they observed that although the theoretical results are for the activation function of the matrix index, for the element of RELU or Sigmoid activation functionThe similar optimization result is observed from time to time, that is, the ability of the two layers of network fitting data points is about twice the single layer.And this may inspire other researchers to find more general conclusions.

Recently, the relevant papers are based on the "Analytical Solution of A Threeer Network with A Matrix Exponential Activity" funch. InArxiv [1].

Gai Kuo said: "Thank you Teacher Zhang Shihua for their support and encouragement. When the subject has not progressed, Mr. Zhang did not give the paper on the paper.Published pressure, and did not urge the change of the topic.In the end, I found the solution.

Reference Data:

1.https: //arxiv.org/pdf/2407.02540

Types: Stream Tree

Popular information
  • Dalian chemical institute, the latest nature nanotechnology! | 2024-11-03 18:02:12
  • It is more expensive than the aircraft carrier. TSMC purchases 2.5 billion littering machines. | 2024-11-03 18:02:20
  • Go home in the early morning of the 4th!Shenzhou 18th is ready to evacuate. Why should the astronauts return? | 2024-11-03 18:02:23
  • Notice!Ministry of National Security: Overseas spy intentions to carry out remote sensing observation of remote sensing through high -precision satellites | 2024-11-03 18:02:28
  • He, 28 -year -old Professor Ren Chuan University/Bo Dao, Jie Qing/Youqing/Humboldt Scholars, and then posted JACS! | 2024-11-03 18:02:32
  • Guo Mingzheng: Apple is expected to turn almost all products to the internal Wi-Fi chip within about three years | 2024-11-03 18:03:30
  • Will walking faster in the rain will make you less rain?Physicists analyze for you | 2024-11-03 18:03:34
  • AMD RDNA4 graphics card is expected to appear early next year will support AI frame generation technology FSR4.0 | 2024-11-03 18:04:37
  • It is reported that Apple, Samsung and Qualcomm are scrambling to acquire Intel | 2024-11-03 18:04:45
  • Apple 70/96/140W USB-C power adapter is compatible with 2024 Mac mini and image | 2024-11-03 18:04:48
  • NVIDIA consumer AI PC processor may be launched in September 2025 | 2024-11-03 18:13:59
  • The benchmark test shows that the performance of M4 MAX GPU is only 13% lower than M2 Ultra than M2 Ultra than M3 MAX | 2024-11-03 18:14:06
  • The new Magic Mouse and Magic Keyboard cannot be compatible with the old version of MacOS | 2024-11-03 18:14:10
  • AMD revealed that Ryzen 7 9800X3D performance information game performance increased by 8% compared to the previous generation | 2024-11-03 18:14:20
  • TOPTON Tuo Futong launched D13 mini host: wooden case, dual -net port, starting from $ 399 | 2024-11-03 18:15:29
  • Investment of US $ 10 billion, subsidized 825 million US dollars, American gambling EUV lithography | 2024-11-03 18:17:49
  • 12 departments: Exploring the cross application of nuclear technology in future industries such as quantum computing, new energy storage, brain interface and other future industries | 2024-11-03 18:17:53
  • Yangzhou was selected by the country and sat with Nanjing Suzhou | 2024-11-03 18:27:24
  • Lunar Lake is the only one!Intel future processor no longer integrates memory | 2024-11-03 18:27:50
  • Black Shark 67W Nitrogen Moisturizer Listing: 2C1A Three Exit quickly, 129 yuan | 2024-11-03 18:28:03
  • Interview with Luo Feng, Vice President of Iqoo Products: Behind the pricing of Iqoo 13 is the future investment investment | 2024-11-03 18:28:09
  • Interview with Honor CEO Zhao Ming: Adhere to technological innovation betting AI GT series will focus on the market of young people | 2024-11-03 18:28:13
  • Equipped with PowerVR GPU and Tensor G4 variants composed of different CPUs appear in the running database | 2024-11-03 18:28:15
  • If you choose the M4 Mac MINI memory and SSD, you can buy one new money to buy one new one | 2024-11-03 18:28:19
  • Scientific researchers found the "longevity key" of perovskite solar cells | 2024-11-03 18:30:42
  • Undverted cognition!Chinese scientists have measured the speed of quantum entanglement, and will it confirm that death is just an illusion? | 2024-11-03 18:30:48
  • The monogamy to degenerate the male penis bone, will it still cause intelligence to decline? | 2024-11-03 18:30:52
  • Play game crushing Intel 285K!AMD Ryzen 7 9800X3D will be launched on November 7th 3699 yuan | 2024-11-03 18:32:48
  • Musk said that the brain machine interface can solve most diseases, and the cost of large -scale mass production will be equivalent to the mobile phone. | 2024-11-03 18:35:39
  • 2024 Nobel Prize winner David Baker founded a new company. AI design a new drug form -antibody cage, derived from the Science thesis | 2024-11-03 18:35:43
Latest
Happy Bao | Dream of Dream Medicine Bloom -Graduate Students from our hospital won the second prize of the Capital Medical University 2024 Speech Contest Personal evaluation of the 30 -year history of the best lineup of Ma Lai can barely enter the bench The domestic "three major airlines" C919 aircraft gathered in Chengdu!Practicing flying and going north Shangguang Parents do this to break the "only performance theory", and each child can bloom unique light She, sent a letterless letter to the moon Old cup bonus super lpl champion!Attract the old We Korean aid: I want to support the championship, I want to raise children In 2025, the statutory holidays have been added for 2 days. What is the true attitude of tourism practitioners? What does the PPT look like by Lin updated?After reading it, I was cried by ugliness Beiqing: After the National Football Team Barin will return to Xiamen for the afternoon of the 15th, the 15th day of the 15th National Football Team VS Balling 23 people List exposure: Xie Wen can suspend the race!Fourth goalkeeper+flying wing failure selection The best element of the Super League announced, Wei Shihao missed, 4 people in Shanghai, Luneng 2, Wu Xi was surprised Ding Yongxun fell in love with Zhao Xuehua at first sight and loved each other for 37 years. Now the 36 -year -old son makes him worry Ye Ke has not seen Huang Xiaoming in the checkup. Paparas have been exposed to 5 months of pregnancy. The 24 -year -old Fan Ye compared the 28 -year -old Shi Yunpeng. Aspen: Mbappe has been silent after joining Real Madrid for 4 months. Only 2 interviews have been accepted in 4 months Chen Meng withdrew!Instead, Sun Yingsha was forced to have both and gave birth to 2 questions. What did Ma Lin think? It is the lowest -key county in Shanxi, with world -class famous mountains, but few people know the county name! Ye Ke promised to return the net to cancel the account again as a demon again!Huang Xiaoming is lazy, Ye Ke is alone for pregnancy! Big!Blast Yang Zi accompanied by sleeping: evidence has been obtained, involved in many popular actresses, more synthetic streams out "The Lane Family" Zhuang rushed to drive away his parents and forced Pengfei to buy a house. Only then did you know that Zhuang Chaoying was very lucky Wang Xiaofei hit Big S: A wedding with Ma Xiaomei will be re -wedd, Ma Xiaomei threw out propositions Who is the most runner -up in the World Cup?The Dutch Three Entry Finals are defeated, Argentina is tied, and the other team is the most The Clippers regretted, Harden 19+6+7, after the game, Harden walked towards the Rockets bench, hugged Ethan to pay attention to Rockets 111-103 Clippers!Player scores are released: 4 people are full, 3 people pass, 2 people pull their hips! No accident, this will become the main framework of Amurin Manchester United! Choose a rational student in kindergarten?Can "Run Run" Education really win on the starting line? British PS5 Pro price or permanent drop: the largest retail sales leading price reduction The 19 -day box office broke through 3.3 billion and won the global championship. The strongest movie this year was born. Daolang: The Macau concert is well received, and I won the two former CCTV host of Zhao Pu Li Xiaomeng. 2024 Inner Entertainment is to be exploded, only one year, the gap is obvious Lei Jun is jealous. Xiaomi SU7 has 100,000 offline and 100,000 orders. Guangzhou house tickets can buy new houses in the city!Expert: It is necessary to make the demolished households realized that in the past It is said that the Chery FR Division was established, and the headquarters is located in Shanghai Win 20 points!Du Feng won the first victory, and the new aid 23 points hit a new high. The chef should describe the job from the teachers and students and parents, and the supplier will not run away. Thirty points defeated, the CBA Journey really walked over?The three major foreign aids in Zhejiang broke out, and genius teenagers burst and burst The transformation of Jinji Co., Ltd. suffered a decline of 922 million large orders to terminate the operating pressure of 8.82 million yuan in the first three quarters. 4 games 0 goals!The number of shooters is lost, Alda is deeply trapped in the mud, and the situation of winning the championship deteriorates Crown: 0 to 4!The fatigue is shown. In the first round, Trump took Ding Junhui lightly Korean dramas have evolved to the male lead, no male lead is required The performance benchmark covers the global stock market and gold, and the world is actively configured with FOF to open the closed period CITIC Securities A shares increased by 60% of Yuexiu Capital to reduce holdings by 1% or cash out 5 billion yuan Academician Qian Qihu: The domestic shield is going to the world. I hope that the younger generation of "Chinese tunnelers" will create further glory The whole Chinese class rushes for another year!BLG is highly likely to renew the contract, ELK renewal conditions are only one The husband takes the child to buy milk powder to bring it back to the PS5 Pro, his wife asked the Internet online, and the merchant sent God to assist The most surprising seven players?Downs first!Hilde, three goals on the list! Shenzhen Chuanyin Communications under Chuanyin was awarded the title of new "Little Giant" enterprise in the national specialized specialty The stock price of the market fell by 6.43%in the afternoon of US $ 13.37 in the US real estate investment Sure enough!6 major foreign aid blessings, old acquaintances return, Zhou Peng will lead the Shenzhen team to take off The Egyptian Museum opened the night exhibition tourists with close contact with Pharaoh

©2024 ttnews All rights reserved

Privacy Policy | Service Terms | contact us