Draft:SeLU (Neural Networks)

Definition

Graph of the SeLU activation function

The SELU (Scaled Exponential Linear Unit) activation function is designed to induce self-normalization in neural networks: as activations propagate through the layers of the network, they converge toward zero mean and unit variance.^[1]

Formula

The SeLU activation function is defined as:

$SeLU(x)=\lambda {\begin{cases}\alpha e^{x}-\alpha ,&{\text{if }}x\leq 0\\x,&{\text{if }}x>0\end{cases}}$

where the values of α and λ are obtained by solving fixed-point equations; for the fixed point with zero mean and unit variance, α ≈ 1.6733 and λ ≈ 1.0507.
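As a rough illustration, the formula can be written in plain Python and the fixed-point property checked numerically. This is only a sketch: the constant names `ALPHA` and `LAMBDA` and the helper `gaussian_moment` are chosen here for illustration, and the constants are the rounded values commonly quoted from Klambauer et al.

```python
import math

# Rounded constants for the (mean 0, variance 1) fixed point.
ALPHA = 1.6732632423543772
LAMBDA = 1.0507009873554805


def selu(x: float) -> float:
    """SeLU(x) = lambda * x for x > 0, lambda * alpha * (e^x - 1) otherwise."""
    if x > 0:
        return LAMBDA * x
    return LAMBDA * ALPHA * math.expm1(x)  # expm1(x) = e^x - 1


def gaussian_moment(power: int, lo: float = -10.0, hi: float = 10.0,
                    steps: int = 20000) -> float:
    """Midpoint-rule estimate of E[selu(Z) ** power] for Z ~ N(0, 1)."""
    h = (hi - lo) / steps
    norm = 1.0 / math.sqrt(2.0 * math.pi)
    total = 0.0
    for i in range(steps):
        z = lo + (i + 0.5) * h
        total += selu(z) ** power * norm * math.exp(-0.5 * z * z) * h
    return total


# For a standard normal input, the output moments should stay near (0, 1).
mean = gaussian_moment(1)
variance = gaussian_moment(2) - mean ** 2
print(f"mean = {mean:.4f}, variance = {variance:.4f}")
```

If the constants are right, a standard normal input is mapped to an output whose mean and variance are again close to 0 and 1, which is what the fixed-point equations require.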

Conditions for using the SeLU function

The SeLU activation function must have:

Negative and positive values, to control the mean.
Derivatives close to zero, to dampen the variance if it is too large in the lower layers.
A slope greater than one, to increase the variance if it is too small in the lower layers.
A continuous curve.^[2]
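The four properties listed above can be checked directly on the function and its derivative. The following is a minimal sketch; `selu_grad` is a helper name introduced here, and the constants are the usual rounded values of α and λ.

```python
import math

ALPHA = 1.6732632423543772
LAMBDA = 1.0507009873554805


def selu(x: float) -> float:
    return LAMBDA * x if x > 0 else LAMBDA * ALPHA * math.expm1(x)


def selu_grad(x: float) -> float:
    """Derivative of SeLU: lambda for x > 0, lambda * alpha * e^x otherwise."""
    return LAMBDA if x > 0 else LAMBDA * ALPHA * math.exp(x)


# 1. Both negative and positive outputs occur.
print("outputs:", selu(-1.0), selu(1.0))
# 2. The derivative approaches zero for strongly negative inputs.
print("gradient at -10:", selu_grad(-10.0))
# 3. The slope on the positive side is lambda, which is greater than one.
print("gradient at +2:", selu_grad(2.0))
# 4. The curve is continuous at zero: both one-sided values approach 0.
print("around zero:", selu(-1e-9), selu(1e-9))
```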

Differences from other activation functions

ReLU

ReLU is a nonlinear activation function that is easy to implement.

$ReLU(x)={\begin{cases}0,&{\text{if }}x\leq 0\\x,&{\text{if }}x>0\end{cases}}$

ReLU has a lower computational cost and is easier to use and understand. However, SeLU units cannot "die", because the exponential branch yields nonzero outputs and nonzero gradients for negative inputs.
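The difference can be seen by comparing gradients on negative inputs. This sketch assumes the usual convention that the ReLU gradient is 0 for non-positive inputs; the helper names are illustrative.

```python
import math

ALPHA = 1.6732632423543772
LAMBDA = 1.0507009873554805


def relu_grad(x: float) -> float:
    """Gradient of ReLU, taking 0 at x <= 0 by convention."""
    return 1.0 if x > 0 else 0.0


def selu_grad(x: float) -> float:
    """Gradient of SeLU: lambda for x > 0, lambda * alpha * e^x otherwise."""
    return LAMBDA if x > 0 else LAMBDA * ALPHA * math.exp(x)


# A unit whose pre-activation is negative gets no gradient through ReLU,
# but still receives a small, nonzero gradient through SeLU.
for x in (-3.0, -1.0, -0.1):
    print(f"x = {x:+.1f}  relu' = {relu_grad(x):.3f}  selu' = {selu_grad(x):.3f}")
```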

ELU

ELU is an activation function that behaves exponentially for negative inputs.

$ELU(x)={\begin{cases}\alpha e^{x}-\alpha ,&{\text{if }}x\leq 0\\x,&{\text{if }}x>0\end{cases}}$

Compared with SELU, the ELU activation function is simpler and has a lower computational cost. However, because SeLU keeps updating units with negative values, it is reported to be more accurate, as the network learns faster. ELU also lacks the scale factor λ.^[3]^[4]
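Comparing the two formulas, SeLU is ELU with a fixed α, multiplied by λ. The sketch below makes that relationship explicit; the function names and the default `alpha=1.0` for ELU are choices made here for illustration.

```python
import math

ALPHA = 1.6732632423543772
LAMBDA = 1.0507009873554805


def elu(x: float, alpha: float = 1.0) -> float:
    """ELU: x for x > 0, alpha * (e^x - 1) otherwise."""
    return x if x > 0 else alpha * math.expm1(x)


def selu(x: float) -> float:
    """SeLU expressed as a scaled ELU with fixed alpha."""
    return LAMBDA * elu(x, ALPHA)


def selu_direct(x: float) -> float:
    """SeLU written out from its own piecewise definition."""
    return LAMBDA * x if x > 0 else LAMBDA * (ALPHA * math.exp(x) - ALPHA)


for x in (-2.0, -0.5, 0.0, 0.5, 2.0):
    print(f"x = {x:+.1f}  elu = {elu(x):+.4f}  selu = {selu(x):+.4f}")
```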

SeRLU

Unlike SeLU, which is monotonically increasing, SERLU has a bump-shaped region for negative inputs, of the form $xe^{x}$. The bump ensures that SERLU has a negligible response to large negative inputs, whereas SELU saturates at a constant negative value for those inputs.^[5]
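The contrast in the negative region can be illustrated numerically. This sketch is not the full SERLU definition: it assumes the negative branch is proportional to x·e^x (the form that decays to zero for large negative inputs) and omits SERLU's own scale constants, so `bump` is a hypothetical, unscaled stand-in.

```python
import math

ALPHA = 1.6732632423543772
LAMBDA = 1.0507009873554805


def selu(x: float) -> float:
    return LAMBDA * x if x > 0 else LAMBDA * ALPHA * math.expm1(x)


def bump(x: float) -> float:
    """Unscaled bump x * e^x, used here only for negative inputs (assumption)."""
    return x * math.exp(x)


# The bump dips to its minimum at x = -1 and then returns toward zero,
# while SeLU levels off at the constant -lambda * alpha.
for x in (-0.5, -1.0, -2.0, -5.0, -20.0):
    print(f"x = {x:+6.1f}  bump = {bump(x):+.6f}  selu = {selu(x):+.6f}")
```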

Advantages and disadvantages

Advantages

Batch normalization, or any other explicit normalization, is not required, because SeLU keeps the mean and variance stable throughout the network.
It can be used for both binary and multiclass classification.
It helps with vanishing and exploding gradient problems.

Disadvantages

It works best with a specific weight initialization, LeCun normal initialization, so other initialization methods may not produce the expected results.
Because it is relatively recent and less common than other activation functions, research on it is harder to find.^[6]
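The dependence on initialization can be illustrated with a small toy experiment: random inputs are pushed through a stack of fully connected SeLU layers, once with LeCun normal weights (standard deviation sqrt(1 / fan_in)) and once with He normal weights (standard deviation sqrt(2 / fan_in)). This is a sketch under stated assumptions, not a benchmark: the layer sizes, the seed, the untruncated Gaussian, and the function name `propagate` are all choices made here.

```python
import math
import random

ALPHA = 1.6732632423543772
LAMBDA = 1.0507009873554805


def selu(x: float) -> float:
    return LAMBDA * x if x > 0 else LAMBDA * ALPHA * math.expm1(x)


def propagate(gain: float, width: int = 100, depth: int = 8,
              batch: int = 12, seed: int = 0) -> tuple[float, float]:
    """Return (mean, variance) of the last layer's activations.

    Weights are drawn from N(0, gain / fan_in): gain = 1 corresponds to
    LeCun normal, gain = 2 to He normal. There are no biases.
    """
    rng = random.Random(seed)
    acts = [[rng.gauss(0.0, 1.0) for _ in range(width)] for _ in range(batch)]
    std = math.sqrt(gain / width)
    for _ in range(depth):
        weights = [[rng.gauss(0.0, std) for _ in range(width)]
                   for _ in range(width)]
        acts = [[selu(sum(w * a for w, a in zip(row, sample)))
                 for row in weights] for sample in acts]
    flat = [a for sample in acts for a in sample]
    mean = sum(flat) / len(flat)
    variance = sum((a - mean) ** 2 for a in flat) / len(flat)
    return mean, variance


mean_lecun, var_lecun = propagate(gain=1.0)
mean_he, var_he = propagate(gain=2.0)
print(f"LeCun normal: mean = {mean_lecun:+.3f}, variance = {var_lecun:.3f}")
print(f"He normal:    mean = {mean_he:+.3f}, variance = {var_he:.3f}")
```

In this setup the LeCun normal run should keep the activation statistics near zero mean and unit variance, while the larger He normal weights let the variance grow from layer to layer.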

References

^ Klambauer, Günter; Unterthiner, Thomas; Mayr, Andreas; Hochreiter, Sepp (7 September 2017). "Self-Normalizing Neural Networks". arXiv:1706.02515v5 [cs.LG].
^ Huang, Zhen; Ng, Tim; Liu, Leo; Mason, Henry; Zhuang, Xiaodan; Liu, Daben (23 March 2020). "SNDCNN: Self-Normalizing Deep CNNs with Scaled Exponential Linear Units for Speech Recognition". arXiv:1910.01992 [cs.LG].
^ Marchisio, Alberto; Hanif, Muhammad Abdullah; Rehman, Semeen; Martina, Maurizio; Shafique, Muhammad (27 October 2018). "A Methodology for Automatic Selection of Activation Functions to Design Hybrid Deep Neural Networks".
^ Nguyen, Anh; Pham, Khoa; Ngo, Dat; Ngo, Thanh; Pham, Lam (5 April 2021). "An Analysis of State-of-the-art Activation Functions For Supervised Deep Neural Network". arXiv:2104.02523 [cs.LG].
^ Zhang, Guoqiang; Li, Haopeng (27 July 2018). "Effectiveness of Scaled Exponentially-Regularized Linear Units (SERLUs)". arXiv:1807.10117 [cs.LG].
^ Upasani, Tanmay (2024-09-22). "SeLU: Why and why not?". Medium. Retrieved 2024-11-23.