Natural Language Processing with Deep Learning
Plan
- What is Natural Language Processing? The nature of human language
- What is Deep Learning
- Why is language understanding difficult
- Intro to the application of Deep Learning of NLP
What is Natural Language Processing?
NLP Applications
- Spell checking, keyword search, finding synonyms
- Extracting information from websites
- product price, dates, location, people or company names
- Classifying: reading level of school texts, positive/negative sentiment of longer documents
- Machine translation
- Spoken dialogue
- Complex question answering
What’s special about human language?
A human language is a system specifically constructed to convey the speaker/writer’s meaning
- Not just an environmental signal, it’s a deliberate communication
- Using an encoding which little kids can quickly learn
A human language is a discrete/symbolic/categorical signaling system
What’s Deep Learning
-
Deep learning is a subfield of machine learning
-
Most machine learning methods work well because of human-designed representations and input features
-
Machine learning becomes just optimizing weights to best make a final prediction
-
Representation learning attempts to automatically learn good features or representations
-
Deep learning algorithms attempt to learn representation and an output
Raw signals -> representation that can be used for features
Reasons for Exploring Deep Learning
- Manually designed features are often over-specified, incomplete and take a long time to design and validate
- Learned Features are easy to adapt, fast to learn
- Deep learning provides a very flexible, universal, learnable framework for representing world, visual and linguistic information
- Deep learning can learn unsupervised and supervised
Other reasons
- Large amounts of training data favor deep learning
- Faster machines and multicore CPU/GPUs favor Deep learning
- New models, algorithms, ideas
- Better ,more flexible learning of intermediate representations
- Effective end-to-end joint system learning
- Effective learning methods for using context and transferring between tasks
Improved performance
- Multivariate Calculus, Linear Algebra - MATH 51, CME 100
- Basic Probability and Statistics - CS 109
- Fundamentals of Machine Learning - CS229 or CS221
- loss functions
- taking simple derivatives
- performing optimization with gradient descent
Why is NLP hard
왜 NLP 가 어려운지 이해를 해야한다…
- Complexity in representing, learning and using linguistic/situational/world/visual knowledge
- Human languages are ambiguous (unlike programming and other formal languages)
- Human language interpretation depends on real world, common sense, and contextual knowledge
People & Papers
- Danqi Chen
- A Fast and Accurate Dependency Parser using Neural Networks / EMNLP 2014 / SyntaxNet(?)
- Reading Wikipedia to Answer Open-Domain Questions / ACL 2017
'Class' 카테고리의 다른 글
[cs224n] Lecture 9 Machine Translation and Advanced Recurrent LSTMs and GRUs (0) | 2018.04.11 |
---|---|
[cs224n] Lecture 4 Word Window Classification and Neural Networks (0) | 2018.03.25 |
Algorithm 정리 (0) | 2015.05.08 |
computer network 정리 (0) | 2015.05.08 |
OS(Operating System) 정리 (2) | 2015.04.29 |