logo

Natural Language Processing

Cybersecurity

What does it do?

Natural language processing is the study of human language as it can be understood by computers - words and their grammatical context being put into a strict logic, but also more importantly strict logic that allows for the understanding of more fuzzy logic words and their grammatical context.

Natural language processing has its challenges, in that the languages that real people use do not often conform to formal language structure  (Kamp & Reyle 1993). If they did, then translating them into data that a computer can process for its own tasks would, hypothetically, be roughly as simple as converting a program from one language to another. Attempts to push real-life language into unambiguous, perfectly rational and logical form, have mostly failed so far  (JONES 2012), and so natural language processing is the endeavour to move in the other direction, and form a digital understanding of the language(s) that people already use, that can then be converted into a formal language.

Natural language processing has a rich history dating back to the 1950s, but currently uses neural natural language processing(Gomez-Perez, Denaux & Garcia-Silva 2020),  which involves feature learning and machine learning methods in the style of deep neural networks to process terabytes of examples of human text. This would logically be supervised learning (Zhou 2018), meaning it requires manual correction for mistakes, because unsupervised learning would be circular in logic - it would require a machine that already knows valid language in order to validate the language.

Natural language processing is found most commonly in daily life in personal virtual assistants (Ana Berdasco et al. 2019), such as Alexa or Google Assistant. These apps listen at all times for a command which is intended to be difficult to encounter in ordinary conversation. For Alexa, it's "Hey Alexa," and for Google Assistant,  it's "Okay Google." They send the recording over the internet to their respective servers for a two-step process: First the voice is run through voice recognition (Kamath, Liu & Whitaker 2019), and then the voice, now text, is run through the natural language processing of the home servers. The additional computing power allows it to be done with minimal turnaround time. The natural language processing allows the server to guess with increasing certainty what it was that the user meant for their virtual assistant to hear, and then to guess again what it was they wanted their virtual assistant to do. The virtual assistant then executes the orders sent by the home server.

Virtual assistants, in this manner, have a variety of functions, including playing music, turning on or off devices, setting timers and other reminders, as well as many others  (Ana Berdasco et al. 2019). In this manner, and using only their voice, a user can enact various kinds of utilitarian tasks that can be done by appliances and other machinery in their home.

Natural language processing is also used by Facebook (‘Facebook wants you to put a Portal camera and microphone in your home’ 2018) for advertising purposes. While users have the Facebook app installed, the Facebook app listens for all words said in proximity to it, and sends that over the internet to its home servers. Similar to Alexa and Google Assistant, Facebook's high computing power allows it to use a two step process, that of converting the speech to text, and the text to language data, rather efficiently. Facebook corporate then stores this data as market research, which it uses for targeted advertisement placement as well as sellable market research for other companies.  

What is the likely impact?

A major way that NLP could impact the world is that as computers become better at understanding complex “human language”, such as differentiating between contexts, e.g. when one yells “fire” at a building that is on fire versus “fire” when one instructs a marksman to shoot (Kamath, Liu & Whitaker 2019). Another example would be when one says the word “sheep” and it looks in its language database and finds the word “sheep”, but will probably then act on the assumption that a sheep is black, because the colour of sheep is commonly listed as black. It won’t induce that the sheep is white because it’s rare that we have to say a sheep is white, as it is perceived as ‘common-sense’. 

It is when that computer can understand ‘common-senses’ and important contexts in when words are said that we will be able to have seamless conversations about deep subjects and with emotions. Robots will be able to have AI that can talk in a manner indistinguishable from a human. We could see NLP integrated even more into our everyday life; it could lead to a major upheaval of the service sector, as we could potentially see AI with advanced NLP replace the need for actual humans. 

How will this affect you?

There are two potential ways that NLP in combination with AI will affect both the individual and the labour force as a whole.  (Frey & Osborne 2017; Frank et al. 2019; Arntz, Gregory & Zierahn 2016; Pajarinen, Rouvinen & Ekeland 2015; Brzeski & Burk 2015; ‘The computerisation of European jobs’ n.d.; James Doubek 2017; Acemoglu & Restrepo 2017)

  

References

- Acemoglu, D & Restrepo, P 2017, ‘Robots and Jobs: Evidence from US Labor Markets.” w23285’,.
- Ana Berdasco, Gustavo López, Ignacio Diaz, Luis Quesada & Luis A. Guerrero 2019, ‘User Experience Comparison of Intelligent Personal Assistants: Alexa, Google Assistant, Siri and Cortana’, Proceedings, vol. 31, no. 1, p. 51.
- Arntz, M, Gregory, T & Zierahn, U 2016, The risk of automation for jobs in OECD countries: A comparative analysis, Paris: OECD Publishing, Paris.
- Brzeski, C & Burk, I 2015, ‘Die Roboter kommen’, Folgen der Automatisierung für den deutschen Arbeitsmarkt. INGDiBa Economic Research, vol. 30.
- ‘Facebook wants you to put a Portal camera and microphone in your home’ 2018, CSO (Online).
- Frank, MR, Autor, D, Bessen, JE, Brynjolfsson, E, Cebrian, M, Deming, DJ, Feldman, M, Groh, M, Lobo, J, Moro, E, Wang, D, Youn, H &Rahwan, I 2019, ‘Toward understanding the impact of artificial intelligence on labor’, Proc Natl Acad Sci U S A, vol. 116, no. 14, pp. 6531–6539.
- Frey, CB & Osborne, MA 2017, ‘The future of employment: How susceptible are jobs to computerisation?’, Technological forecasting & social change, vol. 114, no. January, pp. 254–280. 
- Gomez-Perez, JM, Denaux, R & Garcia-Silva, A 2020, A Practical Guide to Hybrid Natural Language Processing: Combining Neural Models and Knowledge Graphs for NLP, Cham: Springer International Publishing AG, Cham. 
- James Doubek 2017, ‘Automation Could Displace 800 Million Workers Worldwide By 2030, Study Says’, All Tech Considered [BLOG]. 
- JONES, GM 2012, ‘In the Land of Invented Languages: Esperanto Rock Stars, Klingon Poets, Loglan Lovers, and the Mad Dreamers Who Tried to Build a Perfect Language – By Arika Okrent’, , vol. 22, no. 2, pp. E115–E116. 
- Kamath, Uday author, Liu, John & Whitaker, James 2019, ‘Deep Learning for NLP and Speech Recognition’ John author Liu, James author Whitaker, & SpringerLink (Online service) (eds),. 
- Kamp, Hans author & Reyle, U 1993, ‘From Discourse to Logic Introduction to Modeltheoretic Semantics of Natural Language, Formal Logic and Discourse Representation Theory’ U author Reyle (ed.),. 
- Pajarinen, M, Rouvinen, P & Ekeland, A 2015, ‘Computerization threatens one-third of Finnish and Norwegian employment’, EtlaBrief, vol. 34, pp. 1–8. 
- ‘The computerisation of European jobs’, viewed 17 October 2020, . 
- Zhou, Z-H 2018, ‘A brief introduction to weakly supervised learning’, National Science Review, vol. 5, no. 1, pp. 44–53.