Through the eyes of a machine - Young World Club

Through the eyes of a machine

  • POSTED ON: 22 Feb, 2024
  • TOTAL VIEWS: 401 Views
  • POSTED BY: Archana Subramanian | Text: Veena Prasad
  • ARTICLE POINTS: 100 Points

Can you identify these pictures?

Of course you can. How would you describe them? Pictures of humans, perhaps? And you might notice that some are smiling, and some are not. You might mentally classify them as cheerful people and grumpy people. Instead, you may focus on those with glasses and those without. Or you may notice that some people are wearing jackets, and some are not.
There are so many ways of classifying a few pictures!

Let’s perform a thought experiment now. Let’s assume you are not of the human species. Maybe you’re an alien, maybe you’re a machine – it doesn’t matter as long as you’re not human – and you have no idea what a human is, what a smile is, what glasses and jackets are, or what “classifying” means.

But I have to teach you all that. Where do I start? I start with what is known as “training data”. My training data for smile recognition would be a set of pictures (the more the better), each with a label that says whether or not it shows a smile. Each picture goes into the machine, along with its label. At the end of the training, I would give this machine a completely new image, and see whether it recognises a smile.
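For readers who like to tinker, here is a minimal sketch of that idea in Python. It is only an illustration, not how real smile detectors are built: each "picture" is assumed to be boiled down to two made-up numbers (`mouth_curve` and `eye_squint`), and the learning method is a simple "nearest average" rule chosen for this example. The names `train` and `predict` are hypothetical.

```python
# Toy training data. ASSUMPTION: each picture is reduced to two invented
# numbers, (mouth_curve, eye_squint), and paired with a human-written label.
training_data = [
    ((0.9, 0.7), "smile"),
    ((0.8, 0.6), "smile"),
    ((0.7, 0.8), "smile"),
    ((0.1, 0.2), "no smile"),
    ((0.2, 0.1), "no smile"),
    ((0.3, 0.3), "no smile"),
]


def train(examples):
    """Average the numbers seen for each label (one 'typical' point per label)."""
    sums, counts = {}, {}
    for features, label in examples:
        total = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            total[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {
        label: [s / counts[label] for s in total]
        for label, total in sums.items()
    }


def predict(model, features):
    """Return the label whose typical point is closest to the new example."""
    def distance_sq(centre):
        return sum((a - b) ** 2 for a, b in zip(centre, features))

    return min(model, key=lambda label: distance_sq(model[label]))


model = train(training_data)

# A completely new "picture" that was never part of the training data.
new_picture = (0.85, 0.65)
print(predict(model, new_picture))
```

The machine never sees a definition of "smile". It only sees labelled examples, and then guesses the label of the new picture by comparing it with what it has seen.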

What about other expressions? And glasses and jackets? It’s a LOT more work, and a LOT more training data!

Specifically, I would need hundreds of thousands of images showing every possible human expression, all of them labelled correctly. I would need a variety of clothing examples, again labelled correctly. And then glasses. And every other element that I might expect to find in the environment that I am training the machine on. This is machine learning.

What is machine intelligence?

If your machine can go beyond its training data, understand images that are not part of the original data set, and, in some way, extrapolate the learning, it can be called intelligent.
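One way to picture "going beyond the training data" is to compare two toy machines in Python. This is a sketch under stated assumptions: the pictures are again reduced to two invented numbers, and both machines (`memoriser` and `nearest_neighbour`) are hypothetical examples written for this illustration. The first only remembers exactly what it was shown; the second picks the label of the most similar example it knows.

```python
# ASSUMPTION: each picture is two invented numbers, mapped to its label.
training_data = {
    (0.9, 0.7): "smile",
    (0.8, 0.6): "smile",
    (0.1, 0.2): "no smile",
    (0.2, 0.1): "no smile",
}

# New pictures that do not appear anywhere in the training data.
unseen = [
    ((0.85, 0.75), "smile"),
    ((0.15, 0.25), "no smile"),
    ((0.7, 0.9), "smile"),
]


def memoriser(features):
    """Only recognises pictures it has seen exactly; otherwise gives up."""
    return training_data.get(features, "unknown")


def nearest_neighbour(features):
    """Uses the label of the most similar picture in the training data."""
    closest = min(
        training_data,
        key=lambda known: sum((a - b) ** 2 for a, b in zip(known, features)),
    )
    return training_data[closest]


def accuracy(classifier, examples):
    """Fraction of examples for which the classifier gives the right label."""
    correct = sum(1 for features, label in examples if classifier(features) == label)
    return correct / len(examples)


print("memoriser:", accuracy(memoriser, unseen))
print("nearest neighbour:", accuracy(nearest_neighbour, unseen))
```

The memoriser gets every new picture wrong because it can only repeat what it was given, while the second machine extends what it learnt to pictures it has never seen.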

This extrapolation is built using statistics-based algorithms that deploy a variety of techniques to fine-tune a model for different objectives, such as reading expressions, classifying images based on clothing, understanding indoor and outdoor pictures, X-rays, satellite images and anything that humans can make sense of.

In a similar way, models are trained to understand written texts and their styles — formal writing, casual writing, funny writing, the complete works of Shakespeare, academic text books, medical diagnoses, and anything that humans can read.

When we put all this learning together, we can come up with an AI app that can create a completely new image based on instructions such as “generate an image of a child smiling in a playground”. Or anything really, limited only by the human imagination and, of course, the training dataset.

Now, let’s pretend you are a machine that has just learnt to recognise smiling faces. Can you click on all of them?
