WEBVTT

00:00.149 --> 00:02.730
in this video we are discussing pig

00:02.730 --> 00:05.370
latin data model we know that pig latin

00:05.370 --> 00:07.410
is the high-level language which is

00:07.410 --> 00:08.760
available with pig

00:08.760 --> 00:11.280
so pig latin data model we are going to

00:11.280 --> 00:13.380
discuss we shall discuss it with one

00:13.380 --> 00:15.619
diagram and some explanations just

00:15.619 --> 00:18.630
consider this diagram so here you are

00:18.630 --> 00:20.760
having the bag and we are having that

00:20.760 --> 00:22.470
tupple you can find that this is this

00:22.470 --> 00:24.900
row will be known as a couple and this

00:24.900 --> 00:27.090
is a bag so bag is consisting of

00:27.090 --> 00:29.730
multiple tuples and here it is a field

00:29.730 --> 00:33.899
and there is a filled value so that data

00:33.899 --> 00:37.410
model of pig latin the data model of pig

00:37.410 --> 00:39.930
latin is fully nested and it allows

00:39.930 --> 00:43.559
complex non atomic data types so this

00:43.559 --> 00:46.320
pig latin data model supports complex

00:46.320 --> 00:49.230
non atomic data types what are they so

00:49.230 --> 00:51.329
they can be a map and at Apple you can

00:51.329 --> 00:53.489
find that this is at Apple and then what

00:53.489 --> 00:55.500
is the map map means collection of

00:55.500 --> 00:58.530
key-value pairs and that map and tableau

00:58.530 --> 01:00.960
can be considered as complex non had to

01:00.960 --> 01:03.600
make data types so the above is the

01:03.600 --> 01:06.810
diagrammatic representation of piglet

01:06.810 --> 01:09.240
ins data model so let us discuss all

01:09.240 --> 01:13.500
these terminologies one by one so at

01:13.500 --> 01:15.240
first we are starting with this atom a

01:15.240 --> 01:17.670
single value in pig latin the

01:17.670 --> 01:20.159
irrespective of the data is known as

01:20.159 --> 01:24.090
atom so atom stores as a string and this

01:24.090 --> 01:27.570
can be used as string and number so atom

01:27.570 --> 01:30.600
is the single value at a time and it can

01:30.600 --> 01:32.130
be it will be stored in the form of

01:32.130 --> 01:34.650
string and can be operated as a string

01:34.650 --> 01:38.369
or in number a piece of data or a simple

01:38.369 --> 01:41.400
atomic value is known as a field as an

01:41.400 --> 01:44.520
example ashame can be considered as an

01:44.520 --> 01:47.009
atom next we are going for the tupple

01:47.009 --> 01:49.710
now what is the tupple a record which is

01:49.710 --> 01:52.500
formed by an ordered set of fields is

01:52.500 --> 01:54.899
known as a topple so you can find that

01:54.899 --> 01:57.509
this this atomic value is also known as

01:57.509 --> 01:59.909
a field and then order set of fields

01:59.909 --> 02:01.110
will form one tupple

02:01.110 --> 02:03.299
as an example you can consider or shem

02:03.299 --> 02:07.920
comma 25 is an example of a tuple next

02:07.920 --> 02:10.289
we are going for the bag the collection

02:10.289 --> 02:12.750
of tuples will be known as a bag and

02:12.750 --> 02:15.210
each tuple can have any number of fields

02:15.210 --> 02:18.360
a bag is represented within curly braces

02:18.360 --> 02:21.000
a bag will be represented within curly

02:21.000 --> 02:23.010
braces as an example we can consider

02:23.010 --> 02:25.440
that within curly braces were having one

02:25.440 --> 02:28.140
tupple that is a shim comma 25 another

02:28.140 --> 02:28.740
tuple

02:28.740 --> 02:31.230
Rahul comma 30 another tuple hawisher

02:31.230 --> 02:34.020
command 23 so bag is nothing but

02:34.020 --> 02:36.120
collection of tuples and should be

02:36.120 --> 02:40.320
enclosed within curly braces next one we

02:40.320 --> 02:43.380
are going for this map a map or data map

02:43.380 --> 02:46.440
is a set of key value pairs it is a set

02:46.440 --> 02:49.350
of key value pairs the key needs to be

02:49.350 --> 02:52.050
of type character array and should be

02:52.050 --> 02:54.090
unique the key who should have some

02:54.090 --> 02:56.580
unique values there and the value might

02:56.580 --> 02:58.709
be of any type and it can be represented

02:58.709 --> 03:01.470
or it should be represented within

03:01.470 --> 03:04.140
square brackets you can find that as an

03:04.140 --> 03:08.340
example name number is head awesome and

03:08.340 --> 03:11.850
each number is our 25 and it has been

03:11.850 --> 03:14.670
enclosed within third brackets within

03:14.670 --> 03:16.410
the square brackets that means it is

03:16.410 --> 03:18.660
representing our map here

03:18.660 --> 03:21.150
so map is nothing but collection of key

03:21.150 --> 03:21.840
value pairs

03:21.840 --> 03:23.820
he will have the distinct values and

03:23.820 --> 03:25.920
values can have of different data types

03:25.920 --> 03:28.440
next one we are having the relation now

03:28.440 --> 03:30.870
what is a deletion a deletion is a bag

03:30.870 --> 03:34.170
of tuples and the deletions in pig latin

03:34.170 --> 03:37.080
are unordered and there is no guarantee

03:37.080 --> 03:39.330
that tuples are processed in any

03:39.330 --> 03:42.239
particular order and that is known as a

03:42.239 --> 03:45.510
relation so the relation is a bag of

03:45.510 --> 03:48.090
tuples actually so now this is the

03:48.090 --> 03:51.060
concept and this is the piglet in data

03:51.060 --> 03:52.860
model and we have discussed all the

03:52.860 --> 03:54.780
respective terminologies in the more

03:54.780 --> 03:56.810
details for your better understanding

03:56.810 --> 04:00.500
thanks for watching this video
