WEBVTT

00:00.000 --> 00:04.600
In this video we are discussing
the Pig Latin data model

00:04.600 --> 00:09.100
We know that Pig Latin is a high-level
language which is available with Pig

00:09.100 --> 00:16.260
So, we are going to discuss the Pig Latin data model. We
shall discuss it with one diagram and some explanations

00:16.260 --> 00:21.340
Just consider this diagram. So here we are
having the bag and we are having the tuple

00:21.340 --> 00:25.960
You can find that this row will
be known as a tuple. And this is a bag

00:25.960 --> 00:32.619
So a bag consists of multiple tuples. Here
it is a field and there is a field value

00:32.619 --> 00:40.080
So the data model of Pig Latin
is fully nested and it allows

00:40.080 --> 00:48.280
complex non-atomic data types. So this Pig Latin
data model supports complex non-atomic data types

00:48.280 --> 00:54.580
What are they? So they can be a map and a tuple. You
can find that this is a tuple, and then what is a map?

00:54.580 --> 01:03.980
A map means a collection of key-value pairs. So map and tuple can be
considered as complex non-atomic data types. So the above is the

01:03.980 --> 01:09.620
diagrammatic representation of Pig Latin's
data model. So let us discuss all these

01:09.620 --> 01:16.740
terms one by one. So at first we are
starting with the atom. A single value in

01:16.740 --> 01:24.160
Pig Latin, irrespective of the data type, is known
as an atom. So an atom is stored as a string and this

01:24.160 --> 01:30.840
can be used as a string or a number. So, an atom
is a single value and

01:30.840 --> 01:35.660
it will be stored in the form of a string
and can be operated on as a string or a number

01:36.220 --> 01:43.040
A piece of data or a simple atomic value is
known as a field. As an example, 'Ashim' can

01:43.040 --> 01:49.280
be considered as an atom. Next, we are going
to the tuple. Now, what is a tuple? A record

01:49.280 --> 01:53.660
which is formed by an ordered set
of fields is known as a tuple

01:54.160 --> 02:05.524
So you can find that this atomic value is also known as a field, and then an ordered set of
fields will form one tuple. As an example, you can consider that (Ashim, 25) is an example

02:05.524 --> 02:12.125
of a tuple. Next, we are going to the bag. A
collection of tuples will be known as a bag,

02:12.125 --> 02:18.364
and each tuple can have any number of fields.
A bag is represented within curly braces

02:18.364 --> 02:24.045
So a bag will be represented within curly braces.
As an example, we can consider that within

02:24.045 --> 02:29.165
curly braces we have one tuple, that
is (Ashim, 25), another tuple (Rahul,

02:29.165 --> 02:35.885
30), another tuple (Abhishek, 23). So a bag is
nothing but a collection of tuples and should

02:35.885 --> 02:44.205
be enclosed within curly braces. Next, we are
going to the map. A map, or data map, is a set of

02:44.205 --> 02:50.845
key-value pairs. It is a set of key-value
pairs. The key needs to be of type character

02:50.845 --> 02:56.825
array and should be unique. The key should have
some unique values there. And the value might be

02:56.825 --> 03:02.625
of any type and it can be represented or it
should be represented within square brackets

03:02.625 --> 03:10.924
You can find that, as an example, we have
name#Ashim and age#25

03:11.165 --> 03:16.364
And it has been enclosed within third brackets,
that is, within square brackets. That means it is

03:16.364 --> 03:26.084
representing a map here. So a map is nothing but a collection of key-value pairs;
the keys will have distinct values and the values can be of different data types

03:26.545 --> 03:31.605
Next, we have the relation. Now, what
is a relation? A relation is a bag of tuples

03:31.605 --> 03:38.665
and the relations in Pig Latin are unordered and
there is no guarantee that tuples are processed

03:38.665 --> 03:46.125
in any particular order and that is known as
a relation. So the relation is a bag of tuples

03:46.125 --> 03:51.785
actually. So, now this is the concept and
this is the Pig Latin data model and we have

03:51.785 --> 03:56.924
discussed all the respective terms in
more detail for your better understanding

03:57.465 --> 03:58.545
Thanks for watching this video
