Skip to content

Latest commit

 

History

History
492 lines (352 loc) · 19.5 KB

index.md

File metadata and controls

492 lines (352 loc) · 19.5 KB
tags leadImage
Scalar Types, Abstract Types, Filtering, Slicing
illustration_02.jpg

Chapter 2 - At the Hotel in Bistritz

We continue to read the story as we think about the database we need to store the information. The important information is in bold:

Jonathan Harker has found a hotel in Bistritz, called the Golden Krone Hotel. He gets a welcome letter there from Dracula, who is waiting in his castle. Jonathan Harker will have to take a horse-driven carriage to get there tomorrow. Jonathan Harker is originally from London. The innkeeper at the Golden Krone Hotel seems very afraid of Dracula. He doesn't want Jonathan to leave and says it will be dangerous, but Jonathan doesn't listen. An old lady gives Jonathan a golden crucifix and says that it will protect him. Jonathan is embarrassed, and takes it to be polite. Jonathan has no idea how much it will help him later.

Now we are starting to see some detail about the city of Bistritz: it contains a hotel. To represent this, we could add another property to City, and call it important_places. That's where places like the Golden Krone Hotel could go. We're not sure if places will be their own object types yet, so we'll just make it an array of strings for now: important_places: array<str>; We can put the names of important places in there for the time being and develop it into a link later. It will now look like this:

type City {
  required name: str;
  modern_name: str;
  important_places: array<str>;
}

After doing a migration, let's insert Bistritz with the important_places property:

insert City {
  name := 'Bistritz',
  modern_name := 'Bistrița',
  important_places := ['Golden Krone Hotel'],
};

After this insert, our database will have two cities called Bistritz. It would be nice to delete one of them but first we will need to learn the filter and delete keywords to do so. We will learn filter in this chapter, and delete in the next, and then we will delete our duplicate Bistritz.

Enums, scalar types, and extending

Our game is going to need to have player characters in it, which means that users of the game are going to have to choose what type of character they want to be. The book is probably based in 1893, and our game will let the characters use classes that make sense for this time period. That means that we will need some sort of type that allows us to select from a number of options. EdgeDB has a type called an enum that will let us do this.

Simple types like int64 are what are known as scalar types, which means that they only hold a single value at a time. An enum allows us to select from multiple options, but it is a scalar type too because you can only pick one. The other types (City, NPC) are object types because they can hold multiple values at the same time.

The other keyword we will see for the first time is extending, which means to take a type as a base and extend it. This keyword gives you all the power of the type that you are extending, and lets you add more on top. Enums in EdgeDB are created by extending a base enum type with the list of values (choices) we want to choose from. The syntax looks like this:

scalar type Class extending enum<Rogue, Mystic, Merchant>;

Feel free to add more classes if you want! But in our schema we will just have three classes for simplicity.

Did you notice that scalar type ends with a semicolon and the other types don't? That's because the object types have a {} to make a full expression. But here on a single line we don't have {} so we need the semicolon to show that the expression ends here.

To choose between the values in an enum, you just use a . (a dot). For our enum, that means we can choose Class.Rogue, Class.Mystic, or Class.Merchant.

This Class type is going to be for player characters in our game, not the NPC objects from the book (their stories, choices and statistics are already decided). That means that we will need both an PC type and an NPC type. These two types need to be different, but they are pretty similar to each other! They both have a name, places_visited, and probably some other properties that we will add later. This is a good case for a base type. Inside this base type we can put all the properties that are common to PC and NPC, which will use the extending keyword to gain the properties that Person has.

This part of the schema will look like this...except for one change we will make.

type Person {
  required name: str;
  multi places_visited: City;
}

type PC extending Person {
  required class: Class;
}

type NPC extending Person;

So what change should we make? Well, the Person type at the moment is a concrete type, which means that we would be able to insert Person objects, PC objects, and NPC objects. But we only want to use Person as a base for other types to extend from, not a concrete type that we can insert. So in this case we will change type Person to abstract type Person.

With that change done, the schema now looks like this. Let's do a migration now.

abstract type Person {
  required name: str;
  multi places_visited: City;
}

type PC extending Person {
  required class: Class;
}

type NPC extending Person;

Because Person is an abstract type, we can't insert it directly. Let's try it though and see what the error is! Give this query a try:

insert Person {name := 'Mr. HasAName'};

As expected, we can't insert this Person type because it is abstract.

error: QueryError: cannot insert into abstract object type 'default::Person'
  ┌─ <query>:1:8
  │
1 │ insert Person {name := 'Mr. HasAName'};
  │        ^^^^^^ error

However, select on an abstract type is just fine. A select on a Person type will select all of the objects of all the types that extend Person. So let's add a PC object now so that the result of a select Person query will be more interesting.

We'll make a PC called Emil Sinclair who is a mystic. He hasn't visited anywhere yet so he won't have any places_visited to link to.

insert PC {
  name := 'Emil Sinclair',
  class := Class.Mystic,
};

Note that we didn't just write Mystic, because we have to choose the enum Class and then make a choice of one of the values. Just typing Mystic would make EdgeDB think that we were trying to link to objects of a type called Mystic.

However, EdgeDB also allows you to choose an enum just by writing a string that matches the value. So this insert would have worked too:

insert PC {
  name := 'Emil Sinclair',
  class := 'Mystic',
};

But if you were to type something like class := 'Wrestler' it wouldn't work, because Wrestler is not one of the values inside the Class enum.

edgedb error: InvalidValueError: invalid input value for enum 
'default::Class': "Wrestler"

Now let's do a select on the abstract type Person to see the objects inside, plus the properties common to both: name and places_visited.

select Person {
  name,
  places_visited: {
    name
  }
};

This gives us an output showing objects of both the PC and NPC type.

{
  default::NPC {
    name: 'Jonathan Harker',
    places_visited: {
      default::City {name: 'Munich'},
      default::City {name: 'Buda-Pesth'},
      default::City {name: 'Bistritz'},
    },
  },
  default::PC {name: 'Emil Sinclair', places_visited: {}},
}

Casting

Casting means to quickly change one type into another. Casting is used a lot in EdgeDB because it is strict about types, and will refuse to do operations on two types that are different. A lot of casting is done automatically out of convenience, such as with numbers. Take this query for example:

select 9 + 9.9;

The 9 in the query is an int64, while 9.9 is a float64. And though 9 and 9.9 are different types, EdgeDB will not generate an error here and will just give the output of 18.9, returning a float64. You can confirm that here:

select (9 + 9.9) is float64;

This will give true, while select (9 + 9.9) is float32; gives false.

When a cast is done without you needing to type anything extra, it is known as an implicit cast.

When you need to do the cast yourself, it is called an explicit cast. In this case you can indicate the type using <> angle brackets. For example, this will generate an error:

select '9' + 9;

EdgeDB tells us the exact problem here:

error: InvalidTypeError: operator '+' cannot be applied to operands of type
'std::str' and 'std::int64'
  ┌─ <query>:1:8
  │
1 │ select '9' + 9;
  │        ^^^^^^^ Consider using an explicit type cast or a conversion function.

And to fix it, just use the angle brackets:

select <int64>'9' + 9;

And you will get 18, a 64-bit integer.

But don't worry, a cast won't work if the input is invalid. You can't just cast a bunch of letters into an <int64> for example:

db> select <int64>"Hi I'm a number please add me to" + 9;
edgedb error: InvalidValueError: invalid input syntax for type std::int64:
"Hi I'm a number please add me to"

You can cast more than once at a time if you need to. This example isn't something you will need to do but shows how you can cast over and over again if you want:

select <str><int64><str><int32>50 is str;

Did you notice that the output for this query is {true}? That's because casting works from right to left, with the final cast on the far left. You can follow the casts by reading from right to left using the word "into":

  • "The number 50 into an int32 into a string into an int64 into a string".

Or you can read it from left to right using the word "from":

  • "A string from an int64 from a string from an int32 from the number 50".

Also note that casting is almost always only for scalar types: user-created object types like City and Person are too complex to simply cast into.

There is one exception to this as of EdgeDB 3.0: a uuid can be cast into an object type. This makes sense because each object in EdgeDB has a unique id, and why not be able to cast from this uuid into the object it belongs to?

Let's give this a try by first picking any id from our Person type with select Person.id;. Found one? Good, now paste that into the REPL, surround it with quotes to make it a str, and then use <uuid> to cast it. Like this:

select <uuid>'82348556-1928-11ee-ab8c-efed3acb0f00';

Good! And now let's see if we can cast from a uuid into a City. Use select City.id;, pick an id from the list, and then try casting it into a City. Let's also display its name and important_places property while we are at it.

The query below almost works, but will generate an error. Can you guess why?

select <City><uuid>'6b62fbc6-1975-11ee-9693-cff543bb8205' {
  name, 
  important_places 
  };

Ah yes, there it is:

error: QueryError: shapes cannot be applied to scalar type 'std::str'
  ┌─ <query>:1:20
  │
1 │   select <City><uuid>'6b62fbc6-1975-11ee-9693-cff543bb8205' {
  │ ╭────────────────────^
2 │ │   name,
3 │ │   important_places
4 │ │ };
  │ ╰─^ error

We told EdgeDB to give a shape to str which obviously doesn't have properties like name and important_places. No problem: just wrap everything in parentheses first.

select (<City><uuid>'6b62fbc6-1975-11ee-9693-cff543bb8205') {
  name, 
  important_places 
  };

With the parentheses, the query can be seen as this: "Take a string, try to cast it into a uuid, and try to cast it into a City. Take this result and display its name and important_places properties. The query will now work, and display one of the cities we added. It should look something like this.

{default::City {name: 'Munich', important_places: {}}}

Filtering, like and ilike, and indexing

To finish up the second chapter, let's learn how to filter. You can use filter after the curly brackets in select to only show certain objects instead of showing every object for a type. Let's filter to only show Person types that have the property name that equals 'Emil Sinclair':

select Person {
  name,
  places_visited: {name},
} filter .name = 'Emil Sinclair';

filter .name is short for filter Person.name. You can write filter Person.name too if you want - it's the same thing.

The output now only shows a single PC object with a name that matches our filter:

{default::PC {name: 'Emil Sinclair', places_visited: {}}}

Let's filter the cities now. EdgeDB has two keywords called like or ilike which allow us to to match on parts of a string instead of just a whole string. They are slightly different:

  • like is case sensitive, so "Bistritz" matches "Bistritz" but "bistritz" does not match "Bistritz".
  • ilike is not case sensitive (the i in ilike means insensitive), so "Bistritz" matches both "BiStRitz" and "bisTRITz".

You can also add % on the left and/or right which means match anything before or after. Here are some examples with the matched part in bold:

  • like Bistr% matches "Bistritz", but not "bistritz".
  • ilike '%IsTRiT%' matches "Bistritz".
  • like %athan Harker matches "Jonathan Harker".
  • ilike %n h% matches "Jonathan Harker".

Let's filter to get all the cities that start with a capital B. That means we'll need like because it needs to be case sensitive:

select City {
  name,
  modern_name,
} filter .name like 'B%';

Here is the result:

{
  default::City {name: 'Buda-Pesth', modern_name: 'Budapest'},
  default::City {name: 'Bistritz', modern_name: 'Bistrița'},
}

You can also index a string with [] square brackets, starting at 0. For example, the indexes in the string 'Jonathan' look like this:

J o n a t h a n
0 1 2 3 4 5 6 7

So 'Jonathan'[0] is 'J' and 'Jonathan'[4] is 't'.

We can then use this method to filter too. In the next query we are checking to see if the first character is a 'B' or not:

select City {
  name,
  modern_name,
} filter .name[0] = 'B'; # First character must be 'B'

That gives the same result. Careful though: if you set the number too high then it will try to search outside of the string, which is an error. If we change 0 to 18 (filter .name[18] = 'B';), we'll get this error:

edgedb error: InvalidValueError: string index 18 is out of bounds
(on line 4, column 16)

Plus, if you have any City types with a name of '', even a search for index 0 will cause an error.

Let's try to make that error happen.

First we will insert a City object with '' for a name:

db> insert City {
 name := ''
 };
{default::City {id: 5d01e634-f2c7-11ed-87af-d7dfced50628}}

And now our former query doesn't work, because EdgeDB will come across a City object with a name property that it can't index into.

db> select City {
  name,
  modern_name,
 } filter .name[0] = 'B';
edgedb error: InvalidValueError: string index 0 is out of bounds
(on line 4, column 16)

So a good rule of thumb is to not use raw indexes when filtering unless you are sure that there will be a value, and that the value will be long enough. In the next chapter we will learn how to use constraints to ensure that your data matches certain expectations you have, such as minimum length.

So what if you want to make sure that you won't get an error with an index number that might be too high? Here you can use like or ilike instead. If you replace the .name[0] part in the query above with .name ilike 'B%' we don't get an error, and the query still checks to see if there is a 'B' at index 0.

select City {
  name,
  modern_name,
 } filter .name ilike 'B%';

The result is now all City objects that start with 'B', and the City object with a name of '' is no longer getting in the way.

 {
  default::City {name: 'Buda-Pesth', modern_name: 'Budapest'},
  default::City {name: 'Bistritz', modern_name: 'Bistrița'},
}

Slicing

You can also slice a string to get a piece of it. Let's look at the index values for 'Jonathan' again and think about how we could slice it.

J o n a t h a n
0 1 2 3 4 5 6 7

Slices represent a part of a string that starts at one index and ends before another one. In other words, if you take a "slice" of it between indexes 2 and 5, you get 'nat' ('Jonathan'[2:5] = 'nat'), because it starts at 2 and goes up to 5 - but not including index 5. It's sort of like when you phone your friends to tell them that you're 'at their house': you're not telling them that you're inside it. So [2:5] represents a slice that starts at index 2 and is "at" (in front of) index 5.

In the same way, selecting a slice of 'Jonathan' up to index 7 will show up to and including index 6:

db> select 'Jonathan'[0:7];
{'Jonatha'}

In this case, you could search up to index 8. You could even search up to index 1000, as slicing doesn't involve trying to directly access an index so it won't generate an error if there is no value at that index. Or you can use [0:] for an open-ended slice that starts at 0 and ends when the string ends. So all three of these queries will work without generating an error:

db> select 'Jonathan'[0:8];
{'Jonathan'}
db> select 'Jonathan'[0:1000];
{'Jonathan'}
db> select 'Jonathan'[0:];
{'Jonathan'}

This also means that you can use slicing to safely search for values at a certain index even if the value might be an empty string. We still have that City object in the database with a name of '', so select City.name[0] generates an error:

db> select City.name[0];
edgedb error: InvalidValueError: string index 0 is out of bounds
(on line 1, column 18)

However, if we change the query to db> select City.name[0:1]; then it will look for a slice instead of an exact character index, and the query will work:

db> select City.name[0:1];
{'M', 'B', 'B', ''}

You can also use negative numbers to slice from the other end of a string. Negative index values are counted from the end of 'Jonathan', which is 8, so -1 corresponds to 8 - 1: index number 7. Let's prove this with a query:

db> select 'Jonathan'[7] = 'Jonathan'[-1];
{true}

And this query will show all the characters in the string 'Jonathan' starting from index 1 and going up to the second last index:

db> select 'Jonathan'[1:-1];
{'onatha'}

Let's end the chapter with a quick note. Did you notice one of the queries used # to add a comment? Comments in EdgeDB are simple: anything to the right of # on a line gets ignored.

So this...

select 1893#0503 is the first day of the book Dracula when...
;

...returns {1893}.

Here is all our code so far up to Chapter 2.

Time to practice

  1. Change the following select to display {100} by casting: select '99' + '1';

  2. Select all the City types that start with 'Mu' (case sensitive).

  3. Select the third letter (i.e. index number 2) of the name of every NPC.

  4. If some NPC objects might have a name of '', how would you find the third letter of the name of every NPC?

  5. Imagine an abstract type called HasAString:

    abstract type HasAString {
      string: str
    };
    

    How would you change the Person type to extend HasAString?

  6. This query only shows the id numbers of the places visited. How do you show their name?

    select Person {
      places_visited
    };

See the answers here.

Up next: Jonathan gets into the carriage and travels into the cold mountains.