TypeScript
TypeScript copied to clipboard
[Proposal] Type assertion statement (type cast) at block-scope level
This is a proposal in order to simplify the way we have to deal with type guards in TypeScript in order to enforce the type inference.
The use case is the following. Let us assume we have dozens (and dozens) of interfaces as the following:
Code
interface AARect {
x: number; // top left corner
y: number; // top left corner
width: number;
height: number;
}
interface AABox {
x: number; // center
y: number; // center
halfDimX: number;
halfDimY: number;
}
interface Circle {
x: number; // center
y: number; // center
radius: number;
}
// And much more...
And we have a union type like this one:
type Geometry = AARect | AABox | Circle | // ... And much more
It is quite easy to discriminate a type from another with hasOwnProperty or the in keyword:
function processGeometry(obj: Geometry): void {
if ("width" in obj) {
let width = (obj as AARect).width;
// ...
}
if ("halfDimX" in obj) {
let halfDimX = (obj as AABox).halfDimX;
// ...
}
else if ("radius" in obj) {
let radius = (obj as Circle).radius;
// ...
}
// And much more...
}
But, as we can see, this is quite burdensome when we need to manipulate obj inside each if block, since we need to type cast each time we use obj.
A first way to mitigate this issue would be to create an helper variable like this:
if ("width" in obj) {
let helpObj = obj as AARect;
let width = helpObj.width;
// ...
}
But this is not really satisfying since it creates an artefact we will find in the emitted JavaScript file, which is here just for the sake of the type inference.
So another solution could be to use user-defined type guard functions:
function isAARect(obj: Geometry): obj is AARect {
return "width" in obj;
}
function isAABox(obj: Geometry): obj is AABox {
return "halfDimX" in obj;
}
function isCircle(obj: Geometry): obj is Circle {
return "radius" in obj;
}
// And much more...
function processGeometry(obj: Geometry): void {
if (isAARect(obj)) {
let width = obj.width;
// ...
}
if (isAABox(obj)) {
let halfDimX = obj.halfDimX;
// ...
}
else if (isCircle(obj)) {
let radius = obj.radius;
// ...
}
// And much more...
}
But again, I find this solution not really satisfying since it still creates persistent helpers functions just for the sake of the type inference and can be overkill for situations when we do not often need to perform type guards.
So, my proposal is to introduce a new syntax in order to force the type of an identifier at a block-scope level.
function processGeometry(obj: Geometry): void {
if ("width" in obj) {
assume obj is AARect;
let width = obj.width;
// ...
}
if ("halfDimX" in obj) {
assume obj is AABox;
let halfDimX = obj.halfDimX;
// ...
}
else if ("radius" in obj) {
assume obj is Circle;
let radius = obj.radius;
// ...
}
// And much more...
}
Above, the syntax assume <identifier> is <type> gives the information to the type inference that inside the block, following this annotation, <identifier> has to be considered as <type>. No need to type cast any more. Such a way has the advantage over the previous techniques not to generate any code in the emitted JavaScript. And in my opinion, it is less tedious than creating dedicated helper functions.
This syntax can be simplified or changed. For instance we could just have :
<identifier> is <obj>
without a new keyword assume, but I am unsure this would be compliant with the current grammar and design goals of the TypeScript team.
Nevertheless, whatever any welcomed optimization, I think the general idea is relevant for making TypeScript clearer, less verbose in the source code and in the final JavaScript, and less tedious to write when we have to deal with union types.
Technically we could just consider in as a form of type guards.
But I can still imagine something that says "assume the type of this entity is so-and-so for this block".
That would be nice if the in keyword could be a form of type guard.
Although, I am unsure it would be easy to handle such a following case with type inference only:
interface A {
x: number;
z: number;
}
interface B {
y: number;
z: number;
}
interface C {
x: number;
y: number;
}
type Union = A | B | C;
function check(obj: Union): void {
if ("x" in obj && "z" in obj) {
// obj is now treated as an instance of type A
}
else if ("y" in obj && "z" in obj) {
// obj is now treated as an instance of type B
}
else if ("x" in obj && "y" in obj) {
// obj is now treated as an instance of type C
}
}
If such a case with combination of in cannot be handled in a short term, the single discriminating in as a form of type guard would be nice though for many use cases.
If combination of in can be handled anytime soon, the syntax assume <identifier> is <type> would still be relevant in general cases since it is a factored version of inline type cast.
This looks like the same concept as #9946, but with different syntax.
@yortus Thanks for the reference. Yes, I am glad I am not the only one who felt the need of such a concept even though the previous proposal insisted on type narrowing. What I propose is more than type narrowing, but type overcasting.
For instance, let us assume a variable obj has a type A, whatever this type is at a given point of the code. We would be able force its type to B with the new syntax (or another, I don't really care),
let obj: A; // Type of obj is A
// ...
assume obj is B; // Type of obj is B
// ...
This is would not only apply to type narrowing, but also to type casting. As said before, we could consider this proposal as a way to type cast an identifier at a block-scope level. Thinking of that, I am wondering if this could not be better to use as instead of is in order to underline the connection between this proposal and inline type cast:
assume obj as B;
So what you are really proposing is a type assertion statement that is otherwise exactly like the existing type assertion expression. For example:
function fn(arg: P) {...}
// Type assertion as an expression (existing syntax)
let x: Q;
f(x as P); // 'x as P' is a type assertion expression
// Type assertion as a statement (proposed new syntax)
let y: Q;
assume y is P; // this is a type assertion statement
f(y); // y has type P here
As with other statement/expression equivalences, this proposed new form of type assertion is used for its side-effects rather than its result, although obviously these are compile-time side-effects, since type assertions are elided at runtime. And as you mention you want the side-effects limited to the block in which the assertion appears.
@yortus I like your distinction between statement and expression. So yes, type assertion statement is probably the best way to name this proposal.
Syntactic possibilities:
1. As suggested in #9946: declare <identifier>: <type>.
The use of keyword declare is interesting since it already exists, and emphasizes on the fact this is an annotation that will not be emitted in the JavaScript. However, it makes me think more of ambient declarations and could suggest we do not have an instanciation of <identifier> before, which is misleading. Also, I am quite afraid of conflicts with existing use cases of declare.
2. My primary proposal: assume <identifier> is <type>.
It adds a new keyword to existing ones which can be seen as undesirable. However, I think we need a new keyword to express this new concept of type assertion statement. I first suggested the keyword is to introduce the type, since I had type guards in mind at the begining. But in fact, it is less related to type guard than type assertion (type cast).
3. Type assertion statement syntax: assume <identifier> as <type>.
This my favorite syntax up to now. The as keyword, instead of is, underline the connection with its inline counterpart, the type assertion expression.
4. Short syntax A: <identifier> is <type>
This syntax was suggested in my original proposal but it looks like more than an expression than a statement. And such a syntax could be more relevant for future proposals where type guards are directly involved.
5. Short syntax B: <identifier> as <type>
This syntax would lead to a conflict with type assertion expression. I do not think the current grammar to be able to distinguish a type assertion expression from a type assertion statement written this way, since it is today perfectly valid to write a type assertion expression like the following code:
let obj: any;
obj as number; // Type assertion expression
So my preference goes for syntax (3): assume <identifier> as <type>
Extra notes
- Type can be any valid type, including union types.
- A new type assertion statement for a same identifier overrides previous ones. For instance:
let obj: A; // Type of obj is A
...
assume obj as B; // Type of obj is B
...
assume obj as C; // Type of obj is C
...
For reference, the spec on type assertion expressions (section 4.16) is here.
The rules and use-cases described there for type assertions would presumably apply to this proposal in exactly the same way.
BTW I see you're only proposing this to work on simple identifiers, whereas type assertion expressions work on arbitrary expressions, like foo.bar as string. Type guards also work with properties, like if (isString(foo.bar)) {foo.bar.length}. It might be useful to consider at least some expressions as valid in type assertion statements, for example assert foo.bar as string;
In my mind, <identifier> included both simple variable identifiers and properties. But after checking the grammar specifications, I should have been more precise. ;)
So yes, both identifiers and properties should be allowed in type assertion statements.
But again, I find this solution not really satisfying since it still creates persistent helpers functions just for the sake of the type inference and can be overkill for situations when we do not often need to perform type guards.
@yahiko00 User defined type guard functions actually are an important part of the emitted code. They may determine control flow at runtime. Also this is not type inference, it is type assertion (or type assumption :stuck_out_tongue: ). If you want shorthand type guards you can write
function is<T>(value, condition: boolean): value is T {
return condition;
}
then you can write the following
function processGeometry(obj: Geometry): void {
if (is<AARect>(obj, "width" in obj)) {
let width = obj.width;
// ...
}
if (is<AABox>(obj, "halfDimX" in obj)) {
let halfDimX = obj.halfDimX;
// ...
}
else if (is<Circle>(obj, "radius" in obj)) {
let radius = obj.radius;
// ...
}
}
I do not deny the usefulness of type guard functions. But there are cases where I do not want type guards functions. First, as expressed before, because it adds verbosity where it is not needed. Your workaround above is interesting but adds a layer of abstraction, which is less clear than a plain test if("width" in obj), in my opinion. You still need a persistent helper function to perform type inference. Said in a different manner, I want the language to help me, not being on my way.
Also, even if I know performance is not the priority of the TypeScript language, unfortunately, I feel better when I can minimize function calls in interpreted languages, especially in situations where speed is important, like in video games.
Fair enough.
if("width" in obj) could perhaps be a type guard in a future version of the language but the inferred type could not really be AARect. Rather it would have to be something like { width: any }.
@aluanhaddad if("width" in obj) could narrow obj to AARect if the declared type of obj is a union like AARect | AABox | Circle.
@yortus that's an excellent point
Consider using a discriminated union with a type literal https://basarat.gitbooks.io/typescript/content/docs/types/discriminated-unions.html 🌹
That is true. But I see a drawback to this approach since it adds an helper property which could be bigger in size (bytes) than the sum of the others. If I have to manipulate thousands or tens thousands of object, this is a luxury I cannot afford in some situations. Also, it is not always possible to use an helper property if structures behind interfaces do not implement such property and we do not have the possibility (lack of rights or access to the source code) to do so.
FWIW I asked for this a long time ago but once we got let/const I've just done the following (before discriminated unions):
interface AARect {
x: number; // top left corner
y: number; // top left corner
width: number;
height: number;
}
interface AABox {
x: number; // center
y: number; // center
halfDimX: number;
halfDimY: number;
}
interface Circle {
x: number; // center
y: number; // center
radius: number;
}
type Geometry = AARect | AABox | Circle; // ... And much more
function processGeometry(obj: Geometry): void {
if ("width" in obj) {
let objT = obj as AARect;
let width = objT.width;
// ...
}
if ("halfDimX" in obj) {
let objT = obj as AABox;
let halfDimX = objT.halfDimX;
// ...
}
else if ("radius" in obj) {
let objT = obj as Circle;
let radius = objT.radius;
// ...
}
// And much more...
}
:rose:
Logged #10485 because we always prefer to just have narrowing happen automatically.
We've wanted a syntactic space for this for a while but haven't found anything that isn't ugly or looks like an expression with side effects.
All the current suggestions seem to propose a means of altering the type within the current scope, e.g.:
if (a instanceof A) {
b is B;
c is C;
do_something(a, b, c);
}
~~How about making the scope of the type-cast explicitly block-scoped?~~ Nope! see comments below.
if (a instanceof A) {
using (b as B, c as C) {
do_something(a, b, c);
}
}
If I had to pick from the options listed by @yahiko00, my favorite is (1):
The use of keyword
declareis interesting since it already exists, and emphasizes on the fact this is an annotation that will not be emitted in the JavaScript.
👍
However, it makes me think more of ambient declarations and could suggest we do not have an instanciation of
<identifier>before, which is misleading.
That was actually the thing that gave me the same idea - in fact, I was intuitively just trying it when I ran into this problem, hoping it would just work.
I like the fact that this resembles ambient declarations, because it is ambient (doesn't emit any code) and it is a declaration - even if it isn't quite an "ambient declaration" in the Typescript sense, it seems pretty intuitive.
Also, I am quite afraid of conflicts with existing use cases of
declare.
Unlikely to happen, I think - since ambient declarations are always top-level declarations?
Regarding (3) I find the introduction of a keyword assume is sort of inconsistent with other type-casts, which don't require a keyword.
Even if (5) is visually ambiguous with other type-casts, I like that option as well, as it's the closest relative to other type-casts in Typescript. It's actually not inconsistent with JS itself, where, for example, stand-alone expressions like a && b;, even if they're completely inert, are syntactically valid.
I don't think it would be wise for there to be a surrounding block since that would alter the meaning of let and const declarations.
It would be safer to scope the effect of the assertion to the enclosing block.
a && b; sounds good at first but it could have side effects so it's not inert like, say, a pragma.
Consider:
const checkedFsKey = Symbol();
F[checkedFsKey] = new Set<F>;
F[Symbol.hasInstance] = function (x: object) {
const hasInstance = // whatever
if (hasInstance) {
F[checkedFsKey].add(x);
}
return hasInstance;
};
@aluanhaddad good point, yes - introducing a block doesn't make sense.
The syntax can be something like
function incoming(thing: array | number, another: boolean | string): void {
// thing is array or number
if (checkSomething()) {
guard thing as array, another as string {
// thing is array, another is string
console.log(thing.length, another.toLowerCase())
}
} else {
guard thing as number {
// thing is number
console.log(thing.toFixed())
}
}
}
I like the original proposal's syntax, because if you need to narrow the scope of the type assertion, you can just add a block and make the assertion in there.
Regarding the alteration of the type of already declared variables in the current scope, with let and cost, the compiler can just throw an error complaining about multiple declarations, which makes total sense.
In my view, this type of type assertion should be nothing more than a type declaration, with the condition that the variable already exists in a higher scope.
Also, I don't agree with the use of declare keyword here. It should be something new or something unrelated, so their meanings and use don't conflict with each other.
@rzvc I personally would prefer both of these forms:
-
assume obj is Foo;- This statement within a block scope is equivalent to the following, whereassumeis a unique global name not exposed to the user:// Original assume obj is Foo; // Polyfilled while (!assume<Foo>(obj)); // never loops // Declaration function assume<T>(obj: any): obj is T { return true } -
obj is Foo- This expression is exactly equivalent tocast<Foo>(obj), wherecastis defined asfunction cast<T>(obj: T): T { return obj }
haven't found anything that isn't ugly or looks like an expression with side effects.
declare typeof foo: Foo? Not introducing a new keyword and literally (re)declaring the type of foo as Foo. And as TypeScript users we all know declare won't affect emitted JavaScript.
@saschanaz If we're going down this road, I'd prefer
declare typeof foo = Foo; because : usually means "the thing on the right is the type of the thing on the left". But that wouldn't be the case here (Foo is the type of foo, not the type of typeof foo).
If we're going down this road, I'd prefer
declare typeof foo = Foo;
That would also be good, but I'm slightly afraid that some people would confuse with typeof foo === "string".
Fair enough...
Then declare foo as Foo ? unambiguous, no new keyword, no confusion possible with as-expression, declare means no impact to JS code...
@lemoinem That's basically my proposal, just using declare (which doesn't make as much sense - you're not declaring anything) instead of a contextual assume.
Indeed, declare could be misleading.