add __rvalue(expression) builtin #17050

WalterBright · 2024-11-03T06:03:37Z

This adds the __rvalue(expression) builtin, which causes expression to be treated as an rvalue, even if it is an lvalue.

dlang-bot · 2024-11-03T06:03:40Z

Thanks for your pull request, @WalterBright!

Bugzilla references

Your PR doesn't reference any Bugzilla issue.

If your PR contains non-trivial changes, please reference a Bugzilla issue or create a manual changelog.

Testing this PR locally

If you don't have a local development environment setup, you can use Digger to test this PR:

dub run digger -- build "master + dmd#17050"

nordlow · 2024-11-04T12:35:38Z

Can this __rvalue(...) be used in place of core.lifetime.move(...)?

nordlow · 2024-11-04T15:17:47Z

Will the call to use in

S s;
__rvalue(s)
use(s);

be either

a defined behavior where s is set to S.init by __rvalue(s)
a compiler error (optionally in @safe code) at least in the case where S has indirections or destructors, or
an undefined (@system) behavior when S has indirections or destructors

?

I prefer option 2 and is the closest to what Rust does. Can it be implemented with constant-time overhead by adding a status bit to the (parameter) variable declaration indicating that its contents has been invalidated by a move?

I recently saw a https://youtu.be/08gvuBC-MIE?t=1839 in which Jon Kalb admits that the standard committe made a mistake by forcing moved from object to in a so called "fully formed state" instead of a so called "partially formed state". As this prevents certain kinds of optimizations. For details see https://youtu.be/08gvuBC-MIE?t=1839.

WalterBright · 2024-11-04T21:30:08Z

@nordlow I'll write a proper document for this after I figure out just what the end result will be!

nordlow · 2024-11-05T07:42:30Z

Thanks. Please see updates to my comment at #17050 (comment).

WalterBright · 2024-11-05T08:38:31Z

My opinion on move semantics is that once you've moved s to t, then s's lifetime is over, and it should be in the default initialized state (a concept C++ doesn't have).

nordlow · 2024-11-05T08:55:53Z

My opinion on move semantics is that once you've moved s to t, then s's lifetime is over, and it should be in the default initialized state (a concept C++ doesn't have).

Is this realized by

applying, if present, move constructor of S moving s to t and
resetting all the bytes at s to S.init?

nordlow · 2024-11-05T08:59:51Z

Have you considered it making it a compiler error to access s after it has been moved? If not, why? I'm asking because this would lead to slight better performance in debug mode at least. And this is one of the reasons why Rust has this behavior.

WalterBright · 2024-11-06T03:37:32Z

DIP: https://github.com/WalterBright/documents/blob/master/rvalue.md

WalterBright · 2024-11-06T05:10:58Z

Have you considered it making it a compiler error to access s after it has been moved?

Yes, but it requires Data Flow Analysis, which is slow.

nordlow · 2024-11-06T06:18:48Z

Have you considered it making it a compiler error to access s after it has been moved?

Yes, but it requires Data Flow Analysis, which is slow.

Ok, thanks.

Afaict, the complexity of supporting Rust-style r-value semantics depends on the context in which __rvalue would be used. For instance, in

S use(S);
S x;
auto y = __rvalue(x)
use(y); // allowed
use(x);  // disallowed

such a analysis could be implemented in the compiler with negligible overhead using an extra status bit in the Declaration node.

But in the general case I realize now that the compiler needs to recurse into all function calls that are passed l-values by reference as arguments.

Do you have a good reference to which data flow analysis in general and its applications such as this one?

nordlow · 2024-11-06T06:25:55Z

I currently experimenting with using __rvalue defined in the branch of this MR in my code. Is __rvalue currently supposed to be wrapped in core.lifetime.move? If so, is

static if (__traits(compiles, { int x; const y = __rvalue(x); })) {
	import core.stdc.string : memcpy;
	T move(T)(return scope ref T source) @trusted {
		scope(exit) {
			static immutable init = T.init;
			memcpy(&source, &init, T.sizeof);
		}
		return __rvalue(source);
	}
	void move(T)(ref T source, ref T destination) @trusted {
		scope(exit) {
			static immutable init = T.init;
			memcpy(&source, &init, T.sizeof);
		}
		destination = __rvalue(source);
	}
} else
	public import core.lifetime : move;

/// unary move()
pure nothrow @nogc @safe unittest {
	auto x = S(42);
	assert(x == S(42));
	const y = move(x);
	assert(y == S(42));
	assert(x == S.init);
}

/// binary move()
pure nothrow @nogc @safe unittest {
	auto x = S(42);
	assert(x == S(42));
	S y;
	move(x, y);
	assert(y == S(42));
	assert(x == S.init);
}

version(unittest) {
	struct S { @disable this(this); int x; }
}

a suitable rewrite of the core.lifetime.move overloads?

TurkeyMan · 2024-11-06T10:44:43Z

__rvalue() is a bad name for a move() intrinsic, the answer you seek is yes, __rvalue is exactly a replacement for move(), and it should be named move(). I'll argue for this before it's merged, but we're making very good progress here! :)

TurkeyMan · 2024-11-06T10:45:50Z

Also no, this can't be 'wrapped', it's an intrinsic; it needs to be renamed move, you can't wrap it in a function named move.

TurkeyMan · 2024-11-06T10:47:14Z

The end goal is to completely delete core.lifetime. Don't try to shoehorn this in there; that stuff is all dead.

nordlow · 2024-11-06T11:33:10Z

What about the binary overload of move() and moveEmplace?

I'm asking because this MR needs to include the druntime modifications to core.lifetime that makes full use of __rvalue for the sake of deletion of the current very convoluted implementations of move and moveEmplace in core.lifetime.

It's important to note that current behaviour of core.lifetime.move conditionally resets the T source to its T.init value when certain conditions hold for T; specifically

"If T is a struct with a destructor or postblit defined, source is reset to its .init value after it is moved into target, otherwise it is left unchanged. "

I'm not sure this is in line with the behavior of __rvalue that Walter proposes in this MR.

See https://dlang.org/phobos/core_lifetime.html#.move for details.

Luckily druntime is now part of the dmd repo.

tgehr · 2024-11-06T12:28:44Z

I agree with @nordlow that __rvalue is more low-level than move. E.g., what happens if you pass a non-POD struct twice to the same function call using __rvalue.

nordlow · 2024-11-06T12:58:57Z

I agree with @nordlow that __rvalue is more low-level than move. E.g., what happens if you pass a non-POD struct twice to the same function call using __rvalue.

Nevertheless, I personally believe it is highly preferable to make all the overloads of move and moveEmplace become builtins using their existing name. This is gonna be a breaking change for code that rely on those symbols being templates but projects can be adjusted.

Btw, I'm working on migrating std.traits to become builtin __traits for the sake of reducing template bloat in std.traits. I yet again remind us all of the fact that the C++ standard, per definition, enforces implementations to lower all symbols in std.traits to builtins. For the same reason that I believe that most (or all) druntime's traits and std.traits should be converted to builtin __traits.

WalterBright · 2024-11-06T18:24:03Z

such a analysis could be implemented in the compiler with negligible overhead using an extra status bit in the Declaration node.

Such certainly looks tempting, but it falls short as soon as the flow control becomes non-trivial. The flow analysis I know comes from class notes in a class I took on optimizations.

WalterBright · 2024-11-07T05:58:58Z

emplace() will be replaced with the placement new operator, I posted a DIP on it in the development forum.

WalterBright · 2024-11-07T06:00:21Z

E.g., what happens if you pass a non-POD struct twice to the same function call using __rvalue.

Currently it will get destructed twice.

WalterBright · 2024-11-08T14:53:18Z

This work is heavily based on Timon Gehr's and Manu Evans' contributions.

TurkeyMan · 2024-11-09T01:08:35Z

E.g., what happens if you pass a non-POD struct twice to the same function call using __rvalue.

Currently it will get destructed twice.

Another case naturally resolved with caller-destruction!

WalterBright added Enhancement WIP Work In Progress - not ready for review or pulling labels Nov 3, 2024

WalterBright requested a review from ibuclaw as a code owner November 3, 2024 06:03

WalterBright force-pushed the __rvalue branch 4 times, most recently from 0f383fd to 60c0f9e Compare November 3, 2024 07:24

thewilsonator added Needs Changelog A changelog entry needs to be added to /changelog Needs Spec PR A PR updating the language specification needs to be submitted to dlang.org labels Nov 3, 2024

WalterBright force-pushed the __rvalue branch 5 times, most recently from 082c48d to 6e0e44f Compare November 4, 2024 08:07

WalterBright force-pushed the __rvalue branch from 6e0e44f to 86b02a7 Compare November 5, 2024 06:40

add __rvalue(expression) builtin

8ccedfb

WalterBright force-pushed the __rvalue branch from 86b02a7 to 8ccedfb Compare November 5, 2024 06:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add __rvalue(expression) builtin #17050

add __rvalue(expression) builtin #17050

WalterBright commented Nov 3, 2024

dlang-bot commented Nov 3, 2024

nordlow commented Nov 4, 2024

nordlow commented Nov 4, 2024 •

edited

Loading

WalterBright commented Nov 4, 2024

nordlow commented Nov 5, 2024 •

edited

Loading

WalterBright commented Nov 5, 2024

nordlow commented Nov 5, 2024 •

edited

Loading

nordlow commented Nov 5, 2024 •

edited

Loading

WalterBright commented Nov 6, 2024

WalterBright commented Nov 6, 2024

nordlow commented Nov 6, 2024 •

edited

Loading

nordlow commented Nov 6, 2024 •

edited

Loading

TurkeyMan commented Nov 6, 2024

TurkeyMan commented Nov 6, 2024

TurkeyMan commented Nov 6, 2024

nordlow commented Nov 6, 2024 •

edited

Loading

tgehr commented Nov 6, 2024

nordlow commented Nov 6, 2024 •

edited

Loading

WalterBright commented Nov 6, 2024

WalterBright commented Nov 7, 2024

WalterBright commented Nov 7, 2024

WalterBright commented Nov 8, 2024

TurkeyMan commented Nov 9, 2024

add __rvalue(expression) builtin #17050

Are you sure you want to change the base?

add __rvalue(expression) builtin #17050

Conversation

WalterBright commented Nov 3, 2024

dlang-bot commented Nov 3, 2024

Bugzilla references

Testing this PR locally

nordlow commented Nov 4, 2024

nordlow commented Nov 4, 2024 • edited Loading

WalterBright commented Nov 4, 2024

nordlow commented Nov 5, 2024 • edited Loading

WalterBright commented Nov 5, 2024

nordlow commented Nov 5, 2024 • edited Loading

nordlow commented Nov 5, 2024 • edited Loading

WalterBright commented Nov 6, 2024

WalterBright commented Nov 6, 2024

nordlow commented Nov 6, 2024 • edited Loading

nordlow commented Nov 6, 2024 • edited Loading

TurkeyMan commented Nov 6, 2024

TurkeyMan commented Nov 6, 2024

TurkeyMan commented Nov 6, 2024

nordlow commented Nov 6, 2024 • edited Loading

tgehr commented Nov 6, 2024

nordlow commented Nov 6, 2024 • edited Loading

WalterBright commented Nov 6, 2024

WalterBright commented Nov 7, 2024

WalterBright commented Nov 7, 2024

WalterBright commented Nov 8, 2024

TurkeyMan commented Nov 9, 2024

nordlow commented Nov 4, 2024 •

edited

Loading

nordlow commented Nov 5, 2024 •

edited

Loading

nordlow commented Nov 5, 2024 •

edited

Loading

nordlow commented Nov 5, 2024 •

edited

Loading

nordlow commented Nov 6, 2024 •

edited

Loading

nordlow commented Nov 6, 2024 •

edited

Loading

nordlow commented Nov 6, 2024 •

edited

Loading

nordlow commented Nov 6, 2024 •

edited

Loading