diff --git a/src/attributes/limits.md b/src/attributes/limits.md
index 517637e5dc..65cb55b43a 100644
--- a/src/attributes/limits.md
+++ b/src/attributes/limits.md
@@ -15,10 +15,10 @@ syntax to specify the recursion depth.
#![recursion_limit = "4"]
macro_rules! a {
- () => { a!(1) };
- (1) => { a!(2) };
- (2) => { a!(3) };
- (3) => { a!(4) };
+ () => { a!(1); };
+ (1) => { a!(2); };
+ (2) => { a!(3); };
+ (3) => { a!(4); };
(4) => { };
}
diff --git a/src/destructors.md b/src/destructors.md
index 82ca73cddc..cf7ab0aca2 100644
--- a/src/destructors.md
+++ b/src/destructors.md
@@ -1,7 +1,7 @@
# Destructors
When an [initialized] [variable] or [temporary] goes out of
-[scope](#drop-scopes) its *destructor* is run, or it is *dropped*. [Assignment]
+[scope](#drop-scopes), its *destructor* is run, or it is *dropped*. [Assignment]
also runs the destructor of its left-hand operand, if it's initialized. If a
variable has been partially initialized, only its initialized fields are
dropped.
@@ -154,7 +154,7 @@ temporary variable that holds the result of that expression when used in a
[place context], unless it is [promoted].
Apart from lifetime extension, the temporary scope of an expression is the
-smallest scope that contains the expression and is for one of the following:
+smallest scope that contains the expression and is one of the following:
* The entire function body.
* A statement.
@@ -246,7 +246,8 @@ loop {
### Constant promotion
Promotion of a value expression to a `'static` slot occurs when the expression
-could be written in a constant, borrowed, and dereferencing that borrow where
+could be written in a constant and borrowed, and that borrow could be dereferenced
+where
the expression was originally written, without changing the runtime behavior.
That is, the promoted expression can be evaluated at compile-time and the
resulting value does not contain [interior mutability] or [destructors] (these
diff --git a/src/dynamically-sized-types.md b/src/dynamically-sized-types.md
index 1a5fae7270..cab1ec510e 100644
--- a/src/dynamically-sized-types.md
+++ b/src/dynamically-sized-types.md
@@ -18,7 +18,7 @@ types">DSTs. Such types can only be used in certain cases:
types">DSTs.
Unlike with generic type parameters, `Self: ?Sized` is the default in trait definitions.
* Structs may contain a DST as the
- last field, this makes the struct itself a
+ last field; this makes the struct itself a
DST.
> **Note**: [variables], function parameters, [const] items, and [static] items must be
diff --git a/src/expressions.md b/src/expressions.md
index 35e7916149..88f7fbac96 100644
--- a/src/expressions.md
+++ b/src/expressions.md
@@ -179,11 +179,11 @@ move the value. Only the following place expressions may be moved out of:
* [Variables] which are not currently borrowed.
* [Temporary values](#temporaries).
* [Fields][field] of a place expression which can be moved out of and
- doesn't implement [`Drop`].
+ don't implement [`Drop`].
* The result of [dereferencing][deref] an expression with type [`Box`] and
that can also be moved out of.
-When moving out of a place expression that evaluates to a local variable, the
+After moving out of a place expression that evaluates to a local variable, the
location is deinitialized and cannot be read from again until it is
reinitialized. In all other cases, trying to use a place expression in a value
expression context is an error.
@@ -206,7 +206,7 @@ The following expressions can be mutable place expression contexts:
* Dereference of a variable, or field of a variable, with type `&mut T`. Note:
This is an exception to the requirement of the next rule.
* Dereferences of a type that implements `DerefMut`: this then requires that
- the value being dereferenced is evaluated is a mutable place expression context.
+ the value being dereferenced is evaluated in a mutable place expression context.
* [Array indexing] of a type that implements `IndexMut`: this
then evaluates the value being indexed, but not the index, in mutable place
expression context.
diff --git a/src/expressions/array-expr.md b/src/expressions/array-expr.md
index 65b3e24917..432a45ad80 100644
--- a/src/expressions/array-expr.md
+++ b/src/expressions/array-expr.md
@@ -21,7 +21,7 @@ The syntax for the second form is two expressions separated by a semicolon (`;`)
The expression before the `;` is called the *repeat operand*.
The expression after the `;` is called the *length operand*.
It must have type `usize` and be a [constant expression], such as a [literal] or a [constant item].
-An array expression of this form creates an array with the length of the value of the legnth operand with each element a copy of the repeat operand.
+An array expression of this form creates an array with the length of the value of the length operand with each element being a copy of the repeat operand.
That is, `[a; b]` creates an array containing `b` copies of the value of `a`.
If the length operand has a value greater than 1 then this requires that the type of the repeat operand is [`Copy`] or that it must be a [path] to a constant item.
diff --git a/src/expressions/loop-expr.md b/src/expressions/loop-expr.md
index 1ca62095c4..308f3e3466 100644
--- a/src/expressions/loop-expr.md
+++ b/src/expressions/loop-expr.md
@@ -148,7 +148,7 @@ for n in 1..11 {
assert_eq!(sum, 55);
```
-A for loop is equivalent to the following block expression.
+A `for` loop is equivalent to a `loop` expression containing a [`match` expression] as follows:
```rust,ignore
diff --git a/src/expressions/method-call-expr.md b/src/expressions/method-call-expr.md
index 5e7caa5e47..748e7dc1e4 100644
--- a/src/expressions/method-call-expr.md
+++ b/src/expressions/method-call-expr.md
@@ -71,7 +71,7 @@ These cases require a [disambiguating function call syntax] for method and funct
***Warning:*** For [trait objects], if there is an inherent method of the same name as a trait method, it will give a compiler error when trying to call the method in a method call expression.
Instead, you can call the method using [disambiguating function call syntax], in which case it calls the trait method, not the inherent method.
There is no way to call the inherent method.
-Just don't define inherent methods on trait objects with the same name a trait method and you'll be fine.
+Just don't define inherent methods on trait objects with the same name as a trait method and you'll be fine.
diff --git a/src/glossary.md b/src/glossary.md
index e345634a84..8a3a270a18 100644
--- a/src/glossary.md
+++ b/src/glossary.md
@@ -125,7 +125,7 @@ implementation.
A variable is initialized if it has been assigned a value and hasn't since been
moved from. All other memory locations are assumed to be uninitialized. Only
-unsafe Rust can create such a memory without initializing it.
+unsafe Rust can create a memory location without initializing it.
### Local trait
diff --git a/src/identifiers.md b/src/identifiers.md
index e57be8407a..c6361c4db9 100644
--- a/src/identifiers.md
+++ b/src/identifiers.md
@@ -2,8 +2,8 @@
> **Lexer:**\
> IDENTIFIER_OR_KEYWORD :\
-> XID_start XID_continue\*\
-> | `_` XID_continue+
+> XID_Start XID_Continue\*\
+> | `_` XID_Continue+
>
> RAW_IDENTIFIER : `r#` IDENTIFIER_OR_KEYWORD *Except `crate`, `self`, `super`, `Self`*
>
@@ -12,29 +12,59 @@
> IDENTIFIER :\
> NON_KEYWORD_IDENTIFIER | RAW_IDENTIFIER
-An identifier is any nonempty Unicode string of the following form:
+
+Identifiers follow the specification in [Unicode Standard Annex #31][UAX31] for Unicode version 13.0, with the additions described below. Some examples of identifiers:
-Either
+* `foo`
+* `_identifier`
+* `r#true`
+* `Москва`
+* `東京`
-* The first character has property [`XID_start`].
-* The remaining characters have property [`XID_continue`].
+The profile used from UAX #31 is:
-Or
+* Start := [`XID_Start`], plus the underscore character (U+005F)
+* Continue := [`XID_Continue`]
+* Medial := empty
-* The first character is `_`.
-* The identifier is more than one character. `_` alone is not an identifier.
-* The remaining characters have property [`XID_continue`].
+> **Note**: Identifiers starting with an underscore are typically used to indicate an identifier that is intentionally unused, and will silence the unused warning in `rustc`.
-> **Note**: [`XID_start`] and [`XID_continue`] as character properties cover the
-> character ranges used to form the more familiar C and Java language-family
-> identifiers.
+Identifiers may not be a [strict] or [reserved] keyword without the `r#` prefix described below in [raw identifiers](#raw-identifiers).
+
+Zero width non-joiner (ZWNJ U+200C) and zero width joiner (ZWJ U+200D) characters are not allowed in identifiers.
+
+Identifiers are restricted to the ASCII subset of [`XID_Start`] and [`XID_Continue`] in the following situations:
+
+* [`extern crate`] declarations
+* External crate names referenced in a [path]
+* [Module] names loaded from the filesystem without a [`path` attribute]
+* [`no_mangle`] attributed items
+* Item names in [external blocks]
+
+## Normalization
+
+Identifiers are normalized using Normalization Form C (NFC) as defined in [Unicode Standard Annex #15][UAX15]. Two identifiers are equal if their NFC forms are equal.
+
+[Procedural][proc-macro] and [declarative][mbe] macros receive normalized identifiers in their input.
+
+## Raw identifiers
A raw identifier is like a normal identifier, but prefixed by `r#`. (Note that
the `r#` prefix is not included as part of the actual identifier.)
Unlike a normal identifier, a raw identifier may be any strict or reserved
keyword except the ones listed above for `RAW_IDENTIFIER`.
-[strict]: keywords.md#strict-keywords
+[`extern crate`]: items/extern-crates.md
+[`no_mangle`]: abi.md#the-no_mangle-attribute
+[`path` attribute]: items/modules.md#the-path-attribute
+[`XID_Continue`]: http://unicode.org/cldr/utility/list-unicodeset.jsp?a=%5B%3AXID_Continue%3A%5D&abb=on&g=&i=
+[`XID_Start`]: http://unicode.org/cldr/utility/list-unicodeset.jsp?a=%5B%3AXID_Start%3A%5D&abb=on&g=&i=
+[external blocks]: items/external-blocks.md
+[mbe]: macros-by-example.md
+[module]: items/modules.md
+[path]: paths.md
+[proc-macro]: procedural-macros.md
[reserved]: keywords.md#reserved-keywords
-[`XID_start`]: http://unicode.org/cldr/utility/list-unicodeset.jsp?a=%5B%3AXID_Start%3A%5D&abb=on&g=&i=
-[`XID_continue`]: http://unicode.org/cldr/utility/list-unicodeset.jsp?a=%5B%3AXID_Continue%3A%5D&abb=on&g=&i=
+[strict]: keywords.md#strict-keywords
+[UAX15]: https://www.unicode.org/reports/tr15/tr15-50.html
+[UAX31]: https://www.unicode.org/reports/tr31/tr31-33.html
diff --git a/src/items/extern-crates.md b/src/items/extern-crates.md
index 7c2f9ad762..f4dc735b09 100644
--- a/src/items/extern-crates.md
+++ b/src/items/extern-crates.md
@@ -64,7 +64,7 @@ by using an underscore with the form `extern crate foo as _`. This may be
useful for crates that only need to be linked, but are never referenced, and
will avoid being reported as unused.
-The [`macro_use` attribute] works as usual and import the macro names
+The [`macro_use` attribute] works as usual and imports the macro names
into the [`macro_use` prelude].
## The `no_link` attribute
diff --git a/src/items/external-blocks.md b/src/items/external-blocks.md
index 9443276d4b..9ad100fbd5 100644
--- a/src/items/external-blocks.md
+++ b/src/items/external-blocks.md
@@ -17,7 +17,7 @@ External blocks provide _declarations_ of items that are not _defined_ in the
current crate and are the basis of Rust's foreign function interface. These are
akin to unchecked imports.
-Two kind of item _declarations_ are allowed in external blocks: [functions] and
+Two kinds of item _declarations_ are allowed in external blocks: [functions] and
[statics]. Calling functions or accessing statics that are declared in external
blocks is only allowed in an `unsafe` context.
diff --git a/src/items/functions.md b/src/items/functions.md
index f882a24634..183561c921 100644
--- a/src/items/functions.md
+++ b/src/items/functions.md
@@ -74,7 +74,7 @@ If the first parameter is a _SelfParam_, this indicates that the function is a
function] in a [trait] or [implementation].
A parameter with the `...` token indicates a [variadic function], and may only
-be used as the last parameter of a [external block] function. The variadic
+be used as the last parameter of an [external block] function. The variadic
parameter may have an optional identifier, such as `args: ...`.
## Function body
diff --git a/src/items/generics.md b/src/items/generics.md
index e9c11ffcfa..0a9703335b 100644
--- a/src/items/generics.md
+++ b/src/items/generics.md
@@ -37,7 +37,7 @@ Generic parameters are in scope within the item definition where they are
declared. They are not in scope for items declared within the body of a
function as described in [item declarations].
-[References], [raw pointers], [arrays], [slices][arrays], [tuples], and
+[References], [raw pointers], [arrays], [slices], [tuples], and
[function pointers] have lifetime or type parameters as well, but are not
referred to with path syntax.
@@ -274,6 +274,7 @@ struct Foo<#[my_flexible_clone(unbounded)] H> {
[array repeat expression]: ../expressions/array-expr.md
[arrays]: ../types/array.md
+[slices]: ../types/slice.md
[associated const]: associated-items.md#associated-constants
[associated type]: associated-items.md#associated-types
[block]: ../expressions/block-expr.md
diff --git a/src/items/implementations.md b/src/items/implementations.md
index cb3598e8e6..ee651cee54 100644
--- a/src/items/implementations.md
+++ b/src/items/implementations.md
@@ -87,8 +87,8 @@ fn main() {
## Trait Implementations
A _trait implementation_ is defined like an inherent implementation except that
-the optional generic type declarations is followed by a [trait] followed
-by the keyword `for`. Followed by a path to a nominal type.
+the optional generic type declarations are followed by a [trait], followed
+by the keyword `for`, followed by a path to a nominal type.
@@ -265,7 +265,7 @@ impl<'a> HasAssocType for Struct {
Implementations may contain outer [attributes] before the `impl` keyword and
inner [attributes] inside the brackets that contain the associated items. Inner
-attributes must come before any associated items. That attributes that have
+attributes must come before any associated items. The attributes that have
meaning here are [`cfg`], [`deprecated`], [`doc`], and [the lint check
attributes].
diff --git a/src/items/modules.md b/src/items/modules.md
index ff9cae078c..2a0ad55c58 100644
--- a/src/items/modules.md
+++ b/src/items/modules.md
@@ -65,7 +65,7 @@ contents in a file named `mod.rs` within that directory. The above example can
alternately be expressed with `crate::util`'s contents in a file named
`util/mod.rs`. It is not allowed to have both `util.rs` and `util/mod.rs`.
-> **Note**: Previous to `rustc` 1.30, using `mod.rs` files was the way to load
+> **Note**: Prior to `rustc` 1.30, using `mod.rs` files was the way to load
> a module with nested children. It is encouraged to use the new naming
> convention as it is more consistent, and avoids having many files named
> `mod.rs` within a project.
diff --git a/src/lifetime-elision.md b/src/lifetime-elision.md
index 201e34410c..e8a20e2e82 100644
--- a/src/lifetime-elision.md
+++ b/src/lifetime-elision.md
@@ -103,7 +103,7 @@ If neither of those rules apply, then the bounds on the trait are used:
// For the following trait...
trait Foo { }
-// These two are the same as Box has no lifetime bound on T
+// These two are the same because Box has no lifetime bound on T
type T1 = Box;
type T2 = Box;
diff --git a/src/names/preludes.md b/src/names/preludes.md
index 6027763157..4c1568ada6 100644
--- a/src/names/preludes.md
+++ b/src/names/preludes.md
@@ -3,7 +3,7 @@
A *prelude* is a collection of names that are automatically brought into scope
of every module in a crate.
-These prelude names are not part of the module itself, they are implicitly
+These prelude names are not part of the module itself: they are implicitly
queried during [name resolution]. For example, even though something like
[`Box`] is in scope in every module, you cannot refer to it as `self::Box`
because it is not a member of the current module.
diff --git a/src/patterns.md b/src/patterns.md
index 7676d5255b..8cd8af042f 100644
--- a/src/patterns.md
+++ b/src/patterns.md
@@ -13,7 +13,6 @@
> | [_IdentifierPattern_]\
> | [_WildcardPattern_]\
> | [_RestPattern_]\
-> | [_ObsoleteRangePattern_]\
> | [_ReferencePattern_]\
> | [_StructPattern_]\
> | [_TupleStructPattern_]\
@@ -384,6 +383,9 @@ match slice {
// `end` is a slice of everything but the first element, which must be "a".
["a", end @ ..] => println!("ends with: {:?}", end),
+ // 'whole' is the entire slice and `last` is the final element
+ whole @ [.., last] => println!("the last element of {:?} is {}", whole, last),
+
rest => println!("{:?}", rest),
}
@@ -404,7 +406,15 @@ match tuple {
> **Syntax**\
> _RangePattern_ :\
-> _RangePatternBound_ `..=` _RangePatternBound_
+> _InclusiveRangePattern_\
+> | _HalfOpenRangePattern_\
+> | _ObsoleteRangePattern_
+>
+> _InclusiveRangePattern_ :\
+> _RangePatternBound_ `..=` _RangePatternBound_
+>
+> _HalfOpenRangePattern_ :\
+> | _RangePatternBound_ `..`
>
> _ObsoleteRangePattern_ :\
> _RangePatternBound_ `...` _RangePatternBound_
@@ -417,11 +427,20 @@ match tuple {
> | [_PathInExpression_]\
> | [_QualifiedPathInExpression_]
-Range patterns match values that are within the closed range defined by its lower and
-upper bounds. For example, a pattern `'m'..='p'` will match only the values `'m'`, `'n'`,
-`'o'`, and `'p'`. The bounds can be literals or paths that point to constant values.
+Range patterns match values within the range defined by their bounds. A range pattern may be
+closed or half-open. A range pattern is closed if it has both a lower and an upper bound, and
+it matches all the values between and including both of its bounds. A range pattern that is
+half-open is written with a lower bound but not an upper bound, and matches any value equal to
+or greater than the specified lower bound.
+
+For example, a pattern `'m'..='p'` will match only the values `'m'`, `'n'`, `'o'`, and `'p'`. For an integer the
+pattern `1..` will match 9, or 9001, or 9007199254740991 (if it is of an appropriate size), but
+not 0, and not negative numbers for signed integers. The bounds can be literals or paths that point
+to constant values.
-A pattern a `..=` b must always have a ≤ b. It is an error to have a range pattern
+A half-open range pattern in the style `a..` cannot be used to match within the context of a slice.
+
+A pattern `a..=b` must always have a ≤ b. It is an error to have a range pattern
`10..=0`, for example.
The `...` syntax is kept for backwards compatibility.
@@ -453,6 +472,12 @@ println!("{}", match ph {
_ => unreachable!(),
});
+# let uint: u32 = 5;
+match uint {
+ 0 => "zero!",
+ 1.. => "positive number!",
+};
+
// using paths to constants:
# const TROPOSPHERE_MIN : u8 = 6;
# const TROPOSPHERE_MAX : u8 = 20;
@@ -733,6 +758,10 @@ is irrefutable. When matching a slice, it is irrefutable only in the form with
a single `..` [rest pattern](#rest-patterns) or [identifier
pattern](#identifier-patterns) with the `..` rest pattern as a subpattern.
+Within a slice, a half-open range pattern like `a..` must be enclosed in parentheses,
+as in `(a..)`, to clarify it is intended to match a single value.
+A future version of Rust may give the non-parenthesized version an alternate meaning.
+
## Path patterns
> **Syntax**\
diff --git a/src/tokens.md b/src/tokens.md
index 15d8468a01..8cdf6dd577 100644
--- a/src/tokens.md
+++ b/src/tokens.md
@@ -585,7 +585,7 @@ usages and meanings are defined in the linked pages.
| `@` | At | [Subpattern binding]
| `_` | Underscore | [Wildcard patterns], [Inferred types], Unnamed items in [constants], [extern crates], and [use declarations]
| `.` | Dot | [Field access][field], [Tuple index]
-| `..` | DotDot | [Range][range], [Struct expressions], [Patterns]
+| `..` | DotDot | [Range][range], [Struct expressions], [Patterns], [Range Patterns][rangepat]
| `...` | DotDotDot | [Variadic functions][extern], [Range patterns]
| `..=` | DotDotEq | [Inclusive Range][range], [Range patterns]
| `,` | Comma | Various separators
@@ -646,6 +646,7 @@ them are referred to as "token trees" in [macros]. The three types of brackets
[patterns]: patterns.md
[question]: expressions/operator-expr.md#the-question-mark-operator
[range]: expressions/range-expr.md
+[rangepat]: patterns.md#range-patterns
[raw pointers]: types/pointer.md#raw-pointers-const-and-mut
[references]: types/pointer.md
[sized]: trait-bounds.md#sized
diff --git a/src/type-coercions.md b/src/type-coercions.md
index 2b48a0645e..45d81d3860 100644
--- a/src/type-coercions.md
+++ b/src/type-coercions.md
@@ -147,9 +147,11 @@ Coercion is allowed between the following types:
and where `U` can be obtained from `T` by [unsized coercion](#unsized-coercions).
+* Function item types to `fn` pointers
+
* Non capturing closures to `fn` pointers
* `!` to any `T`
diff --git a/src/types/closure.md b/src/types/closure.md
index 7cfb3b059f..ecdb337196 100644
--- a/src/types/closure.md
+++ b/src/types/closure.md
@@ -2,38 +2,46 @@
A [closure expression] produces a closure value with a unique, anonymous type
that cannot be written out. A closure type is approximately equivalent to a
-struct which contains the captured variables. For instance, the following
+struct which contains the captured values. For instance, the following
closure:
```rust
+#[derive(Debug)]
+struct Point { x: i32, y: i32 }
+struct Rectangle { left_top: Point, right_bottom: Point }
+
fn f String> (g: F) {
println!("{}", g());
}
-let mut s = String::from("foo");
-let t = String::from("bar");
-
-f(|| {
- s += &t;
- s
-});
-// Prints "foobar".
+let mut rect = Rectangle {
+ left_top: Point { x: 1, y: 1 },
+ right_bottom: Point { x: 0, y: 0 }
+};
+
+let c = || {
+ rect.left_top.x += 1;
+ rect.right_bottom.x += 1;
+ format!("{:?}", rect.left_top)
+};
+// Prints "Point { x: 2, y: 1 }".
```
generates a closure type roughly like the following:
-
+
```rust,ignore
struct Closure<'a> {
- s : String,
- t : &'a String,
+ left_top : &'a mut Point,
+ right_bottom_x : &'a mut i32,
}
impl<'a> FnOnce<()> for Closure<'a> {
type Output = String;
fn call_once(self) -> String {
- self.s += &*self.t;
- self.s
+ self.left_top.x += 1;
+ self.right_bottom_x += 1;
+ format!("{:?}", self.left_top)
}
}
```
@@ -42,48 +50,153 @@ so that the call to `f` works as if it were:
```rust,ignore
-f(Closure{s: s, t: &t});
+f(Closure{ left_top: rect.left_top, right_bottom_x: rect.left_top.x });
```
## Capture modes
-The compiler prefers to capture a closed-over variable by immutable borrow,
+The compiler prefers to capture a value by immutable borrow,
followed by unique immutable borrow (see below), by mutable borrow, and finally
by move. It will pick the first choice of these that is compatible with how the
-captured variable is used inside the closure body. The compiler does not take
-surrounding code into account, such as the lifetimes of involved variables, or
+captured value is used inside the closure body. The compiler does not take
+surrounding code into account, such as the lifetimes of involved variables or fields, or
of the closure itself.
-If the `move` keyword is used, then all captures are by move or, for `Copy`
-types, by copy, regardless of whether a borrow would work. The `move` keyword is
-usually used to allow the closure to outlive the captured values, such as if the
-closure is being returned or used to spawn a new thread.
+## Capture Precision
+
+The precise path that gets captured is typically the full path that is used in the closure, but there are cases where we will only capture a prefix of the path.
+
+
+### Shared prefix
+
+In the case where a path and one of the ancestor’s of that path are both captured by a closure, the ancestor path is captured with the highest capture mode among the two captures,`CaptureMode = max(AncestorCaptureMode, DescendantCaptureMode)`, using the strict weak ordering
+
+`ImmBorrow < UniqueImmBorrow < MutBorrow < ByValue`.
-Composite types such as structs, tuples, and enums are always captured entirely,
-not by individual fields. It may be necessary to borrow into a local variable in
-order to capture a single field:
+Note that this might need to be applied recursively.
```rust
-# use std::collections::HashSet;
-#
-struct SetVec {
- set: HashSet,
- vec: Vec
-}
+# fn move_value(_: T){}
+let s = String::from("S");
+let t = (s, String::from("T"));
+let mut u = (t, String::from("U"));
+
+let c = || {
+ println!("{:?}", u); // u captured by ImmBorrow
+ u.1.truncate(0); // u.0 captured by MutBorrow
+ move_value(u.0.0); // u.0.0 captured by ByValue
+};
+```
-impl SetVec {
- fn populate(&mut self) {
- let vec = &mut self.vec;
- self.set.iter().for_each(|&n| {
- vec.push(n);
- })
- }
-}
+Overall the closure will capture `u` by `ByValue`.
+
+### Wild Card Patterns
+Closures only capture data that needs to be read, which means the following closures will not capture `x`
+
+```rust
+let x = 10;
+let c = || {
+ let _ = x;
+};
+
+let c = || match x {
+ _ => println!("Hello World!")
+};
```
-If, instead, the closure were to use `self.vec` directly, then it would attempt
-to capture `self` by mutable reference. But since `self.set` is already
-borrowed to iterate over, the code would not compile.
+### Capturing references in move contexts
+
+Moving fields out of references is not allowed. As a result, in the case of move closures, when values accessed through a shared references are moved into the closure body, the compiler will truncate right before a dereference.
+
+```rust
+struct T(String, String);
+
+let mut t = T(String::from("foo"), String::from("bar"));
+let t = &mut t;
+let c = move || t.0.truncate(0); // closure captures `t`
+```
+
+### Raw pointer dereference
+Because it is `unsafe` to dereference a raw pointer, closures will only capture the prefix of a path that runs up to, but not including, the first dereference of a raw pointer.
+
+```rust,
+struct T(String, String);
+
+let t = T(String::from("foo"), String::from("bar"));
+let t = &t as *const T;
+
+let c = || unsafe {
+ println!("{}", (*t).0); // closure captures t
+};
+```
+
+### Reference into unaligned `struct`s
+
+Because it is `unsafe` to hold references to unaligned fields in a structure, closures will only capture the prefix of the path that runs up to, but not including, the first field access into an unaligned structure.
+
+```rust
+#[repr(packed)]
+struct T(String, String);
+
+let t = T(String::from("foo"), String::from("bar"));
+let c = || unsafe {
+ println!("{}", t.0); // closure captures t
+};
+```
+
+
+### `Box` vs other `Deref` implementations
+
+The implementation of the [`Deref`] trait for [`Box`] is treated differently from other `Deref` implementations, as it is considered a special entity.
+
+For example, let us look at examples involving `Rc` and `Box`. The `*rc` is desugared to a call to the trait method `deref` defined on `Rc`, but since `*box` is treated differently by the compiler, the compiler is able to do precise capture on contents of the `Box`.
+
+[`Box`]: ../special-types-and-traits.md#boxt
+[`Deref`]: ../special-types-and-traits.md#deref-and-derefmut
+
+#### Non `move` closure
+
+In a non `move` closure, if the contents of the `Box` are not moved into the closure body, the contents of the `Box` are precisely captured.
+
+```rust
+# use std::rc::Rc;
+
+struct S(i32);
+
+let b = Box::new(S(10));
+let c_box = || {
+ println!("{}", (*b).0); // closure captures `(*b).0`
+};
+
+let r = Rc::new(S(10));
+let c_rc = || {
+ println!("{}", (*r).0); // closure caprures `r`
+};
+```
+
+However, if the contents of the `Box` are moved into the closure, then the box is entirely captured. This is done so the amount of data that needs to be moved into the closure is minimized.
+
+```rust
+struct S(i32);
+
+let b = Box::new(S(10));
+let c_box = || {
+ let x = (*b).0; // closure captures `b`
+};
+```
+
+#### `move` closure
+
+Similarly to moving contents of a `Box` in a non-`move` closure, reading the contents of a `Box` in a `move` closure will capture the `Box` entirely.
+
+```rust
+struct S(i32);
+
+let b = Box::new(S(10));
+let c_box = || {
+ println!("{}", (*b).0); // closure captures `b`
+};
+```
## Unique immutable borrows in captures
@@ -106,13 +219,14 @@ let z = &x;
In this case, borrowing `x` mutably is not possible, because `x` is not `mut`.
But at the same time, borrowing `x` immutably would make the assignment illegal,
-because a `& &mut` reference may not be unique, so it cannot safely be used to
+because a `& &mut` reference might not be unique, so it cannot safely be used to
modify a value. So a unique immutable borrow is used: it borrows `x` immutably,
but like a mutable borrow, it must be unique. In the above example, uncommenting
the declaration of `y` will produce an error because it would violate the
uniqueness of the closure's borrow of `x`; the declaration of z is valid because
the closure's lifetime has expired at the end of the block, releasing the borrow.
+
## Call traits and coercions
Closure types all implement [`FnOnce`], indicating that they can be called once
@@ -156,12 +270,13 @@ following traits if allowed to do so by the types of the captures it stores:
The rules for [`Send`] and [`Sync`] match those for normal struct types, while
[`Clone`] and [`Copy`] behave as if [derived]. For [`Clone`], the order of
-cloning of the captured variables is left unspecified.
+cloning of the captured values is left unspecified.
+
Because captures are often by reference, the following general rules arise:
-* A closure is [`Sync`] if all captured variables are [`Sync`].
-* A closure is [`Send`] if all variables captured by non-unique immutable
+* A closure is [`Sync`] if all captured values are [`Sync`].
+* A closure is [`Send`] if all values captured by non-unique immutable
reference are [`Sync`], and all values captured by unique immutable or mutable
reference, copy, or move are [`Send`].
* A closure is [`Clone`] or [`Copy`] if it does not capture any values by
@@ -178,3 +293,275 @@ Because captures are often by reference, the following general rules arise:
[`Sync`]: ../special-types-and-traits.md#sync
[closure expression]: ../expressions/closure-expr.md
[derived]: ../attributes/derive.md
+
+## Drop Order
+
+If a closure captures a field of a composite types such as structs, tuples, and enums by value, the field's lifetime would now be tied to the closure. As a result, it is possible for disjoint fields of a composite types to be dropped at different times.
+
+```rust
+{
+ let tuple =
+ (String::from("foo"), String::from("bar")); // --+
+ { // |
+ let c = || { // ----------------------------+ |
+ // tuple.0 is captured into the closure | |
+ drop(tuple.0); // | |
+ }; // | |
+ } // 'c' and 'tuple.0' dropped here ------------+ |
+} // tuple.1 dropped here -----------------------------+
+```
+
+
+## Overall Capture analysis algorithm
+
+* Input:
+ * Analyzing the closure C yields a mapping of `Place -> Mode` that are accessed
+ * Access mode is `ref`, `ref uniq`, `ref mut`, or `by-value` (ordered least to max)
+ * For a `Place` that is used in two different access modes within the same closure, the mode reported from closure analysis is the maximum access mode.
+ * Note: `ByValue` use of a `Copy` type is seen as a `ref` access mode.
+ * Closure mode is `ref` or `move`
+* Output:
+ * Minimal `(Place, Mode)` pairs that are actually captured
+* Cleanup and truncation
+ * Generate C' by mapping each (Mode, Place) in C:
+ * `(Place1, Mode1) = ref_opt(unsafe_check(Place, Mode))`
+ * `(Place2, Mode2)` = if this is a ref closure:
+ * `ref_xform(Place1, Mode1)`
+ * else:
+ * `move_xform(Place1, Mode1)`
+ * Add `(Place3, Mode3) = truncate_move_through_drop(Place2, Mode2)` to C'.
+* Minimization
+ * Until no rules apply:
+ * For each two places (P1, M1), (P2, M2) where P1 is a prefix of P2:
+ * Remove both places from the set
+ * Add (P1, max(M1, M2)) into the set
+* Helper functions:
+ * `unsafe_check(Place, Mode) -> (Place, Mode)`
+ * "Ensure unsafe accesses occur within the closure"
+ * If Place contains a deref (at index `i`) of a raw pointer:
+ * Let `(Place1, Mode1) = truncate_place(Place, Mode, i)`
+ * Return (Place1, Mode1)
+ * If Mode is `ref *` and the place contains a field of a packed struct at index `i`:
+ * Let `(Place1, Mode1) = truncate_place(Place, Mode, i)`
+ * Return (Place1, Mode1)
+ * Else
+ * Return (Place, Mode)
+ * `move_xform(Place, Mode) -> (Place, Mode)` (For move closures)
+ * If place contains a deref at index `i`:
+ * Let `(Place1, _) = truncate_place(Place, Mode, i)`
+ * Return (Place1, ByValue)
+ * Else:
+ * Return (Place, ByValue)
+ * Note that initially we had considered an approach where "Take ownership if data being accessed is owned by the variable used to access it (or if closure attempts to move data that it doesn't own). That is when taking ownership only capture data that is found on the stack otherwise reborrow the reference.". This cause a bug around lifetimes, check [rust-lang/rust#88431](https://github.com/rust-lang/rust/issues/88431).
+ * `ref_xform(Place, Mode) -> (Place, Mode)` (for ref closures)
+ * "If taking ownership of data, only move data from enclosing stack frame."
+ * Generate C' by mapping each (Mode, Place) in C
+ * If Mode is ByValue and place contains a deref at index `i`:
+ * Let `(Place1, _) = truncate_place(Place, Mode, i)`
+ * Return (Place1, ByValue)
+ * Else:
+ * Return (Place, Mode)
+ * `ref_opt(Place, Mode) -> (Place, Mode)`
+ * "Optimization: borrow the ref, not data owned by ref."
+ * Disjoint capture over immutable reference doesn't add too much value because the fields can either be borrowed immutably or copied.
+ * Edge case: Field that is accessed via the referece lives longer than the reference.
+ * Resolution: Only consider the last Deref
+ * If Place is (Base, Projections), where Projections is a list of size N.
+ * For all `i, 0 <= i < N`, Projections[i] != Deref
+ * Return (Place, Mode)
+ * If `l, 0 <= l < N` is the last/rightmost Deref Projection i.e. for any `i, l < i < N` Projection[i] != Deref,
+ and `Place.type_before_projection(l) = ty::Ref(.., Mutability::Not)`
+ * Let Place1 = (Base, Projections[0..=l])
+ * Return (Place1, Ref)
+ * `truncate_move_through_drop(Place1, Mode1) -> (Place, Mode)`
+ * Rust doesn't permit moving out of a type that implements drop
+ * In the case where we do a disjoint capture in a move closure, we might end up trying to move out of drop type
+ * We truncate move of not-Copy types
+ * If Mode1 != ByBalue
+ * return (Place1, Mode1)
+ * If there exists `i` such that `Place1.before_projection(i): Drop` and `Place1.ty()` doesn't impl `Copy`
+ * then return `truncate_place(Place1, Mode1, i)`
+ * Else return (Place1, Mode1)
+ * Check [rust-lang/rust#88476](https://github.com/rust-lang/rust/issues/88476) for examples.
+ * `truncate_place(Place, Mode, len) -> (Place, Mode)`
+ * "Truncate the place to length `len`, i.e. upto but not including index `len`"
+ * "If during truncation we drop Deref of a `&mut` and the place was being used by `ref mut`, the access to the truncated place must be unique"
+ * Let (Proj_before, Proj_after) = Place.split_before(len)
+ * If Mode == `ref mut` and there exists `Deref` in `Proj_after` at index `i` such that `Place.type_before_projection(len + i)` is `&mut T`
+ * Return (Place(Proj_before, ..InputPlace), `ref-uniq`)
+ * Else Return (Place(Proj_before, ..InputPlace), Mode)
+
+## Key examples
+
+### box-mut
+
+This test shows how a `move` closure can sometimes capture values by mutable reference, if they are reached via a `&mut` reference.
+
+```rust
+struct Foo { x: i32 }
+
+fn box_mut() {
+ let mut s = Foo { x: 0 } ;
+
+ let px = &mut s;
+ let bx = Box::new(px);
+
+
+ let c = move || bx.x += 10;
+ // Mutable reference to this place:
+ // (*(*bx)).x
+ // ^ ^
+ // | a Box
+ // a &mut
+}
+```
+
+
+```ignore
+Closure mode = move
+C_in = {
+ (ref mut, (*(*bx)).x)
+}
+C_out = C_in
+```
+
+Output is the same: `C' = C`
+
+### Packed-field-ref-and-move
+
+When you have a closure that both references a packed field (which is unsafe) and moves from it (which is safe) we capture the entire struct, rather than just moving the field. This is to aid in predictability, so that removing the move doesn't make the closure become unsafe:
+
+```rust
+#[repr(packed)]
+struct Packed { x: String }
+
+# fn use_ref(_: &T) {}
+# fn move_value(_: T) {}
+
+fn main() {
+ let packed = Packed { x: String::new() };
+
+ let c = || {
+ use_ref(&packed.x);
+ move_value(packed.x);
+ };
+
+ c();
+}
+```
+
+
+```ignore
+Closure mode = ref
+C_in = {
+ (ref mut, packed)
+}
+C_out = C_in
+```
+
+### Optimization-Edge-Case
+
+This test shows an interesting edge case. Normally, when we see a borrow of something behind a shared reference (`&T`), we truncate to capture the entire reference, because that is more efficient (and we can always use that reference to reach all the data it refers to). However, in the case where we are dereferencing two shared references, we have to be sure to preserve the full path, since otherwise the resulting closure could have a shorter lifetime than is necessary.
+
+```edition2021
+struct Int(i32);
+struct B<'a>(&'a i32);
+
+struct MyStruct<'a> {
+ a: &'static Int,
+ b: B<'a>,
+}
+
+fn foo<'a, 'b>(m: &'a MyStruct<'b>) -> impl FnMut() + 'static {
+ let c = || drop(&m.a.0);
+ c
+}
+
+```
+
+
+```ignore
+Closure mode = ref
+C_in = {
+ (ref mut, *m.a)
+}
+C_out = C_in
+```
+
+# Edition 2018 and before
+
+## Closure types difference
+
+In Edition 2018 and before, a closure would capture variables in its entirety. This means that for the example used in the [Closure types](#closure-types) section, the generated closure type would instead look something like this:
+
+
+```rust,ignore
+struct Closure<'a> {
+ rect : &'a mut Rectangle,
+}
+
+impl<'a> FnOnce<()> for Closure<'a> {
+ type Output = String;
+ fn call_once(self) -> String {
+ self.rect.left_top.x += 1;
+ self.rect.right_bottom.x += 1;
+ format!("{:?}", self.rect.left_top)
+ }
+}
+```
+and the call to `f` would work as follows:
+
+```rust,ignore
+f(Closure { rect: rect });
+```
+
+## Capture precision difference
+
+Composite types such as structs, tuples, and enums are always captured in its entirety,
+not by individual fields. As a result, it may be necessary to borrow into a local variable in order to capture a single field:
+
+```rust
+# use std::collections::HashSet;
+#
+struct SetVec {
+ set: HashSet,
+ vec: Vec
+}
+
+impl SetVec {
+ fn populate(&mut self) {
+ let vec = &mut self.vec;
+ self.set.iter().for_each(|&n| {
+ vec.push(n);
+ })
+ }
+}
+```
+
+If, instead, the closure were to use `self.vec` directly, then it would attempt
+to capture `self` by mutable reference. But since `self.set` is already
+borrowed to iterate over, the code would not compile.
+
+If the `move` keyword is used, then all captures are by move or, for `Copy`
+types, by copy, regardless of whether a borrow would work. The `move` keyword is
+usually used to allow the closure to outlive the captured values, such as if the
+closure is being returned or used to spawn a new thread.
+
+Regardless of if the data will be read by the closure, i.e. in case of wild card patterns, if a variable defined outside the closure is mentioned within the closure the variable will be captured in its entirety.
+
+## Drop order difference
+
+As composite types are captured in their entirety, a closure which captures one of those composite types by value would drop the entire captured variable at the same time as the closure gets dropped.
+
+```rust
+{
+ let tuple =
+ (String::from("foo"), String::from("bar"));
+ {
+ let c = || { // --------------------------+
+ // tuple is captured into the closure |
+ drop(tuple.0); // |
+ }; // |
+ } // 'c' and 'tuple' dropped here ------------+
+}
+```
diff --git a/src/types/slice.md b/src/types/slice.md
index e780277642..6ba5e7d211 100644
--- a/src/types/slice.md
+++ b/src/types/slice.md
@@ -7,13 +7,12 @@
A slice is a [dynamically sized type] representing a 'view' into a sequence of
elements of type `T`. The slice type is written as `[T]`.
-To use a slice type it generally has to be used behind a pointer for example
-as:
+Slice types are generally used through pointer types. For example:
-* `&[T]`, a 'shared slice', often just called a 'slice', it doesn't own the
- data it points to, it borrows it.
-* `&mut [T]`, a 'mutable slice', mutably borrows the data it points to.
-* `Box<[T]>`, a 'boxed slice'
+* `&[T]`: a 'shared slice', often just called a 'slice'. It doesn't own the
+ data it points to; it borrows it.
+* `&mut [T]`: a 'mutable slice'. It mutably borrows the data it points to.
+* `Box<[T]>`: a 'boxed slice'
Examples:
diff --git a/src/types/tuple.md b/src/types/tuple.md
index 3877a2672f..df21e1cdf4 100644
--- a/src/types/tuple.md
+++ b/src/types/tuple.md
@@ -22,7 +22,7 @@ And so on.
The type of each field is the type of the same position in the tuple's list of types.
For convenience and historical reasons, the tuple type with no fields (`()`) is often called *unit* or *the unit type*.
-It's one value is also called *unit* or *the unit value*.
+Its one value is also called *unit* or *the unit value*.
Some examples of tuple types:
diff --git a/triagebot.toml b/triagebot.toml
index fc4b19e20f..4059f91906 100644
--- a/triagebot.toml
+++ b/triagebot.toml
@@ -1,6 +1,6 @@
[relabel]
allow-unauthenticated = [
- "A-*", "New Content", "Language Cleanup", "Easy", "Formatting", "Enhancement", "Bug",
+ "S-*", "A-*", "New Content", "Language Cleanup", "Easy", "Formatting", "Enhancement", "Bug",
]
[assign]