FSCANF(2) FSCANF(2) NAME fscanf, scanf, sscanf, vfscanf - scan formatted input SYNOPSIS #include <u.h> #include <stdio.h> int fscanf(FILE *f, char *format, ...) int scanf(char *format, ... ) int sscanf(char *s, char *format, ...) int vfscanf(FILE *stream, char *format, char *args) DESCRIPTION Fscanf reads from the named input stream f (see fopen(2)) under control of the string pointed to by format that speci- fies the admissible input sequences and how they are to be converted for assignment, using subsequent arguments as pointers to the objects to receive the converted input. If there are insufficient arguments for the format, the behav- ior is undefined. If the format is exhausted while argu- ments remain, the excess arguments are evaluated (as always) but are otherwise ignored. Scanf and sscanf are the same, but they read from stdin and the character string s, respectively. Vfscanf is like scanf, except the args argument is a pointer to an argument in an argument list of the calling function and the effect is as if the calling function's argument list from that point on is passed to the scanf routines. The format is composed of zero or more directives: one or more white-space characters; an ordinary character (not %); or a conversion specification. Each conversion specifica- tion is introduced by the character %. After the %, the following appear in sequence: An optional assignment-suppressing character *. An optional decimal integer that specifies the maximum field width. An optional h, l (ell) or L indicating the size of the receiving object. The conversion specifiers d, i, and n shall be preceded by h if the corresponding argument is a pointer to short rather than a pointer to int, or by l if it is a pointer to long. Similarly, the con- version specifiers o, u, and x shall be preceded by h Page 1 Plan 9 (printed 9/19/24) FSCANF(2) FSCANF(2) if the corresponding argument is a pointer to unsigned short rather than a pointer to unsigned, or by l if it is a pointer to unsigned long. Finally, the conversion specifiers e, f, and g shall be preceded by l if the corresponding argument is a pointer to double rather than a pointer to float, or by L if it is a pointer to long double. If an h, l, or L appears with any other conversion specifier, the behavior is undefined. A character that specifies the type of conversion to be applied. The valid conversion specifiers are described below. Fscanf executes each directive of the format in turn. If a directive fails, as detailed below, fscanf returns. Fail- ures are described as input failures (due to the unavail- ability of input), or matching failures (due to inappropri- ate input). A directive composed of white space is executed by reading input up to the first non-white-space character (which remains unread), or until no more characters can be read. A directive that is an ordinary character is executed by reading the next character from the stream. If if differs from the one comprising the directive, the directive fails, and the differing and subsequent characters remain unread. A directive that is a conversion specification defines a set of matching input sequences, as described below for each specifier. A conversion specification is executed in the following steps: Input white-space characters (as specified by isspace, see ctype(2)) are skipped, unless the specification includes a [, c, or n specifier. An input item is read from the stream, unless the specifica- tion includes an n specifier. An input item is defined as the longest sequence of input characters (up to any speci- fied maximum field width) which is an initial subsequence of a matching sequence. The first character, if any, after the input item remains unread. If the length of the input item is zero, the execution of the directive fails: this condi- tion is a matching failure, unless an error prevented input from the stream, in which case it is an input failure. Except in the case of a % specifier, the input item (or, in the case of a %n directive, the count of input characters) is converted to a type appropriate to the conversion speci- fier. If the input item is not a matching sequence, the execution of the directive fails: this condition is a Page 2 Plan 9 (printed 9/19/24) FSCANF(2) FSCANF(2) matching failure. Unless assignment suppression was indi- cated by a *, the result of the conversion is placed in the object pointed to by the first argument following the format argument that has not already received a conversion result. If this object does not have an appropriate type, or if the result of the conversion cannot be represented in the space provided, the behavior is undefined. The following conversion specifiers are valid: d Matches an optionally signed decimal integer, whose format is the same as expected for the subject sequence of the strtol (see atof(2)) function with 10 for the base argument. The corresponding argument shall be a pointer to int. i Matches an optionally signed decimal integer, whose format is the same as expected for the subject sequence of the strtol function with 0 for the base argument. The corresponding argument shall be a pointer to int. o Matches an optionally signed octal integer, whose for- mat is the same as expected for the subject sequence of the strtoul (see atof(2)) function with 8 for the base argument. The corresponding argument shall be a pointer to unsigned int. u Matches an optionally signed decimal integer, whose format is the same as expected for the subject sequence of the strtoul function with 10 for the base argument. The corresponding argument shall be a pointer to unsigned int. x Matches an optionally signed hexadecimal integer, whose format is the same as expected for the subject sequence of the strtoul function with 16 for the base argument. The corresponding argument shall be a pointer to unsigned int. e,f,g Matches an optionally signed floating-point number, whose format is the same as expected for the subject string of the strtod (see atof(2)) function. The cor- responding argument shall be a pointer to float. s Matches a sequence of non-white-space characters. The corresponding argument shall be a pointer to the ini- tial character of an array large enough to accept the sequence and a terminating NUL (0) character, which will be added automatically. [ Matches a nonempty sequence of characters from a set Page 3 Plan 9 (printed 9/19/24) FSCANF(2) FSCANF(2) of expected characters (the scanset). The correspond- ing argument shall be a pointer to the initial charac- ter of an array large enough to accept the sequence and a terminating NUL character, which will be added automatically. The conversion specifier includes all subsequent characters in the format string, up to and including the matching right brace (]). The charac- ters between the brackets (the scanlist) comprise the scanset, unless the character after the left bracket is a circumflex (^), in which case the scanset con- tains all characters that do not appear in the scan- list between the circumflex and the right bracket. As a special case, if the conversion specifier begins with [] or [^], the right bracket character is in the scanlist and the next right bracket character is the matching right bracket that ends the specification. If a - character is in the scanlist and is not the first, nor the second where the first character is a ^, nor the last character, the behavior is implementation-defined (in Plan 9: the scanlist includes all characters in the ASCII (sic) range between the two characters on either side of the -). c Matches a sequence of characters of the number speci- fied by the field width (1 if no field width is pre- sent in the directive). The corresponding argument shall be a pointer to the initial character of an array large enough to accept the sequence. No NUL character is added. P Matches an implementation-defined set of sequences, which should be the same as the set of sequences that may be produced by the %P conversion of the fprintf(2) function (in Plan 9, a hexadecimal number). The cor- responding argument shall be a pointer to a pointer to void. The interpretation of the input item is imple- mentation defined; however, for any input item other than a value converted earlier during the same program execution, the behavior of the %P conversion is unde- fined. n No input is consumed. The corresponding argument shall be a pointer to integer into which is written the number of characters read from the input stream so far by this call to fscanf. Execution of a %n direc- tive does not increment the assignment count returned at the completion of fscanf. % Matches a single %; no conversion or assignment occurs. The complete conversion specification shall be %%. Page 4 Plan 9 (printed 9/19/24) FSCANF(2) FSCANF(2) If a conversion specification is invalid, the behavior is undefined. The conversion specifiers E, G, and X are also valid and behave the same as, respectively, e, g, and x. If end-of-file is encountered during input, conversion is terminated. If end-of-file occurs before any characters matching the current directive have been read (other than leading white space, where permitted), execution of the cur- rent directive terminates with an input failure; otherwise, unless execution of the current directive is terminated with a matching failure, execution of the following directive (if any) is terminated with an input failure. If conversion terminates on a conflicting input character, the offending input character is left unread in the input stream. Trailing white space (including newline characters) is left unread unless matched by a directive. The success of literal matches and suppressed assignments is not directly determinable other than via the %n directive. The return value from fscanf is the number of input items assigned, which can be fewer than provided for, or even zero, in the event of an early matching failure. However, if an input failure occurs before any conversion, EOF is returned. SOURCE /sys/src/libstdio SEE ALSO fopen(2), fgetc(2) BUGS Does not know about UTF. Page 5 Plan 9 (printed 9/19/24)