external/cde - Personal Git space

mirror of git://git.code.sf.net/p/cdesktopenv/code synced 2025-03-09 15:50:02 +00:00

Author	SHA1	Message	Date
Johnothan King	a0eeb14787	Stop the time keyword overriding errexit (#351 ) This bug was first reported in <https://www.illumos.org/issues/7694>. The time keyword currently overrides the errexit shell option, allowing failing scripts to continue after an error: $ cat 1.sh #!/bin/sh time false # This should cause the script to exit echo FAILURE true $ ksh -o errexit 1.sh real 0m0.00s user 0m0.00s sys 0m0.00s FAILURE src/cmd/ksh93/sh/xec.c: - When the time keyword runs a command, pass the errexit state flag to the sh_exec call. This state flag is required for ksh to exit when a command fails while the errexit option is on. src/cmd/ksh93/tests/basic.sh: - Add a regression test based on the reproducer.	2021-11-29 20:12:15 +01:00
Martijn Dekker	f508660ddf	Revert "Fix defining types conditionally and/or in subshells (re: `8ced1daa`)" This reverts commit `2b9cbbbc8e`. This is not ready for prime time. Crashses when running a $PS2 discipline function. This needs fixing and more testing in development before making it into the 1.0 branch. In the meantime, that terrible problem with types is back, sorry about that.	2021-11-29 20:08:53 +01:00
Martijn Dekker	2b9cbbbc8e	Fix defining types conditionally and/or in subshells (re: `8ced1daa`) This commit mitigates the effects of the hack explained in the referenced commit so that dummy built-in command nodes added by the parser for declaration/assignment purposes do not leak out into the execution level, except in a relatively harmless corner case. Something like if false; then typeset -T Foo_t=(integer -i bar) fi will no longer leave a broken dummy Foo_t declaration command. The same applies to declaration commands created with enum. The corner case remaining is: $ ksh -c 'false && enum E_t=(a b c); E_t -a x=(b b a c)' ksh: E_t: not found Since the 'enum' command is not executed, this should have thrown a syntax error on the 'E_t -a' declaration: ksh: syntax error at line 1: `(' unexpected This is because the -c script is parsed entirely before being executed, so E_t is recognised as a declaration built-in at parse time. However, the 'not found' error shows that it was successfully eliminated at execution time, so the inconsistent state will no longer persist. This fix now allows another fix to be effective as well: since built-ins do not know about virtual subshells, fork a virtual subshell into a real subshell before adding any built-ins. src/cmd/ksh93/sh/parse.c: - Add a pair of functions, dcl_hactivate() and dcl_dehacktivate(), that (de)activate an internal declaration built-ins tree into which check_typedef() can pre-add dummy type declaration command nodes. A viewpath from the main built-ins tree to this internal tree is added, unifying the two for search purposes and causing new nodes to be added to the internal tree. When parsing is done, we close that viewpath. This hides those pre-added nodes at execution time. Since the parser is sometimes called recursively (e.g. for command substitutions), keep track of this and only activate and deactivate at the first level. - We also need to catch errors. This is done by setting libast's error_info.exit variable to a dcl_exit() function that tidies up and then passes control to the original (usually sh_exit()). - sh_cmd(): This is the most central function in the parser. You'd think it was sh_parse(), but $(modern)-form command substitutions use sh_dolparen() instead. Both call sh_cmd(). So let's simply add a dcl_hacktivate() call at the beginning and a dcl_deactivate() call at the end. - assign(): This function calls path_search(), which among many other things executes an FATH search, which may execute arbitrary code at parse time (!!!). So, regardless of recursion level, forcibly dehacktivate() to avoid those ugly parser side effects returning in that context. src/cmd/ksh93/bltins/enum.c: b_enum(): - Fork a virtual subshell before adding a built-in. src/cmd/ksh93/sh/xec.c: sh_exec(): - Fork a virtual subshell when detecting typeset's -T option. Improves fix to https://github.com/ksh93/ksh/issues/256	2021-11-29 09:02:07 +01:00
Martijn Dekker	43cd8da2fe	Fix 'command' prefix in enum type def pre-parsing (re: 1dc18346) Symptom: $ ksh -c 'command enum -i P_t=(a b); P_t -A v=([f]=b); typeset -p v' ksh: syntax error at line 1: `(' unexpected Expected: no syntax error, and output of 'P_t -A v=([f]=b)'. src/cmd/ksh93/sh/parse.c: check_typedef(): - For enum, skip over any possible 'command' prefixes before pre-parsing options with optget (or, technically, skip anything else that might come before 'enum', though I don't think anything else is possible). - The sh_addbuiltin() call at the end to pre-add the builtin obtained the node pointer to the built-in and the node flags from the parser tree. This did not work if a 'command' prefix was present. However, we don't actually need this. For parsing purposes, the BLT_DCL flag for a declaration built-in is sufficient; this is what gets the parser to accept assignment-arguments including parentheses. So just apply that. In addition, let's point it to an actual dummy built-in, 'true' (SYSTRUE), so that if a user does run something like 'if false; then enum Foo_t=(...); fi', the leaked Foo_t dummy at least won't do anything (not even crash).	2021-11-28 21:16:19 +01:00
Martijn Dekker	c9ca0ff531	typeset equivalents: use 'typeset' in error messages (re: `1fbbeaa1`) When giving an invalid or incompatible option to a typeset option equivalent command (former default alias) such as 'compound' or 'integer', the resulting usage messages are incorrect. Example: $ ksh -c 'compound -T foo=(typeset -a bar[1]=23)' ksh: compound: -T cannot be used with other options Usage: compound [-bflmnprstuxACHS] [-a[[type]]] [-i[base]] [-E[n]] [-F[n]] [-L[n]] [-M[mapping]] [-R[n]] [-X[n]] [-h string] [-T[tname]] [-Z[n]] [name[=value]...] Or: compound -f [name...] Or: compound -m [name=name...] Or: compound -n [name=name...] Or: compound -T [tname[=(type definition)]...] Help: compound [ --help \| --man ] 2>&1 The error message is wrong (there were no other options) and some of the listed usages are invalid, like 'compound -f'. Typeset option equivalent commands should just use 'typeset' in all their error messages to avoid confusion. This is done by setting error_info.id to the name of the typeset builtin.	2021-11-28 21:16:17 +01:00
Martijn Dekker	7318afc278	jobs.c: refactor SIGHUP handling; document bug fixed (re: `62cf88d0`) There is quite a bit of no-op code in the job_hup() function due to conditions that always test false. This commit removes that code and clarifies the rest, making the purpose of this function clear. job_hup() (before `62cf88d0`: job_terminate()) is called via job_walk() by sh_done() in fault.c to issue SIGHUP, the "hang up" signal, to every background job's process group when the current session is ungracefully disconnected. (One way to trigger such a disconnection is to forcibly terminate a ssh session by typing '~.' on a new prompt.) The bug that Solaris patch 260-22964338 fixed is that ksh then killed all non-disowned jobs' process groups without considering that ksh still remembers a job even when all its processes are finished (have the P_DONE flag). In that condition, the process group ID may well be reused by another process by now, so it is dangerous to killpg() it; we risk killing unrelated processes! This is not a hypothetical problem; the Solaris patch exists because this happened to a Solaris customer. However, the bug exists on all operating systems. It's rarely triggered but serious, and it's more likely to occur on heavy workloads that re-use process/group IDs a lot. And it's on every currently released non-Solaris version of ksh93. Eesh. src/cmd/ksh93/sh/jobs.c: src/cmd/ksh93/include/jobs.h: - Remove job_terminate() which was unused as of `62cf88d0`. It could have been fixed instead of replaced. Oh well. - Refactor job_hup(): - Remove code that will never be executed because, at those points, it is known that pw->p_pgrp != 0. - Simplify the loop that checks that there is at least one non-P_DONE process so it doesn't need a flag. For documentation purposes, below is a reproducer for the bug before the Solaris patch. It is rather involved. 1. Compile the C program below (cpid). 2. In one terminal, 'ssh localhost'. 3. Within the ssh session: - 'exec -a-ksh /path/to/buggy/ksh' to get a ksh login shell. - 'sleep 1 &' and let it finish. Note down the reported PID. That is the one we will reuse. Let's say 26650. 4. In another terminal, run: ./cpid 26650 (the PID from the previous step). Now wait until it says "PID 26650 is ready"; it has now succeeded at re-using that PID, and will just sit there. This process will never voluntarily terminate. If we have the bug, the termination of this process will be the symptom. 5. In the first terminal, forcibly terminate the ssh session by typing, on a new prompt: ~. (tilde, dot). This triggers the buggy routine to issue SIGHUP to all of ksh's background jobs. 6. In the second terminal, the bug is reproduced if cpid has been terminated, reporting 'waitpid return 26650, status 0x0001', so ksh just killed this process that it had nothing to do with. (Note that status 0x0001 refers to being killed by signal 1 which is SIGHUP.) cpid.c follows (written by George Lijo, tweaked by me): #include <stdio.h> #include <stdlib.h> #include <unistd.h> #include <signal.h> #include <sys/wait.h> int main(int argc, char *argv[]) { pid_t pid, rpid, opid; int i, status, npid; if (argc != 2) { fprintf(stderr, "Usage: cpid <PID to re-use>\n"); exit(1); } rpid = atoi(argv[1]); opid = getpid(); for (;;) { if ((pid = fork()) == 0) { setpgrp(); pause(); _exit(0); } if (pid == rpid) break; kill(pid, SIGKILL); waitpid(pid, NULL, 0); if (opid < rpid && pid > rpid) printf("Cannot create PID %d\n", rpid); opid = pid; } printf("PID %d is ready\n", pid); i = waitpid(pid, &status, 0); printf("waitpid return %d, status 0x%4.4x\n", i, status); return status; }	2021-11-25 19:29:17 +01:00
Martijn Dekker	f3433a696a	Reset sh.arithrecursion in sh_exit() instead (re: `d50d3d7c`) Since the arithmetic recursion level only becomes incorrect when an error interrupts the arithmetic subsystem, and all such error messages call sh_exit(), it should be good enough to reset it there, so we don't need to do that for nearly every sh_exec() run.	2021-11-25 10:26:09 +01:00
Martijn Dekker	27ccdd2517	Fix parentheses in sh_{push,pop}context macros The lack of parentheses around the shp parameter expansion made it impossible to pass something like &sh as the first parameter.	2021-11-25 04:11:41 +01:00
Johnothan King	84ded2d0c4	Backport the ksh93v- rm builtin to fix 'rm -d' (#348 ) The -d flag implemented in the rm builtin is completely broken. No matter what you do it refuses to remove directories, even if -r is also passed. Reproducer: $ mkdir /tmp/empty $ PATH=/opt/ast/bin rm -d /tmp/empty rm: /tmp/empty: directory $ PATH=/opt/ast/bin rm -dr /tmp/empty rm: /tmp/empty: directory not removed [Is a directory] Additionally, the description of 'rm -d' in the man page contradicts how it's specified in <https://www.austingroupbugs.net/view.php?id=802>. The ksh93v- rm builtin fixed nearly all of these issues, so I've backported it to 93u+m and applied one additional fix for 'rm -rd'. src/lib/libcmd/rm.c: - Backported the fixes from the ksh93v- rm builtin's -d flag when used on empty directories. - Backported the man page update for rm(1) from ksh93v-. - The ksh93v- rm builtin had one additional bug that caused the -r option to fail when combined with -d. This was fixed by overriding -d if -r is also passed. src/cmd/ksh93/tests/builtins.sh: - Add regression tests for the rm builtin's -d option.	2021-11-25 03:52:05 +01:00
Martijn Dekker	2d65148fad	arith.c: scope(): de-obfuscate some code This function adds the NV_ADD flag to its 'flags' variable for nv_serach() calls subject to some checks. However, every call that uses that variable explicitly turns off the NV_ADD bit again. A search in the ast-open-history repo reveals that this check briefly made a difference between versions 2010-06-25 and 2010-08-11, but it's been a complete no-op ever since. src/cmd/ksh93/sh/arith.c: scope(): - Remove no-op code. - Resolve the constant expressions involving the 'flags' variable, get rid of the variable, and just indicate the flag bitmasks directly in the nv_search() calls. - Detangle and split up the excessively long 'if' construct. No change in behaviour. Previously noticed by Kurtis Rader for ksh2020: `d5ce3b05`	2021-11-25 03:25:39 +01:00
Martijn Dekker	214308f81e	'.': disable ksh function lookup in POSIX mode POSIXly, '.' loads only files, not functions. This only applies to '.', not 'source' (which is not in POSIX). src/cmd/ksh93/bltins/misc.c: b_source(): - For ksh function lookup, add an additional check that we're not in POSIX mode and running the '.' (SYSDOT) builtin.	2021-11-24 09:12:39 +01:00
Martijn Dekker	c0334e32a1	[1.0 release prep] Remove tilde expansion discipline Defining a .sh.tilde.get or .sh.tilde.set discipline function to extend tilde expansion works well as long as the discipline function doesn't get interrupted (e.g. with Crtl+C) or produce an error message. Either of those will cause the shell to become unstable and crash. This feature is now removed from the 1.0 branch as it is not ready for prime time. It can return to a release branch if/when we manage to fix it on the master branch. Related: https://github.com/ksh93/ksh/issues/346	2021-11-24 07:46:58 +01:00
Martijn Dekker	40de1e92b0	[1.0 release prep] Block namespace defs in ksh functions In 93v-/ksh2020, namespace defs in any function are a syntax error. This commit blocks namespace defs for ksh functions only, at the execution level. This follows some of AT&T original intention while working around some of the known bugs with namespaces. Related: https://github.com/ksh93/ksh/issues/325	2021-11-24 07:31:22 +01:00
Martijn Dekker	e3d91ffa90	nv_associative(): finally use proper check for enum (re: b98e32fc) As of the previous commit, I finally know how to properly check for a variable of a type created by 'enum'. We need to check for both the NV_UINT16 attribute and the ENUM_disc discipline. Also: - regression test tweaks - add missing tests for previous commit (f600a5ea)	2021-11-24 02:06:08 +01:00
Martijn Dekker	a66cd72f7d	arith: implement range checking for enum types Within arithmetic expressions, enumeration values of variables of a type created with the 'enum' command translate to index numbers from 0 to the number of elements minus 1. However, there was no range checking on this in the arithmetic subsystem, allowing the assignment of out-of-range values that did not correspond to any enumeration value. Variables of an enum type are internally unsigned short integers (NV_UINT16), like those created with 'integer -su', except with an additional discipline function (ENUM_disc). src/cmd/ksh93/bltins/enum.c, src/cmd/ksh93/include/builtins.h: - To implement range checking, the arithmetic system needs access to the 'nelem' (number of elements) member of 'struct Enum'. This is only defined locally in enum.c. We could move that to name.h so arith.c can access it, but enum.c has code that supports compiling as standalone. So, instead, define a quick extern function, b_enum_elem(), that does the necessary type conversion and returns a type's number of elements. - Add --man documentation for the arithmetic subsystem behaviour for enum types. Tell the enuminfo() function, which dynamically inserts values into the documentation, how to process new \f tags 'lastv' (the last-defined value) and 'lastn' (the number of the last element). src/cmd/ksh93/sh/arith.c: arith(): - For NV_UINT16 variables with an ENUM_disc discipline, check the range using b_enum_elem() and error out if necessary. Resolves: https://github.com/ksh93/ksh/issues/335	2021-11-23 22:10:40 +01:00
Johnothan King	e26937b36a	Add support for 'stty size' to the libcmd 'stty' builtin (#342 ) This commit adds support for 'stty size' to the stty builtin, as defined in <https://austingroupbugs.net/view.php?id=1053>. The size mode is used to display the terminal's number of rows and columns. Note that stty isn't included in the default list of builtin commands; testing this addition requires adding CMDLIST(stty) to the table of builtins in src/cmd/ksh93/data/builtins.c. src/lib/libcmd/stty.c: - Add support for the size mode to the stty builtin. This mode is only used to display the terminal's number of rows and columns, so error out if any arguments are given that attempt to set the terminal size.	2021-11-23 15:38:14 +01:00
Martijn Dekker	10ef74e1a2	shtests: unignore SIGCONT For some reason, Void Linux (with musl libc) sets SIGCONT to ignored on the Linux console, causing the 'sleep -s' test in builtins.sh to fail spuriously as it relies on SIGCONT to work. src/cmd/ksh93/tests/shtests: - Reset SIGCONT using the unadvertised 'trap + SIGCONT' feature. Resolves: https://github.com/ksh93/ksh/issues/301	2021-11-22 16:55:51 +01:00
Martijn Dekker	74730c8ac7	test/[: Improve error status > 1 (re: `7003aba4`, `cd2cf236`, `ef1f53b5`) As I got to know the code better, it now seems painfully obvious that getting test/[ to issue an exit status >= 2 on error only requires a simple check in sh_exit() in fault.c, which is called whenever the shell issues an error message.	2021-11-22 15:37:04 +01:00
Martijn Dekker	8ced1daadf	Fix enum type definition pre-parsing for shcomp and dot/source Parser limitations prevent shcomp or source from handling enum types correctly: $ cat /tmp/colors.sh enum Color_t=(red green blue orange yellow) Color_t -A Colors=([foo]=red) $ shcomp /tmp/colors.sh > /dev/null /tmp/colors.sh: syntax error at line 2: `(' unexpected $ source /tmp/colors.sh /bin/ksh: source: syntax error: `(' unexpected Yet, for types created using 'typeset -T', this works. This is done via a check_typedef() function that preliminarily adds the special declaration builtin at parse time, with details to be filled in later at execution time. This hack will produce ugly undefined behaviour if the definition command creating that type built-in is then not actually run at execution time before the type built-in is accessed. But the hack is necessary because we're dealing with a fundamental design flaw in the ksh language. Dynamically addable built-ins that change the syntactic parsing of the shell language on the fly are an absurdity that violates the separation between parsing and execution, which muddies the waters and creates the need for some kind of ugly hack to keep things like shcomp more or less working. This commit extends that hack to support enum. src/cmd/ksh93/sh/parse.c: - check_typedef(): - Add 'intypeset' parameter that should be set to 1 for typeset and friends, 2 for enum. - When processing enum arguments, use AST getopt(3) to skip over enum's options to find the name of the type to be defined. (getopt failed if we were running a -c script; deal with this by zeroing opt_info.index first.) - item(): Update check_typedef() call, passing lexp->intypeset. - simple(): Set lexp->intypeset to 2 when processing enum. The rest of the changes are all to support the above and should be fairly obvious, except: src/cmd/ksh93/bltins/enum.c: - enuminfo(): Return on null pointer, avoiding a crash upon executing 'Type_t --man' if Type_t has not been fully defined due to the definition being pre-added at parse time but not executed. It's all still wrong, but a crash is worse. Resolves: https://github.com/ksh93/ksh/issues/256	2021-11-21 17:43:55 +01:00
Martijn Dekker	996def3141	builtins.h: rm broken check for removed SYSDECLARE (re: `921bbcae`)	2021-11-21 17:43:23 +01:00
Martijn Dekker	893c6a9068	nv_associative(): clarify value indicating enum (re: `6b9703ff`)	2021-11-21 17:43:14 +01:00
Johnothan King	e554a07c56	`typeset -T` shouldn't list types created with `enum` (#340 ) Listing types with 'typeset -T' will list not only types created with typeset, but also types created with enum. However, the types created by enum are not displayed correctly in the resulting output: $ enum Foo_t=(foo bar) $ typeset -T typeset -T Foo_t typeset -T Foo_t=fo) The fix for this bug was backported from ksh93v- 2013-10-08. src/cmd/ksh93/sh/nvtype.c: - sh_outtype(): Skip over enums when listing types with 'typeset -T'.	2021-11-20 09:48:48 +01:00
Martijn Dekker	cb961788a8	shell.3: fix formatting for sh_{g,s}etscope	2021-11-20 04:53:42 +01:00
Martijn Dekker	98ea0c2dbb	tests/signal.sh: fix AT&T's err_exit bogosity (re: `712261c8`)	2021-11-20 03:31:10 +01:00
Martijn Dekker	6829fc9a29	tests/leaks.sh: tweak Linux tolerance again (re: `31fe1c28`) The referenced commit did not fix the symptoms on the 1.0 branch (no vmalloc) on the GitHub CI runners. The failures are intermittent and are not reproduced with vmalloc or on other operating systems. Though the failures occur on a different test each time, the total amount of "leaked" bytes is always 36864, e.g.: leaks.sh[388]: run command with preceding PATH assignment in main shell (leaked approx 36864 bytes after 4096 iterations) 36864/4096 equals exactly 9. An odd number, literally and figuratively, but I suppose that's the tolerance Linux needs. src/cmd/ksh93/tests/leaks.sh - Increase tolerance of bytes per iteration from 8 to 9.	2021-11-19 20:21:25 +01:00
Johnothan King	396b388e1f	Fix a few issues with $RANDOM seeding in subshells (#339 ) This commit fixes an issue I found in the subshell $RANDOM reseeding code. The main issue is a performance regression in the shbench fibonacci benchmark, introduced in commit `af6a32d1`. Performance dropped in this benchmark because $RANDOM is always reseeded and restored, even when it's never used in a subshell. Performance results from before and after this performance fix (results are on Linux with CC=gcc and CCFLAGS='-O2 -D_std_malloc'): $ ./shbench -b bench/fibonacci.ksh -l 100 ./ksh-0f06a2e ./ksh-af6a32d ./ksh-f31e368 ./ksh-randfix benchmarking ./ksh-0f06a2e, ./ksh-af6a32d, ./ksh-f31e368, ./ksh-randfix ... * fibonacci.ksh * # ./ksh-0f06a2e # Recent version of ksh93u+m # ./ksh-af6a32d # Commit that introduced the regression # ./ksh-f31e368 # Commit without the regression # ./ksh-randfix # Ksh93u+m with this patch applied ------------------------------------------------------------------------------------------------- name ./ksh-0f06a2e ./ksh-af6a32d ./ksh-f31e368 ./ksh-randfix ------------------------------------------------------------------------------------------------- fibonacci.ksh 0.481 [0.459-0.515] 0.472 [0.455-0.504] 0.396 [0.380-0.442] 0.407 [0.385-0.439] ------------------------------------------------------------------------------------------------- src/cmd/ksh93/include/variables.h, src/cmd/ksh93/sh/{init,subshell}.c: - Rather than reseed $RANDOM every time a subshell is created, add a sh_save_rand_seed() function that does this only when the $RANDOM variable is used in a subshell. This function is called by the $RANDOM discipline functions nget_rand() and put_rand(). As a minor optimization, sh_save_rand_seed doesn't reseed if it's called from put_rand(). - Because $RANDOM may have a seed of zero (i.e., RANDOM=0), sp->rand_seed isn't enough to tell if $RANDOM has been reseeded. Add sp->rand_state for this purpose. - sh_subshell(): Only restore the former $RANDOM seed and state if it is necessary to prevent a subshell leak. src/cmd/ksh93/tests/variables.sh: - Add two regression tests for bugs I ran into while making this patch.	2021-11-19 08:18:44 +01:00
Martijn Dekker	745ffd366d	sh.1: Add missing printf -v doc (re: eb760a62); more tweaks Also add a missing 'Shell Variables' heading that is referred to elsewhere, and capitalise the ASCII acronym.	2021-11-19 05:32:09 +01:00
Martijn Dekker	15bbc2f632	manual: use consistent terminology The ksh manual page is one of the few places that calls globbing "file name generation". The mksh and zsh manuals use the same term. But every other shell's manual calls it "pathname expansion": bash, dash, yash, FreeBSD sh. So does ksh's built-in documentation (alias --man, export --man, readonly --man, set --man, typeset --man). What's more, the authoritative ksh reference, Bolsky & Korn's 1995 "The New Kornshell" book, also calls it "pathname expansion", and so does the POSIX standard. Similarly, "arithmetic substitution" should be called "arithmetic expansion" per Bolsky & Korn as well as POSIX. This commit has several other miscellaneous documentation tweaks as well.	2021-11-19 03:54:42 +01:00
Martijn Dekker	bd9752e43c	Backport 'printf -v' from ksh 93v- 'printf' on bash and zsh has a popular -v option that allows assigning formatted output directly to variables without using a command substitution. This is much faster and avoids snags with stripping final linefeeds. AT&T had replicated this feature in the abandoned 93v- beta version. This backports it with a few tweaks and one user-visible improvement. The 93v- version prohibited specifying a variable name with an array subscript, such as printf -v var\[3\] foo. This works fine on bash and zsh, so I see no reason why this should not work on ksh, as nv_putval() deals with array subscripts just fine. src/cmd/ksh93/bltins/print.c: b_print(): - While processing the -v option when called as printf, get a pointer to the variable, creating it if necessary. Pass only the NV_VARNAME flag to enforce a valid variable name, and not (as 93v- does) the NV_NOARRAY flag to prohibit array subscripts. - If a variable was given, set the output file to an internal string buffer and jump straight to processing the format. - After processing the format, assign the contents to the string buffer to the variable. src/cmd/ksh93/data/builtins.c: - Document the new option, adding a warning that unquoted square brackets may trigger pathname expansion.	2021-11-19 03:54:33 +01:00
Martijn Dekker	fb8308243c	printf: fix %(pattern)q documentation in 'printf --man' %(pattern)q is equivalent to %P. It's also equivalent to %#P, but since the alternative format specifier '#' does nothing for %P, %P and %#P are the same and documenting #%P is just confusing. Thanks to @stephane-chazelas for the report. src/cmd/ksh93/bltins/print.c: - In the printmap struct, document %P as equivalent of %(pattern)q. - Sort it alphabetically. - Do not pointlessly repeat the string "Equivalent to". Instead, let the discipline function infof() insert it for each entry. (This is the function used to dynamically insert the equivalents documentation into the --man output at the \fextra\f tag in sh_optprintf[] in data/builtins.c.) Resolves: https://github.com/ksh93/ksh/issues/338	2021-11-18 17:46:38 +01:00
Martijn Dekker	0b0d0094b9	bltins/misc.c: exec: finish cleanup (re: `d8eba9d1`) An obsolete struct was left that passed some variables on between b_exec() and the deleted B_login(). We can simply make those local variables now. Let's get rid of the redundant sh pointer, too.	2021-11-18 04:38:46 +01:00
Martijn Dekker	1e96013367	tests/pty.sh: fix two failures due to typeahead on Debian Bullseye As the (original AT&T) comment at the top says, "the trickiest part of the tests is avoiding typeahead in the pty dialogue". Two tests failed to [p]eek at the prompt before they started 'typing'. This causes unpredictable results. On Debian Bullseye this triggers typeahead, which produces unwanted echo to the terminal, killing the tests. src/cmd/ksh93/tests/pty.sh: - Add missing 'p' commands for the first prompt to the tests 'nobackslashctrl in emacs' and 'emacs backslash escaping'. Resolves: https://github.com/ksh93/ksh/issues/332	2021-11-17 23:05:05 +01:00
Martijn Dekker	77c7de7cc7	package: fix Bourne compat (re: `48e6dd98`) Tried to compile on Solaris 10.1 for the first time in a while. Turns out the obsolete Bourne /bin/sh does not support 'test -e'. bin/package, src/cmd/INIT/package.sh: - Use 'test -f' instead.	2021-11-17 06:09:35 +01:00
Martijn Dekker	c734568b02	arithmetic: Fix the octal leading zero mess (#337 ) In C/POSIX arithmetic, a leading 0 denotes an octal number, e.g. 010 == 8. But this is not a desirable feature as it can cause problems with processing things like dates with a leading zero. In ksh, you should use 8#10 instead ("10" with base 8). It would be tolerable if ksh at least implemented it consistently. But AT&T made an incredible mess of it. For anyone who is not intimately familiar with ksh internals, it is inscrutable where arithmetic evaluation special-cases a leading 0 and where it doesn't. Here are just some of the surprises/inconsistencies: 1. The AT&T maintainers tried to honour a leading 0 inside of ((...)) and $((...)) and not for arithmetic contexts outside it, but even that inconsistency was never quite consistent. 2. Since 2010-12-12, $((x)) and $(($x)) are different: $ /bin/ksh -c 'x=010; echo $((x)) $(($x))' 10 8 That's a clear violation of both POSIX and the principle of least astonishment. $((x)) and $(($x)) should be the same in all cases. 3. 'let' with '-o letoctal' acts in this bizarre way: $ set -o letoctal; x=010; let "y1=$x" "y2=010"; echo $y1 $y2 10 8 That's right, 'let y=$x' is different from 'let y=010' even when $x contains the same string value '010'! This violates established shell grammar on the most basic level. This commit introduces consistency. By default, ksh now acts like mksh and zsh: the octal leading zero is disabled in all arithmetic contexts equally. In POSIX mode, it is enabled equally. The one exception is the 'let' built-in, where this can still be controlled independently with the letoctal option as before (but, because letoctal is synched with posix when switching that on/off, it's consistent by default). We're also removing the hackery that causes variable expansions for the 'let' builtin to be quietly altered, so that 'x=010; let y=$x' now does the same as 'let y=010' even with letoctal on. Various files: - Get rid of now-redundant sh.inarith (shp->inarith) flag, as we're no longer distinguishing between being inside or outside ((...)). src/cmd/ksh93/sh/arith.c: - arith(): Let disabling POSIX octal constants by skipping leading zeros depend on either the letoctal option being off (if we're running the "let" built-in") or the posix option being off. - sh_strnum(): Preset a base of 10 for strtonll(3) depending on the posix or letoctal option being off, not on the sh.inarith flag. src/cmd/ksh93/include/argnod.h, src/cmd/ksh93/sh/args.c, src/cmd/ksh93/sh/macro.c: - Remove astonishing hackery that violated shell grammar for 'let'. src/cmd/ksh93/sh/name.c (nv_getnum()), src/cmd/ksh93/sh/nvdisc.c (nv_getn()): - Remove loops for skipping leading zeroes that included a broken check for justify/zerofill attributes, thereby fixing this bug: $ typeset -Z x=0x15; echo $((x)) -ksh: x15: parameter not set Even if this code wasn't redundant before, it is now: sh_arith() is called immediately after the removed code and it ignores leading zeroes via sh_strnum() and strtonll(3). Resolves: https://github.com/ksh93/ksh/issues/334	2021-11-17 04:28:08 +01:00
Martijn Dekker	257eea612a	edit.c: don't trace tput command on init (re: `ef8b80cf`) When starting a new interactive ksh with the -v or -x option, an annoying symptom occurs: the 'tput' command that ed_setup() issues to get the escape sequence for cursor-up is xtraced or echoed, corrupting prompt display, for example ('▂' is the cursor): $ ksh -x $ + /usr/bin/tput cuu1 + 2> /dev/null + .sh.subscript=$'\E[A' ▂ or $ ksh -v $ .sh.subscript=$(/usr/bin/tput cuu1 2>/dev/null)▂ src/cmd/ksh93/edit/edit.c: ed_setup(): - Turn off xtrace and verbose while sh_trap()ing tput.	2021-11-17 04:27:20 +01:00
Martijn Dekker	54674cb325	shcomp: refuse to write binary data to terminal So, shcomp has messed up my terminal once too often by writing compiled binary data to it. While fixing that I've done some other tweaks as well. src/cmd/ksh93/sh/shcomp.c: main(): - Fix error/warning message id (the "name:" prefix before messages) so it makes sense to the user. Save shcomp's argv[0] id for error messages that are directly from shcomp's main(), and use the argv[1] script id (set by sh_init()) for warnings produced by the compilation process. If there is no script id because we're reading from stdin, set it to "(stdin)". - If no arguments are given, refuse to read from standard input if it's on a tty. Instead, write a brief usage message (with pointer to --help and --man, see `e21a053e`) and exit. This is far more helpful; people will rarely want to compile a script by manually typing it in. If you really want to do that, use /dev/stdin as the input filename. :) - Error out if we're about to write binary data to a tty (even if /dev/stdout was given as the output filename). - Turn off SH_MULTILINE to avoid some pointless editor init in case we're reading from stdin on a terminal. - Do not attempt to copy remaining data if we're already at EOF. This fixes a bug that required the user to press Ctrl+D twice when manually entering a script on the terminal. Pressing Ctrl+D once and then entering more data would corrupt the bytecode.	2021-11-16 23:34:52 +01:00
Johnothan King	b40155fae8	Fix file descriptor leaks in the `hist` builtin (#336 ) This commit fixes two file descriptor leaks in the hist built-in. The bugfix for the first file descriptor leak was backported from ksh2020. See: https://github.com/att/ast/issues/872 `73bd61b5` Reproducer: $ echo no $ hist -s no=yes The second file descriptor leak occurs after a substitution error in the hist built-in (this leak wasn't fixed in ksh2020). Reproducer: $ echo no $ ls /proc/$$/fd $ hist -s no=yes $ hist -s no=yes $ ls /proc/$$/fd src/cmd/ksh93/bltins/hist.c: - Close leftover file descriptors when an error occurs and after 'hist -s' runs a command. src/cmd/ksh93/tests/builtins.sh: - Add two regression tests for both of the file descriptor leaks.	2021-11-16 23:34:46 +01:00
Martijn Dekker	7ea95b7df3	tests: fix intermittent $RANDOM reseeding fails (re: `af6a32d1`) When testing whether subshell $RANDOM reseeding worked, checking for non-identical numbers is not sufficient. There is no check for randomly occurring duplicate numbers, nor can there be, because subshells cannot (or, in the case of virtual subshells, should not) influence each other or the parent shell. src/cmd/ksh93/tests/variables.sh: - Try up to three times, tolerating identical numbers twice.	2021-11-15 21:16:39 +01:00
Martijn Dekker	56c2e13e92	arith: Fix variables 'nan' and 'inf' in arithmetic for POSIX mode The --posix compliance option now disables the case-insensitive special floating point constants Inf and NaN so that all case variants of $((inf)) and $((nan)) refer to the variables by those names as the standard requires. (BUG_ARITHNAN) src/cmd/ksh93/sh/arith.c: arith(): - Only do case-insensitive checks for "Inf" and "NaN" if the POSIX option is off.	2021-11-15 21:16:23 +01:00
Martijn Dekker	d9cd49c6d7	Remove duplicate error message e_badnum from streval.h and e_number from shell.h are both defined as "%s: bad number". We only need one. Remove the one that is used only once: e_badnum.	2021-11-15 21:15:41 +01:00
Martijn Dekker	ef1f53b5b2	test/[: rm SH_INTESTCMD; test for 'test' directly (re: `cd2cf236`) Turns out there is a way to check what built-in we're running at any time. It is done for 'let' in arith.c: sh.bltindata.bnode==SYSLET For test/[, that would be (see include/builtins.h): sh.bltindata.bnode==SYSTEST \|\| sh.bltindata.bnode==SYSBRACKET	2021-11-15 21:15:25 +01:00
Martijn Dekker	a4375f3090	Fix crash on unsetting .sh.match ksh crashed after unsetting .sh.match and then matching a pattern: $ unset .sh.match $ [[ bar == ba* ]] Memory fault src/cmd/ksh93/sh/init.c: sh_setmatch(): - Do nothing if we cannot get an array pointer to SH_MATCHNOD.	2021-11-15 21:15:08 +01:00
Martijn Dekker	31fe1c2890	tests/leaks.sh: increase iterations on Linux There are one or two leaks that show up intermittently on the Github runners for the 1.0 branch (which is compiled as a release, i.e. no vmalloc). If they're intermittent, they must be false positives due to malloc artefacts. Let's double the number of iterations for the /proc/$$/stat method and see what happens.	2021-11-15 03:00:40 +01:00
Martijn Dekker	d9f1fdaa41	Fix [ $ str -a str $ ], [ $ str -o str $ ] Symptoms: $ test $ string1 -a string2 $ /usr/local/bin/ksh: test: argument expected $ test $ string1 -o string2 $ /usr/local/bin/ksh: test: argument expected The parentheses should be irrelevant and this should be a test for the non-emptiness of string1 and/or string2. src/cmd/ksh93/bltins/test.c: - b_test(): There is a block where the case of 'test' with five or less arguments, the first and last one being parentheses, is special-cased. The parentheses are removed as a workaround: argv is increased to skip the opening parenthesis and argc is decreased by 2. However, there is no corresponding increase of tdata.av which is a copy of this function's argv. This renders the workaround ineffective. The fix is to add that increase. - e3(): Do not handle '!' as a negator if not followed by an argument. This allows a right-hand expression that is equal to '!' (i.e. a test for the non-emptiness of the string '!').	2021-11-15 02:44:56 +01:00
Martijn Dekker	802136a6ad	Fix goof in regression test (re: `c8147306`)	2021-11-14 12:30:49 +01:00
Martijn Dekker	c81473061a	test/[: binary operators: fix '<' and add '=~'; some more cleanups In ksh88, the test/[ built-in supported both the '<' and '>' lexical sorting comparison operators, same as in [[. However, in every version of ksh93, '<' does not work though '>' still does! Still, the code for both is present in test_binop(): src/cmd/ksh93/bltins/test.c 548: case TEST_SGT: 549: return(strcoll(left, right)>0); 550: case TEST_SLT: 551: return(strcoll(left, right)<0); Analysis: The binary operators are looked up in shtab_testops[] in data/testops.c using a macro called sh_lookup, which expands to a sh_locate() call. If we examine that function in sh/string.c, it's easy to see that on systems using ASCII (i.e. all except IBM mainframes), it assumes the table is sorted in ASCII order. src/cmd/ksh93/sh/string.c 64: while((c= tp->sh_name) && (CC_NATIVE!=CC_ASCII \|\| c <= first)) The problem was that the '<' operator was not correctly sorted in shtab_testops[]; it was sorted immediately before '>', but after '='. The ASCII order is: < (60), = (61), > (62). This caused '<' to never be found in the table. The test_binop() function is also used by [[, yet '<' always worked in that. This is because the parser has code that directly checks for '<' and '>' within [[ (in sh/parse.c, lines 1949-1952). This commit also adds '=~' to 'test', which took three lines of code and allowed eliminating error handling in test_binop() as test/[ and [[ now support the same binary ops. (re: `fc2d5a60`) src/cmd/ksh93//*.[ch]: - Rename a couple of very misleadingly named macros in test.h: . For == and !=, the TEST_PATTERN bit is off for pattern compares and on for literal string compares! Rename to TEST_STRCMP. . The TEST_BINOP bit does not denote all binary operators, but only the logical -a/-o ops in test/[. Rename to TEST_ANDOR. src/cmd/ksh93/bltins/test.c: test_binop(): - Add support for =~. This is only used by test/[. The method is implemented in two lines that convert the ERE to a shell pattern by prefixing it with ~(E), then call test_strmatch with that temporary string to match the ERE and update ${.sh.match}. - Since all binary ops from shtab_testops[] are now accounted for, remove unknown op error handling from this function. src/cmd/ksh93/data/testops.c: - shtab_testops[]: . Correctly sort the '<' (TEST_SLT) entry. . Remove ']]' (TEST_END). It's not an op and doesn't belong here. - Update sh_opttest[] documentation with =~, \<, \>. - Remove now-unused e_unsupported_op[] error message. src/cmd/ksh93/sh/lex.c: sh_lex(): - Check for ']]' directly instead of relying on the removed TEST_END entry from shtab_testops[]. src/cmd/ksh93/tests/bracket.sh: - Add relevant tests. src/cmd/ksh93/tests/builtins.sh: - Fix an old test that globally deleted the 'test' builtin. Delete it within the command substitution subshell only. - Remove the test for non-support of =~ in test/[. - Update the test for invalid test/[ op to use test directly.	2021-11-14 02:46:34 +01:00
Martijn Dekker	6f5c9fea93	test/[: Fix binary -a/-o operators in POSIX mode POSIX requires test "$a" -a "$b" to return true if both $a and $b are non-empty, and test "$a" -o "$b" to return true if either $a or $b is non-empty. In ksh, this fails if "$a" is '!' or '(' as this causes ksh to interpret the -a and -o as unary operators (-a being a file existence test like -e, and -o being a shell option test). $ test ! -a ""; echo "$?" 0 (expected: 1/false) $ set -o trackall; test ! -o trackall; echo "$?" 1 (expected: 0/true) $ test $ -a $; echo "$?" ksh: test: argument expected 2 (expected: 0/true) $ test $ -o $ ksh: test: argument expected 2 (expected: 0/true) Unfortunately this problem cannot be fixed without risking breakage in legacy scripts. For instance, a script may well use test ! -a filename to check that a filename is nonexistent. POSIX specifies that this always return true as it is a test for the non-emptiness of both strings '!' and 'filename'. So this commit fixes it for POSIX mode only. src/cmd/ksh93/bltins/test.c: e3(): - If the posix option is active, specially handle the case of having at least three arguments with the second being -a or -o, overriding their handling as unary operators. src/cmd/ksh93/data/testops.c: - Update 'test --man --' date and say that unary -a is deprecated. src/cmd/ksh93/sh.1: - Document the fix under the -o posix option. - For test/[, explain that binary -a/-o are deprecated. src/cmd/ksh93/tests/bracket.sh: - Add tests based on reproducers in bug report. Resolves: https://github.com/ksh93/ksh/issues/330	2021-11-13 03:43:29 +01:00
Martijn Dekker	568cfdbda7	sh_type(): Do not set POSIX mode when invoked as su On Linux, the 'su' program sets $0 to '-su' when doing 'su -' or 'su - username'. When ksh is the target account's default shell, this caused ksh to consider itself to be launched as a standard POSIX sh, which (among other things) disables the default aliases on interactive shells. This caused confusion for at least one user as they lost their 'history' alias after 'su -': https://www.linuxquestions.org/questions/slackware-14/in-current-with-downgrade-to-ksh93-lost-the-alias-history-4175703408/ bash does not consider itself to be sh when invoked as su, so ksh probably shouldn't, either. The behaviour was also undocumented, making it even more surprising. src/cmd/ksh93/sh/init.c: sh_type(): - Only set the SH_TYPE_POSIX bit if we're invoked as 'sh' (or, on windows, as 'sh.exe').	2021-11-12 04:35:15 +01:00
Johnothan King	3a5752218d	Shorten command name used to test ENAMETOOLONG exit status (#333 ) A change in FreeBSD 13 now causes extremely long command names to exit with errno set to E2BIG if the name can't fit in the list of arguments. This was causing the regression tests for ENAMETOOLONG to fail on FreeBSD 13 because the exit status for these errors differ (ENAMETOOLONG uses status 127 while E2BIG uses status 126). src/cmd/ksh93/tests/path.sh: - To fix the failing regression tests, the command name has been shortened to twice the length of NAME_MAX. This length is still long enough to trigger an ENAMETOOLONG error without causing an E2BIG failure on FreeBSD 13. Fixes https://github.com/ksh93/ksh/issues/331	2021-11-12 04:35:04 +01:00
Martijn Dekker	ca6299ec4b	fix 3 typos: staring -> starting	2021-11-09 13:52:08 +00:00

... 3 4 5 6 7 ...

1072 commits