external/cde - Personal Git space

mirror of git://git.code.sf.net/p/cdesktopenv/code synced 2025-03-09 15:50:02 +00:00

Author	SHA1	Message	Date
Martijn Dekker	dde8da6769	Fix history comment character in history expansion (re: `41ee12a5`) Reproducer: on an interactive shell with the -H option on, $ v=foo $ print ${#v} does not print anything (but should print "3"). The 'print' line also is not added to the history. This bug was exposed by commit `41ee12a5` which enabled the history comment character by default, setting it to '#' as on bash. When it was disabled by default, this bug was rarely exposed. The problem happens here: src/cmd/ksh93/edit/hexpand.c 203: if(hc[2] && cp == hc[2]) / history comment designator, skip rest of line / 204: { 205: stakputc(cp++); 206: stakputs(cp); 207: DONE(); 208: } The DONE() macro sets the HIST_ERROR bit flag: src/cmd/ksh93/edit/hexpand.c 45: #define DONE() {flag \|= HIST_ERROR; cp = 0; stakseek(0); goto done;} For the history comment character that is clearly wrong, as no error has occurred. There is another problem. The documentation I added for history expansion states this bit, which is based on bash's behaviour: If a word on a command line begins with the history comment character #, history expansion is ignored for the rest of that line. With an expansion like ${#v}, the word does not begin with # so history expansion should not have parsed that as a comment character. The intention was to act like bash. src/cmd/ksh93/edit/hexpand.c: - Split the DONE() macro into DONE and ERROROUT of which only the latter sets the HIST_ERROR bit flag. Change usage accordingly. Thix fixes the first problem. - Don't make these new macros function-style macros (with ()) as they end in a goto, so that's a bit misleading. - Add is_wordboundary() which makes a best-effort attempt to determine if the character following the specified character is considered to start a new word by shell grammar rules. Word boundary characters are whitespace and: \|&;()`<> - Only recognise the history comment character if is_wordbounary() returns true for the previous character. This fixes the second problem. Thanks to @jghub for the bug report. Resolves: https://github.com/ksh93/ksh/issues/513	2022-08-16 22:00:20 +01:00
Martijn Dekker	ebdcfcbb62	Release 1.0.2 (re: `cd063869`) This release fixes the interactive shell crashing when one of the predefined aliases (currently 'history' and 'r') is redefined, whether from a profile/kshrc script or manually. This crash occurred in two scenarios: 1. when redefining and then unsetting a predefined alias; 2. when redefining a predefined alias and then executing a shell script that does not begin with a #! path. Both are fixed now.	2022-08-08 10:49:58 +01:00
Martijn Dekker	cd0638690c	Fix crashes on redefining/unsetting predefined alias (re: `7e7f1372`) Reproducer 1: $ alias r r='hist -s' $ alias r=foo $ unalias r ksh(10127,0x10d6c35c0) malloc: * error for object 0x7ffdcd404468: pointer being freed was not allocated ksh(10127,0x10d6c35c0) malloc: * set a breakpoint in malloc_error_break to debug Abort The crash happens as unall() (typeset.c) calls nv_delete() (name.c) which tries to free a node pointer that was not directly allocated. Reproducer 2: $ ENV=/./dev/null ksh $ echo : >script $ chmod +x script $ alias r=foo $ ./script ksh(10193,0x10c8505c0) malloc: * error for object 0x7fa136c04468: pointer being freed was not allocated ksh(10193,0x10c8505c0) malloc: * set a breakpoint in malloc_error_break to debug Abort This crash happens for the same reason, but in another location, namely in sh_reinit() (init.c) as it is freeing up the alias table before executing a script that does not start with a #! path. This is a serious bug because it is not uncommon for .kshrc or .profile scripts to (re)define an alias called 'r'. Analysis: These crashes happen because the incorrectly freed node pointer is part of a larger block of nodes initialised by sh_inittree() in init.c. That function allocates all the nodes for a table (see data/{aliases,builtins,variables}.c) in a contiguous block addressable by numeric index (see builtins.h and variables.h for how that is used). So, while the value of the alias is correctly marked NV_NOFREE and is not freed, that attribute does not apply to the node pointer itself, which also is not freeable. Thus, if the value is replaced by a freeable one, the node pointer is incorrectly freed upon unaliasing it, and the shell crashes. The simplest fix is to allocate each predefined alias node individually, like any other alias -- because, in fact, we do not need the predefined alias nodes to be in a contiguous addressable block; there is nothing that specifically addresses these aliases. src/cmd/ksh93/sh/main.c: sh_main(): - Instead of calling sh_inittree(), use a dedicated loop to allocate each predefined alias node individually, making each node invidually freeable. The value does remain non-freeable, but the NV_NOFREE attribute correctly takes care of that. src/cmd/ksh93/bltins/typeset.c: - Get rid of the incomplete and now unnecessary workarounds in unall() and sh_reinit(). Thanks to @jghub and @JohnoKing for finding and reporting the bug. Discussion: https://github.com/ksh93/ksh/discussions/503#discussioncomment-3337172	2022-08-06 11:54:33 +01:00
Martijn Dekker	9c9743998f	Revert buggy exec optimisation (re: `d6c9821c`) As of 2022-06-18, ksh 93u+m is not capable of being used as /bin/sh while building GNU binutils. The execution of some of its build system's dot scripts is incorrectly aborted as an external 'sed' command is execve(2)'d without forking. This means that incorrect exec optimization was happening. Unfortunately I have not been able to derive a minimal reproducer of the problem yet because the GNU binutils build scripts are very complex. Pending further research, the optimisation is reverted. Even if a way to make it work is found, it will not be reintroduced to the 1.0 branch. Thanks to @atheik for finding the problem and identifying the commit that introduced it. Resolves: https://github.com/ksh93/ksh/issues/507	2022-08-05 12:52:49 +02:00
Martijn Dekker	3a25aa0d93	Release 93u+m/1.0.0 _ _ ___ _____ ___ ___ ___ \| \| _____\| \|__ / _ \___ / _ _ _ _ __ ___ / / \| / _ \ / _ \ \| \|/ / __\| '_ \ \| (_) \|\|_ \\| \| \| \|_\| \|_\| '_ ` _ \ / /\| \|\| \| \| \| \| \| \| \| <\__ \ \| \| \| \__, \|__) \| \|_\| \|_ _\| \| \| \| \| \|/ / \| \|\| \|_\| \| \|_\| \| \|_\|\_\___/_\| \|_\| /_/____/ \__,_\| \|_\| \|_\| \|_\| \|_/_/ \|_(_)___(_)___/ It may have taken exactly a decade, but here we are... a proper new ksh release. :) Many thanks to all contributors for their hard work! Compared to an unpatched 93u+, this release has roughly a thousand bugs fixed. It incorporates a fair number of enhancements as well. Not all known bugs have been worked out yet; see the TODO file. Let's hope this release will rekindle interest and attract more bug hunters. This commit also makes some very minor fixes in comments. Notable: src/cmd/ksh93/sh/arith.c: sh_strnum(): - Update a security-related comment. As of `b48e5b33`, evaluating untrusted arithmetic expressions from the environment should no longer cause CVE-2019-14868. But let's keep disallowing it anyway. Resolves: https://github.com/ksh93/ksh/issues/491	2022-08-01 21:44:02 +02:00
Martijn Dekker	9c69fac793	Re-backport atomic job locking from 93v- (re: `52067c3d`, `595a0a56`) [This commit was previously reverted because it seemed to cause the build to fail on Cygwin. But I just re-tested it, and it's fine. It may be that my Cygwin installation at the time was defective.] Something similar was previously done in `07cc71b8` from a Debian patch, and eventually reverted; it redefined the ast atomic functions asoincint() and asodecint() to be gcc-specific. This imports the upstream version from the ksh 93v- beta instead. This commit is based on an OpenSUSE patch: https://build.opensuse.org/package/view_file/shells/ksh/ksh93-joblock.dif src/cmd/ksh93/include/jobs.h: - Replace job locking mechanism with the 93v- version which uses the atomic libast functions asoincint(), asogetint() and asodecint(). See: src/lib/libast/man/aso.3 src/cmd/ksh93/sh/jobs.c: job_subsave(): - Revert gcc optimiser bug workaround from `c258a04f`. It should now be unnecessary.	2022-08-01 05:41:52 +02:00
Martijn Dekker	38acc02e78	nv_newattr(): Fix out-of-bounds read (re: `bdb99741`) When converting from -Z to another attribute, if the value of the -Z variable was empty (so -Z0 and an empty value), the loop that skips initial zeros may read before the beginning of the buffer: 2968: if(nv_isattr(np,NV_ZFILL)) 2969: { 2970: while(sp=='0') sp++; / skip initial zeros / 2971: if(!sp) sp--; /* if number was 0, leave one zero / 2972: } If the sp value is empty (just a terminating zero byte), line 2970 does nothing, but line 2971 still decrases the pointer, to before the beginning of the buffer. The fix is to check for an initial zero before running that block. Reproducer (fails intermittently, depending on garbage before *sp): typeset -Z foo= typeset -i foo	2022-07-31 04:13:42 +02:00
Martijn Dekker	9f6841c37e	Fix "$" doing pattern matching if $IFS is wildcard (BUG_IFSGLOBS) The bug is that "$", and related expansions such as "${arr[]}", etc., do pattern matching if the first character of $IFS is a wildcard. For example, the following: IFS= set -- F '' case BUGFREE in BUG"$") echo bug ;; esac outputs 'bug'. This bug can be reproduced in every other glob pattern matching context as well, but not in pathname expansion. src/cmd/ksh93/sh/macro.c: varsub(): - When joining fields into one for a "$"-type expansion, check if a glob pattern matching operation follows (mp->pattern is set). If so, write a preceding backslash to escape the separator. Resolves: https://github.com/ksh93/ksh/issues/489 Resolves: https://github.com/att/ast/issues/12	2022-07-28 23:36:39 +02:00
Martijn Dekker	441dcc0483	Upgrade licence to EPL 2.0 EPL 1.0 says, in section 7: "The Program (including Contributions) may always be distributed subject to the version of the Agreement under which it was received. In addition, after a new version of the Agreement is published, Contributor may elect to distribute the Program (including its Contributions) under the new version." The Eclipse Foundation also encourage everyone to upgrade: https://www.eclipse.org/legal/epl-2.0/faq.php#h.60mjudroo8e5 https://www.eclipse.org/legal/epl-2.0/faq.php#h.tci84nlsqpgw Unfortunately the new Secondary License option is not available to us as we're not the original copyright holders and don't have the legal power to add one. So, no GPL compatibility. Sorry.	2022-07-28 05:46:08 +02:00
Martijn Dekker	48f78227a9	Fix $PPID for hashbangless script (re: `232b7bff`) I made a mistake in sh.reinit() which caused $PPID to be set to the new process ID, not the parent process ID. This commit fixes it by introducing, updating and using sh.current_ppid, so we continue to minimise context switches due to getpid(2)/getppid(2) system calls. Thanks to Geoff Clare for the report.	2022-07-27 22:48:57 +02:00
Martijn Dekker	592ce7415a	test/[/[[: Fix incorrect 0-skipping, float precision (re: `c734568b`) The arguments to the binary numerical comparison operators (-eq, -gt, etc.) in the [[ and test/[ commands are treated as arithmetic expressions, even if $((...)) is not used. But there is some seriously incorrect behaviour: Reproducers (all should output 0/true): $ [[ 0x0A -eq 10 ]]; echo $? 1 $ [[ 1+0x0A -eq 11 ]]; echo $? 0 $ (set --posix; [[ 010 -eq 8 ]]); echo $? 1 $ (set --posix; [[ +010 -eq 8 ]]); echo $? 0 $ [[ 0xA -eq 10 ]]; echo $? 1 $ xA=10; [[ 0xA -eq 10 ]]; echo $? 0 $ xA=WTF; [[ 0xA -eq 10 ]]; echo $? ksh: WTF: parameter not set (POSIX mode enables the recognition of the initial 0 as a prefix indicating an octal number in arithmetic expressions.) The cause is the two 'while' loops in this section in test_binop() in src/cmd/ksh93/bltins/test.c: 502: if(op&TEST_ARITH) 503: { 504: while(left=='0') 505: left++; 506: while(right=='0') 507: right++; 508: lnum = sh_arith(left); 509: rnum = sh_arith(right); 510: } So initial zeros are unconditionally skipped. Ostensibly this is to disable the recognition of the initial zero as an octal prefix as well as 0x as a hexadecimal prefix. This would be okay for enforcing a decimal-only limitation for simple numbers, but to do this for arithmetic expressions is flagrantly incorrect, to say the least. (insert standard rant about AT&T quality standards here) The fix for '[[' is simply to delete the two 'while' loops. But that creates a problem for the deprecated-but-standard 'test'/'[' command, which also uses the test_binop() function. According to POSIX, test/[ only parses simple decimal numbers, so octal, etc. is not a thing. But, after that fix, 'test 08 -eq 10' in POSIX mode yields true, which is unlike every other shell. (Note that this is unlike [[ 08 -eq 10 ]], which yields true on bash because '[[' parses operands as arithmetic expressions.) For test/[ in non-POSIX mode, we don't need to change anything. For POSIX mode, we should only parse literal decimal numbers for these operators in test/[, disallowing unexpanded arithmetic expressions. This makes ksh's POSIX-mode test/[ act like every other shell and like external .../bin/test commands shipped by OSs. src/cmd/ksh93/bltins/text.c: test_binop(): - Correct a type mismatch. The left and right hand numbers should be Sfdouble_t, the maximum double length on the current system, and the type that sh_arith() returns. Instead, long double (typeset -lF) values were reduced to double (typeset -F) before comparing! - For test/[ in POSIX, only accept simple decimal numbers: use strtold() instead of sh_arith(). Do skip initial zeros here as this disables the recognition of the hexadecimal prefix. Throw an error on an invalid decimal number. Floating point constants are still parsed, but that's fine as this does not cause any incompatibility with POSIX. - For code legibility, move handling of TEST_EQ, etc. to within the if(op&TEST_ARITH) block. This also allows lnum and rnum to be local to that block.	2022-07-26 21:33:00 +02:00
Martijn Dekker	b7817c3750	Release 93u+m/1.0.0-rc.1 We're nearly there! I intend to release ksh 93u+m/1.0.0 on 2022-08-01, precisely ten years after the last canonical 93u+ release. We have a week until then, so here's a release candidate. Please try as hard as you can to break it, and to help fix known bugs. As of this commit, the 1.0 branch is feature-frozen and will only get bugfixes. src/cmd/ksh93/fun/man: - Last-minute fix: .man.try_os_man(): do not look for arguments with / in section 1 and 8; this can cause false positives.	2022-07-25 04:03:16 +02:00
Martijn Dekker	bb4f23e63f	set -b/--notify: do not mess up the command line This commit makes three interrelated changes. First, the code for erasing the command line before redrawing it upon a window size change is simplified and modernised. Instead of erasing the line with lots of spaces, it now uses the sequence obtained from 'tput ed' (usually ESC, '[', 'J') to "erase to the end of screen". This avoids messing up the detection and automatic redrawing of wrapped lines on terminals such as Apple Terminal.app. Second, the -b/--notify option is made more usable. When it is on and either the vi or emacs/gmacs line editor is in use, 'Done' and similar notifications are now buffered and trigger a command line redraw as if the window size changed, and the redraw routine prints that notify buffer between erasing and redrawing the commmnd line. The effect is that the notification appears to magically insert itself directly above the line you're typing. (The notification behaviour when not in the shell line editor, e.g. while running commands such as external editors, is unchanged.) Third, a bug is fixed that caused -b/--notify to only report on one job when more than one terminated at the same time. The rest was reported on the next command line as if -b were not on. Reproducer: $ set -b; sleep 1 & sleep 1 & sleep 1 & This commit also includes a fair number of other window size and $COLUMNS/$LINES handling tweaks that made all this easier, not all of which are mentioned below. src/cmd/ksh93/include/fault.h, src/cmd/ksh93/sh/fault.c: - Replace sh_update_columns_lines with a new sh_winsize() function. It calls astwinsize() and is to be used instead of it, and instead of nv_getval(LINES) and nv_getval(COLUMNS) calls. It: - Allows passing one or neither of lines or cols pointers. - Updates COLUMNS and LINES, but only if they actually changed from the last values. This makes .set discipline functions defined for these variables more useful. - Sets the sh.winch flag, but only if COLUMNS changes. If only the height changes, the command line does not need redrawing. src/cmd/ksh93/include/io.h: - Add sh_editor_active() that allows checking whether one of vi, emacs or gmacs is active without onerous #if SHOPT_* directives. src/cmd/ksh93/sh/jobs.c: job_reap(): - Remove the fix backported in `fc655f1a`, which was really just a workaround that papered over the real bug. - Move a check for errno==ECHILD so it is only done when waitpid() returns an error (pid < 0); the check was not correct because C library functions that do not error out also do not change errno. - Move the SH_NOTIFY && SH_TTYWAIT code section to within the while(1) loop so it is run for each job, not only the last-processed one. This fixes the bug where only one job was notified when more than one ended at the same time. - In that section, check if an editor is active; if so, set the output file for job_list() to sh.notifybuffer instead of standard error, list the jobs without the initial newline (JOB_NLFLAG), and trigger a command line redraw by setting sh.winch. src/cmd/ksh93/edit/edit.c: - Obtain not just CURSOR_UP but also ERASE_EOS (renamed from KILL_LINE) using 'tput'. The latter had the ANSI sequence hardcoded. Define a couple of TPUT_* macros to make it easier to deal with terminfo/termcap codes. - Add get_tput() to make it easier to get several tput values robustly (with SIGINT blocked, trace disabled, etc.) - ed_crlf(): Removed. Going by those ancient #ifdefs, nothing that 93u+m will ever run on requires a '\r' before a '\n' to start a new line on the terminal. Plus, as of 93u+, there were already several spots in emacs.c and vi.c where it printed a sole '\n'. - ed_read(): - Simplify/modernise command line erase using ERASE_EOS. - Between erasing and redrawing, print the contents of the notify buffer. This has the effect of inserting notifications above the command line while the user is typing. src/cmd/ksh93/features/cmds: - To detect terminfo vs termcap codes, use all three codes we're currently using. This matters on at least on my system (macOS 10.14.6) in which /usr/bin/tput has incomplete terminfo support (no 'ed') but complete termcap support!	2022-07-24 04:11:12 +02:00
Martijn Dekker	948fab26aa	Fix: running external command influences $RANDOM sequence The pseudorandom generator generates a reproducible sequnece of values after seeding it with a known value. But: $ (RANDOM=1; echo $RANDOM; echo $RANDOM) 2100 18270 $ (RANDOM=1; echo $RANDOM; ls >/dev/null; echo $RANDOM) 2100 30107 Since RANDOM was seeded with a specific value, the two command lines should output the same pair of numbers. Running 'ls' in the middle should make no difference. The cause is a nv_getval(RANDNOD) call in xec.c, case TFORK, that is run for all TFORK cases, in the parent process -- including background jobs and external commands. What should happen instead is that $RANDOM is reseeded in the child process. This bug is in every version of ksh93 since before 1995. There is also an opportunity for optimisation. As of `396b388e`, the RANDOM seed may be invalidated by setting rand_last to -2, triggering a reseed the next time a $RANDOM value is obtained. This was done to optimise the virtual subshell mechanism. But that can also be used to eliminate unconditional reseeding elsewhere. So as of this commit, RANDOM is never (re)seeded until it's used. src/cmd/ksh93/include/variables.h, src/cmd/ksh93/sh/subshell.c: - Add RAND_SEED_INVALIDATED macro, a single source of truth for the value that triggers a reseeding in sh_save_rand_seed(). - Add convenient sh_invalidate_rand_seed() function macro. src/cmd/ksh93/sh/init.c, src/cmd/ksh93/sh/xec.c: - Optimisation: invalidate seed instead of reseeding directly. - sh_exec(): case TFORK: Delete the nv_getval(RANDNOD) call. Add a sh_invalidate_rand_seed() to the child part. This fixes the bug.	2022-07-21 01:01:32 +02:00
Martijn Dekker	adc6a64b82	Fix bad job control msg on trapped SIGINT and set -b (re: `fc655f1`) When running an external command while trapping Ctrl+C via SIGINT, and set -b is on, then a spurious Done job control message is printed. No background job was executed. $ trap 'ls' INT $ set -b $ <Ctrl+C>[file listing follows] [1] + Done set -b In jobs.c (487-493), job_reap() calls job_list() to list a running or completed background job, passing the JOB_NFLAG bit to only print jobs with the P_NOTIFY flag. But the 'ls' in the trap is not a background job. So it is getting the P_NOTIFY flag by mistake. In fact all processes get the P_NOTIFY flag by default when they terminate. Somehow the shell normally does not follow a code path that calls job_list() for foreground processes, but does when running one from a trap. I have not yet figured out how that works. What I do know is that there is no reason why non-background processes should ever have the P_NOTIFY flag set on termination, because those should never print any 'Done' messages. And we seem to have a handy P_BG flag that is set for background processes; we can check for this before setting P_NOTIFY. The only thing is that flag is only compiled in if SHOPT_BGX is enabled, as it was added to support that functionality. For some reason I am unable to reproduce the bug in a pty session, so there is no pty.sh regression test. src/cmd/ksh93/sh/jobs.c: - Rename misleadingly named P_FG flag to P_MOVED2FG; this flag is not set for all foreground processes but only for processes moved to the foreground by job_switch(), called by the fg command. - Compile in the P_BG flag even when SHOPT_BGX is not enabled. We need to set this flag to check for a background job. - job_reap(): Do not set the P_NOTIFY flag for all terminated processes, but only for those with P_BG set. src/cmd/ksh93/sh/xec.c: sh_fork(): - Also pass special argument 1 for background job to job_post() if SHOPT_BGX is not enabled. This is what gets it to set P_BG. - Simplify 5 lines of convoluted code into 1. Resolves: https://github.com/ksh93/ksh/issues/481	2022-07-14 07:02:30 +02:00
Martijn Dekker	ffee9100d5	Robustify ${.sh.level} scope switching (re: `69d37d5e`, `e1c41bb2`) Switching the function scope to a parent scope by assigning to .sh.level (SH_LEVELNOD) leaves the shell in an inconsistent state, causing invalid-free and/or use-after-free bugs. The intention of .sh.level was always to temporarily switch scopes inside a DEBUG trap, so this commit minimises the pitfalls and instability by imposing some sensible limitations: 1. .sh.level is now a read-only variable except while executing a DEBUG trap; 2. while it's writeable, attempts to unset .sh.level or to change its attributes are ignored; 3. attempts to set a discipline function for .sh.level are ignored; 4. it is an error to set a level < 0 or > the current scope. Even more crashing bugs are fixed by simplifiying the handling and initialisation of .sh.level and by exempting it completely from virtual subshell scoping (to which it's irrelevant). TODO: one thing remains: scope corruption and use-after-free happen when using the '.' command inside a DEBUG trap with ${.sh.level} changed. Behaviour same as before this commit. To be investigated. All changed files: - Consistently use the int16_t type for level values as that is the type of its non-pointer storage in SH_LEVELNOD. - Update .sh.level by using an update_sh_level() macro that assigns directly to the node value, then restores the scope if needed. - To eliminate implicit typecasts, use the same int16_t type (the type used by short ints such as SH_LEVELNOD) for all variables containing a function and/or dot script level. src/cmd/ksh93/include/variables.h: - Add update_sh_level() macro. src/cmd/ksh93/include/name.h, src/cmd/ksh93/sh/macro.c: - Add a nv_nonptr() macro that checks attributes for a non-pointer value -- currently only signed or unsigned short integer value, accessed via the 's' member of 'union Value' (e.g. np->nvalue.s). - nv_isnull(): To avoid undefined behaviour, check for attributes indicating a non-pointer value before accessing the nvalue.cp pointer (re: `5aba0c72`). - varsub(): In the set/unset check, remove the now-redundant exception for SH_LEVELNOD. src/cmd/ksh93/data/variables.c, src/cmd/ksh93/sh/init.c: - shtab_variables[]: Make .sh.level a read-only short integer. - sh_inittree(): To avoid undefined behaviour, do not assign to the 'union Value' char pointer if the attribute indicates a non- pointer short integer value. Instead, the table value is ignored. src/cmd/ksh93/sh/subshell.c: sh_assignok(): - Never create a subshell scope for SH_LEVELNOD. src/cmd/ksh93/sh/xec.c: - Get rid of 'struct Level' and its maxlevel member. This was only used in put_level() to check for an out of range assignment, but this can be trivially done by checking sh.fn_depth+sh.dot_depth. - This in turn allows further simplification that reduces init for .sh.level to a single nv_disc() call in sh_debug(), so get rid of init_level(). - put_level(): Throw a "level out of range" error if assigned a wrong level. - sh_debug(): - Turn off the NV_RDONLY (read-only) attribute for SH_LEVELNOD while executing the DEBUG trap. - Restore the current scope when trap execution is finished. - sh_funct(): Remove all .sh.level handling. POSIX functions (and dot scripts) already handle it in b_dot_cmd(), so sh_funct(), which is used by both, is the wrong place to do it. - sh_funscope(): Update .sh.level for ksh syntax functions here instead. Also, do not bother to initialise its discipline here, as it can now only be changed in a DEBUG trap. src/cmd/ksh93/bltins/typeset.c: setall(): - When it's not read-only, ignore all attribute changes for .sh.level, as changing the attributes would crash the shell. src/cmd/ksh93/sh/nvdisc.c: nv_setdisc(): - Ignore all attempts to set a discipline function for .sh.level, as doing this would crash the shell. src/cmd/ksh93/bltins/misc.c: b_dot_cmd(): - Bug fix: also update .sh.level when quitting a dot script. src/cmd/ksh93/sh/name.c: - _nv_unset(): - To avoid an inconsistent state, ignore all attempts to unset .sh.level. - To avoid undefined behaviour, do not zero np->nvalue.cp if attributes for np indicate a non-pointer value (the actual bit value of a null pointer is not defined by the standard, so there is no guarantee that zeroing .cp will zero .s). - sh_setscope(): For consistency, always set error_info.id (the command name for error messages) to the new scope's cmdname. Previously this was only done for two calls of this function. - nv_name(): Fix a crashing bug by checking that np->nvname is a non-null pointer before dereferencing it. src/cmd/ksh93/include/nval.h: - The NV_UINT16P macro (which is unsigned NV_INT16P) had a typo in it, which went unnoticed for many years because it's not directly used (though its bit flags are set and used indirectly). Let's fix it anyway and keep it for completeness' sake.	2022-07-13 23:11:18 +02:00
Martijn Dekker	7a01d6df47	history: fix out-of-bounds read on retrieving empty line Reproducer: Compile a ksh with AddressSanitizer. In that ksh, edit the last command line with 'fc', insert an empty line at the start, and save. Now use the up-arrow to retrieve the empty line. Ksh aborts on history.c line 1011 as hist_copy() tries to read before the beginning of the buffer pointed to by s1. src/cmd/ksh93/edit/history.c: hist_copy(): - Verify that the s1 pointer was increased from the original s1 before trying to read the character *(s1-1).	2022-07-10 21:18:49 +02:00
Martijn Dekker	1934686de3	Fix oddly specific syntax error corrupting subsequent [[ ... ]] Reproducer: $ x=([x]=1 [y) -ksh: syntax error: `)' unexpected $ [[ -z $x ]] -ksh: [[ -z ]]: not found Any '[[' command following that syntax error will fail similarly; the whole of it (after variable expansion) is incorrectly looked up as a command name. The syntax error must be generated by an associative array assignment (with or without an explicit typeset -A) with at least one valid assignment element followed by an invalid assignment element starting with '[' but not containing ']='. This seems to be another bug that is in every ksh93 version ever. I've confirmed that ksh 1993-12-28 s+ and ksh2020 fail identically. Presumably, so does everything in between. Analysis: The syntax error function, sh_syntax(), calls lexopen() in mode 0 to reset the lexer state. There is a variable that isn't getting reset there though it should be. Using systematic elimination I found that the variable that needs to be reset is lp->assignok (set "when name=value is legal"). If it is set, '[[' is not processed. src/cmd/ksh93/sh/lex.c: lexopen(): - Reset 'assignok' in the lexer state (regardless of mode). - In the mode 0 total lexer state reinit, several members of lexd (struct _shlex_pvt_lexdata_) were not getting reset; just memset the whole thing to zero. Note for backporters: this change requires commit `da97587e` to be correct. That commit took the stack size and pointer (lex_max and *lex_match) out of this struct; those should not be reset! Resolves: https://github.com/ksh93/ksh/issues/486	2022-07-09 23:00:11 +02:00
Martijn Dekker	06e56251b9	Fix wrong syntax error upon process substitution after redirection Grammatically, redirections may occur anywhere within a command line and are removed after processing them, whereas a process substitution (<(commandlist) or >(commandlist)) is replaced by a file name which should be treated as just another simple word. So the following should not be a syntax error: $ cat </dev/null <(true) -ksh: syntax error: `)' unexpected $ cat </dev/null >(true) -ksh: syntax error: `)' unexpected $ cat >/dev/null <(true) -ksh: syntax error: `)' unexpected $ cat >/dev/null >(true) -ksh: syntax error: `)' unexpected This bug is in every ksh93 version. The problem is in the parser (parse.c). The process substitution is misparsed as a redirection due to inout() recursively parsing multiple redirections without recognising process substitutions. inout() is mistaking '<(' for '<' and '>(' for '>', which explains the incorrect syntax error. This also causes the following to fail to detect a syntax error: $ cat >&1 <(README.md [the contents of README.md are shown] ...and other syntax errors detected in the wrong spot, for example: $ { true; } <(echo wrong) -ksh: syntax error: `wrong' unexpected which should be: -ksh: syntax error: `<(' unexpected src/cmd/ksh93/sh/parse.c: - Add global inout_found_procsub flag. - inout(): On encountering a process substitution, set this flag and return, otherwise clear the flag. - simple(): After calling inout(), check this flag and, if set, jump back to where process substitutions are parsed. Resolves: https://github.com/ksh93/ksh/issues/418	2022-07-05 13:20:28 +02:00
Martijn Dekker	4df6d674a0	Fix signal exit status of last command in subshell (re: `b3050769`) Reproducer (on macOS/*BSD where SIGUSR1 has signal number 30): $ ksh -c '(sh -c '\''kill -s USR1 $$'\''); echo $?' ksh: 54220: User signal 1 30 Expected output for $?: 286, not 30. The signal is not reflected in the 9th bit of the exit status. This bug was introduced for virtual subshells in `b3050769` but exists in every ksh93 version for real (forked) subshells: $ ksh -c '(ulimit -t unlimited; trap : EXIT; \ sh -c '\''kill -s USR1 $$'\''); echo $?' ksh: 54267: User signal 1 30 (As of `d6c9821c`, a dummy trap is needed to trigger the bug, or it will be masked by the exec optimization for the sh invocation.) This is caused by the exit status being masked to 8 bits when a subshell terminates. For a real subshell, this is inevitable as the kernel does this. As of `b3050769`, virtual subshells behave in a manner consistent with real subshells in this regard. However, for both virtual and real subshells, if its last command was terminated by a signal, then that should still be reflected in the 9th bit of ksh's exit stauts. The root of the problem is that ksh simply cannot rely internally on the 9th bit of the exit status to determine if a command exited due to a signal. The 9th bit may be trimmed by a subshell or may be set by 'return' without a signal being involved. This commit fixes it by introducing a separate flag which will be a reliable indicator of this. src/cmd/ksh93/include/shell.h: - Add sh.chldexitsig flag (set if the last command was a child process that exited due to a signal). src/cmd/ksh93/sh/jobs.c: job_wait(): - When the last child process exited due to a signal, not only set the 9th (SH_EXITSIG) bit of sh.exitval but also sh.chldexitsig. src/cmd/ksh93/sh/subshell.c: sh_subshell(): - Fix the virtual subshell reproducer above. After trimming the exit status to 8 bit, set the 9th bit if sh.chldexitsig is set. This needs to be done in two places: one that runs in the parent process after sh_subfork() and one for the regular virtual subshell exit. src/cmd/ksh93/sh/fault.c: - sh_trap(): Save and restore sh.chldexitsig so that this fix does not get deactivated if a trap is set. - sh_done(): - Fix the real subshell reproducer above. When the last command of a real subshell is a child process that exited due to a signal (i.e., if (sh.chldexitsig && sh.realsubshell)), then activate the code to pass down the signal to the parent process. Since there is no way to pass a 9-bit exit status to a parent process, this is the only way to ensure a correct exit status in the parent shell environment. - When exiting the main shell, use sh.chldexitsig and not the unreliable SH_EXITSIG bit to determine if the 8th bit needs to be set for a portable exit status indicating its last command exited due to a signal.	2022-07-03 12:49:36 +02:00
Martijn Dekker	372d704bfb	{edit,fault}.c: improve SIGWINCH and SIGINT handling The fault.c and edit.c changes in this commit were inspired by changes in the 93v- beta but take a slightly different approach: mainly, the code to update $COLUMNS and $LINES is put in its own function instead of duplicated in sh_chktrap() and ed_setup(). src/cmd/ksh93/sh/fault.c: - Move code to update $COLUMNS and $LINES to a new sh_update_columns_lines() function so it can be reused. - Fix compile error on systems without SIGWINCH. src/cmd/ksh93/edit/edit.c: - ed_setup(): Call sh_update_columns_lines() instead of issuing SIGWINCH to self. - Change two sh_fault(SIGINT) calls to issuing the signal to the current process using kill(2). sh_fault() is now never called directly (as in the 93v- beta). src/cmd/ksh93/sh/main.c: sh_main(): - On non-interactive, call sh_update_columns_lines() and set the signal handler for SIGWINCH to sh_fault(), causing $COLUMNS and $LINES to be kept up to date when the terminal window is resized (this is handled elsewhere for interactive shells). This change makes ksh act like mksh, bash and zsh. (Previously, ksh required setting a dummy SIGWINCH trap to get auto-updated $COLUMNS and $LINES in scripts, as this set the SIGWINCH signal handler to sh_fault(). This persisted even after unsetting the trap again, so that was inconsistent behaviour.) src/cmd/ksh93/include/shell.h: - Don't define sh.winch on systems without SIGWINCH. src/cmd/ksh93/sh.1: - Update and tweak the COLUMNS and LINES variable documentation. - Move them up to the section of variables that are set by the shell (which AT&T should have done before).	2022-07-01 22:49:40 +02:00
Martijn Dekker	416a412d71	Fix seeking on block devices with FD 0, 1 or 2 @stephane-chazelas reports: > A very weird issue: > > To reproduce on GNU/Linux (here as superuser) > > # truncate -s10M file > # export DEV="$(losetup -f --show file)" > # ksh -c 'exec 3<> "$DEV" 3>#((0))' # fine > # ksh -c 'exec 1<> file 1>#((0))' # fine > # ksh -c 'exec 1<> "$DEV" 1>#((0))' > ksh: 0: invalid seek offset > > Any seek offset is considered "invalid" as long as the file is a > block device and the fd is 0, 1 or 2. It's fine for fds above 2 > and it's fine with any fd for regular files. Apparently, block devices are not seekable with sfio. In io.c there is specific code to avoid using sfio's sfseek(3) if there is no sfio stream in sh.sftable[] for the file descriptor in question: 1398: Sfio_t sp = sh.sftable[fn]; [...] 1420: if(sp) 1421: { 1422: off=sfseek(sp, off, SEEK_SET); 1423: sfsync(sp); 1424: } 1425: else 1426: off=lseek(fn, off, SEEK_SET); For file descriptors 0, 1 or 2 (stdin/stdout/stderr), there is a sh.sftable[] stream by default, and it is marked as not seekable. This makes it return -1 in these lines in sfseek.c, even if the system call called via SFSK() succeeds: 89: if(f->extent < 0) 90: { / let system call set errno */ 91: (void)SFSK(f,(Sfoff_t)0,SEEK_CUR,f->disc); 92: return (Sfoff_t)(-1); 93: } ...which explains the strange behaviour. src/lib/libast/sfio/sfseek.c: sfseek(): - Allow for the possibility that the fallback system call might succeed: let it handle both errno and the return value. Resolves: https://github.com/ksh93/ksh/issues/318	2022-06-28 22:18:46 +02:00
Martijn Dekker	d8dc2a1d81	sh_setenviron(): deactivate compound assignment prefix Reproducers: $ ksh -c 'typeset -a arr=( ( (a $(($(echo 1) + 1)) c)1))' ksh: echo: arr[0]._AST_FEATURES=CONFORMANCE - ast UNIVERSE - ucb: cannot be an array ksh: [1]=1: invalid variable name $ ksh -c 'typeset -a arr=( (a $(($(echo 1) + 1)) c)1)' ksh: echo: arr._AST_FEATURES=CONFORMANCE - ast UNIVERSE - ucb: is not an identifier ksh: [1]=1: invalid variable name src/cmd/ksh93/sh/name.c: sh_setenviron(): - Save and clear the current compound assignment prefix (sh.prefix) while assigning to the _AST_FEATURES variable.	2022-06-23 03:34:16 +01:00
Martijn Dekker	225323f138	Fix more "/dev/null: cannot create" (re: `411481eb`) Reproducer: trap : USR1 while :; do kill -s USR1 $$ \|\| exit; done & while :; do : >/dev/null; done It can take between a fraction of a second and a few minutes, but eventually it will fail like this: $ ksh foo foo[3]: /dev/null: cannot create kill: 77946: no such process It fails similarly with "cannot open" if </dev/null is used instead of >/dev/null. This is the same problem as in the referenced commit, except when handling traps -- so the same fix is required in sh_fault().	2022-06-20 19:39:00 +01:00
Martijn Dekker	16b3802148	Fix incorrect exec optimisation with monitor/pipefail on Reproducer script: tempfile=/tmp/out2.$$.$RANDOM bintrue=$(whence -p true) for opt in monitor pipefail do ( set +x -o "$opt" ( sleep .05 echo "ok $opt" >&2 ) 2>$tempfile \| "$bintrue" ) & wait cat "$tempfile" rm -f "$tempfile" done Expected output: ok monitor ok pipefail Actual output: (none) The 'monitor' and 'pipefail' options are supposed to make the shell wait for the all commands in the pipeline to terminate and not only the last component, regardless of whether the pipe between the component commands is still open. In the failing reproducer, the dummy external true command is subject to an exec optimization, so it replaces the subshell instead of forking a new process. This is incorrect, as the shell is no longer around to wait for the left-hand part of the pipeline, so it continues in the background without being waited for. Since it writes to standard error after .05 seconds (after the pipe is closed), the 'cat' command reliably finds the temp file empty. Without the sleep this would be a race condition with unpredictable results. Interestingly, this bug is only triggered for a (background subshell)& and not for a forked (regular subshell). Which means the exec optimization is not done for a forked regular subshell, though there is no reason not to. That will be fixed in the next commit. src/cmd/ksh93/sh/xec.c: sh_exec(): - case TFORK: Never allow an exec optimization if we're running a command in a multi-command pipeline (pipejob is set) and the shell needs to wait for all pipeline commands, i.e.: either the time keyword is in use, the SH_MONITOR state is active, or the SH_PIPEFAIL option is on. - case TFIL: Fix the logic for setting job.waitall for the non-SH_PIPEFAIL case. Do not 'or' in the boolean value but assign it, and include the SH_TIMING (time keyword in use) state too. - case TTIME: After that fix in case TFIL, we don't need to bother setting job.waitall explicitly here. src/cmd/ksh93/sh.1: - Add missing documentation for the conditions where the shell waits for all pipeline components (time, -o monitor/pipefail). Resolves: https://github.com/ksh93/ksh/issues/449	2022-06-18 23:25:30 +01:00
Martijn Dekker	6016fb64ce	Forking workaround for converting to associative array in subshell $ arch/*/bin/ksh -xc 'typeset -a a=(1 2 3); \ (typeset -A a; typeset -p a); typeset -p a' typeset -A a=() typeset -a a=(1 2 3) The associative array in the subshell is empty, so the conversion failed. So far, I have been unsuccessful at fixing this in the array and/or virtual subshell code (a patch that fixes it there would still be more than welcome). As usual, real subshells work correctly, so this commit adds another forking workaround. The use case is rare and specific enough that I have no performance concerns. src/cmd/ksh93/bltins/typeset.c: setall(): - Fork a virtual subshell if we're actually converting a variable to an associative array, i.e.: the NV_ARRAY (-A, associative array) attribute was passed, there are no assignments (sh.envlist is NULL), and the variable is not unset. src/cmd/ksh93/tests/arith.sh: - Fix the "Array subscript quoting test" tests that should not have been passing and that correctly failed after this fix; they used 'typeset -A' without an assignment in a subshell, assuming it was unset in the parent shell, which it wasn't. Resolves: https://github.com/ksh93/ksh/issues/409	2022-06-15 04:58:14 +01:00
Martijn Dekker	50db00e136	Fix subshell trap integrity, e.g. re-trapping a signal in subshell Ksh handles local traps in virtual subshells the same way as local traps in ksh-style shell functions, which can cause incorrect operation. Reproducer script: trap 'echo "parent shell trap"; exit 0' USR1 (trap 'echo "subshell trap"' USR1; kill -USR1 $$) echo wrong Output on every ksh93 version: 'wrong' Output on every other shell: 'parent shell trap' The ksh93 output is wrong because $$ is the PID of the main shell, therefore 'kill -USR1 $$' from a subshell needs to issue SIGUSR1 to the main shell and trigger the 'echo SIGUSR1' trap. This is an inevitable consequence of processing signals in a virtual subshell. Signals are a process-wide property, but a virtual subshell and the parent shell share the same process. Therefore it is not possible to distinguish between the parent shell and subshell trap. This means virtual subshells are fundamentally incompatible with receiving signals. No workaround can make this work properly. Ksh could either assume the signal needs to be caught by the subshell trap (wrong in this case, but right in others) or by the parent shell trap. But it does neither and just gives up and does nothing, which I suppose is the least bad way of doing it wrong. As another example, consider a subshell that traps a signal, then passes its own process ID (as of `9de65210`, that's ${.sh.pid}) to another process to say "here is where to signal me". A virtual subshell will send it the PID that it shares with the the parent shell. Even if a virtual subshell receives the signal correctly, it may fork mid-execution afterwards, depending on the commands that it runs (and this varies by implementation as we fix or work around bugs). So its PID could be invalidated at any time. Forking a virtual subshell at the time of trapping a signal is the only way to ensure a persistent PID and correct operation. src/cmd/ksh93/bltins/trap.c: b_trap(): - Fork when trapping (or ignoring) a signal in a virtual subshell. (There's no need to fork when trapping a pseudosignal.) src/cmd/ksh93/tests/signal.sh: - Add tests. These are simplified versions of tests already there, which issued 'kill' as a background job. Currently, running a background job causes a virtual subshell to fork before forking the 'kill' background job (or any background job, see `e3d7bf1d`) -- an ugly partial workaround that I believe just became redundant and which I will remove in the next commit.	2022-06-14 01:33:24 +01:00
Martijn Dekker	9b893992a3	[v1.0] posix: don't zero-pad 2nds (re: `5c677a4c`, `70fc1da7`, `b1a41311`) The POSIX mode now disables left-hand zero-padding of seconds in 'time'/'times' output. The standard specifies the output format quite exactly and zero-padding is not in it.	2022-06-12 16:16:11 +01:00
Martijn Dekker	148a8a3f46	Another build system overhaul (re: `35672208`, `580ff616`, `6cc2f6a0`) So far we've been handling AST release build and git commit flags and ksh SHOPT_* compile time options in the generic package build script. That was a hack that was necessary before I had sufficient understanding of the build system. Some of it did not work very well, e.g. the correct git commit did not show up in ${.sh.version} when compiling from a git repo. As of this commit, this is properly included in the mamake dependency tree by handling it from the libast and ksh93 Mamfiles, guaranteeing they are properly up to date. For a release build, the _AST_ksh_release macro is renamed to _AST_release, because some aspects of libast also use this. This commit also adds my first attempt at documenting the (very simple, six-command) mamake language as it is currently implemented -- which is significantly different from Glenn Fowler's original paper. This is mostly based on reading the mamake.c source code. src/cmd/INIT/README-mamake.md: - Added. bin/package, src/cmd/INIT/package.sh: - Delete the hack. /Mamfile: - Remove KSH_RELFLAGS and KSH_SHOPTFLAGS, which supported the hack. - Delete 'meta' commands. They were no-ops; mamake.c ignores them. They also did not add any informative value. src/lib/libast/Mamfile: - Add a 'virtual' target that obtains the current git commit, examines the git branch, and decides whether to auto-set an _AST_git_commit and/or or _AST_release #define to a new releaseflags.h header file. This is overwritten on each run. - Add code to the install target that copies limit.h to include/ast, but only if it doesn't exist or the content of the original changed. This allows '#include <releaseflags.h>' from any program using libast while avoiding needless recompiles. - When there are uncommitted changes, add /MOD (modified) to the commit hash instead of not defining it at all. src/cmd/ksh93/: - Mamfile: Add a shopt.h target that reads SHOPT.sh and converts it into a new shopt.h header file in the object code directory. The shopt.h header only contains SHOPT_* directives that have a value in SHOPT.sh (not the empty/probe ones). They also do not redefine the macros if they already exist, so overriding with something like CCFLAGS+=' -DSHOPT_FOO=1' remains possible. - *.c: Every c file now #includes "shopt.h" first. So SHOPT_ macros are no longer passed via environment/MAM variables. * SHOPT.sh: The AUDITFILE and CMDLIB_DIR macros no longer need an extra backslash-escape for the double quotes in their values. (The old way required this because mamake inserts MAM variables directly into shell scripts as literals without quoting. :-/ ) src/cmd/INIT/mamake.c: - Import the two minor changes between from 93u+ and 93v-: bind() is renamed to bindfile() and there is a tweak to detecting an "improper done statement". - Allow arbitrary whitespace (isspace()) everywhere, instead of spaces only. This obsoletes my earlier indentation workaround from `6cc2f6a0`; turns out mamake always supported indentation, but with spaces only. - Do not skip line numbers at the beginning of each line. This undocumented feature is not and (AFAICT) never has been used. - Throw an error on unknown command or rule attribute. Quite an important feature for manual maintenance: catches typos, etc.	2022-06-12 05:47:02 +01:00
Martijn Dekker	b14e79e9d0	posix: use real pipe(2) instead of socketpair(2) The POSIX standard requires real UNIX pipes as in pipe(2). But on systems supporting it (all modern ones), ksh uses socketpair(2) instead to make it possible for certain commands to peek ahead without consuming input from the pipe, which is not possible with real pipes. See features/poll and sh/io.c. But this can create undesired side effects: applications connected to a pipe may test if they are connected to a pipe, which will fail if they are connected to a socket. Also, on Linux: $ cat /etc/passwd \| head -1 /dev/stdin head: cannot open '/dev/stdin' for reading: No such device or address ...which happens because, unlike most systems, Linux cannot open(2) or openat(2) a socket (a limitation that is allowed by POSIX). Unfortunately at least two things depend on the peekahead capability of the _pipe_socketpair feature. One is the non-blocking behaviour of the -n option of the 'read' built-in: -n Causes at most n bytes to be read instead of a full line, but will return when reading from a slow device as soon as any characters have been read. The other thing that breaks is the <#pattern and <##pattern redirection operators that basically grep standard input, which inherently requires peekahead. Standard UNIX pipes always block on read and it is not possible to peek ahead, so these features inevitably break. Which means we cannot simply use standard pipes without breaking compatibility. But we can at least fix it in the POSIX mode so that cross-platform scripts work more correctly. src/cmd/ksh93/sh/io.c: sh_pipe(): - If _pipe_socketpair is detected at compile time, then use a real pipe via sh_rpipe() if the POSIX mode is active. (If _pipe_socketpair is not detected, a real pipe is always used.) src/cmd/ksh93/data/builtins.c: - sh.1 documents the slow-device behaviour of -n but 'read --man' did not. Add that, making it conditional on _pipe_socketpair. Resolves: https://github.com/ksh93/ksh/issues/327	2022-06-09 16:16:16 +01:00
Martijn Dekker	0602177646	posix: block brace expansion of unquoted expansions (re: `a14d17c0`) Historically, ksh (including ksh88 and mksh) allow brace expansion not just on literal patterns but also on patterns resulting from unquoted variable expansions or command substitutions: $ a='{a,b}' ksh -c 'echo X{a,b} Y$a' Xa Xb Ya Yb Most people expect only the first (literal) pattern to be expanded, as in bash and zsh: $ a='{a,b}' bash -c 'echo X{a,b} Y$a' Xa Xb Y{a,b} The historic ksh behaviour is poorly documented and nearly unknown, violates the principle of least astonishment, and makes unquoted variable expansions even more unsafe. See discussion at: https://www.austingroupbugs.net/view.php?id=1193 https://github.com/ksh93/ksh/issues/140 Unfortunately, we cannot change it in default ksh without breaking backward compatibility. But we can at least fix it for the POSIX mode (which disables brace expansion by default but allows turning it back on), particularly as it looks like POSIX, if it decides to specify brace expansion in a future version of the standard, will disallow brace expansion on unquoted variable expansions. src/cmd/ksh93/sh/macro.c: endfield(): - When deciding whether to do brace expansion + globbing or only globbing, also check that we do not have POSIX mode and an unquoted variable expansion (mp->pattern==1).	2022-06-08 22:21:53 +01:00
Martijn Dekker	9da0887e54	Fix spurious export attribute when printing compound variables Reproducer script: typeset -Ttyp1 typ1=( function get { .sh.value="'Sample'"; } ) typ1 var11 typeset -p .sh.type typeset -p .sh.type Buggy output: namespace sh.type { typeset -r typ1='Sample' } namespace sh.type { typeset -x -r typ1='Sample' } An -x (export) attribute is magically pulled out of a hat. Analysis: The walk_tree() function in nvdisc.c repurposes (!) the NV_EXPORT attribute as an instruction to turn off indenting when pretty-printing the values of compound variables. The print_namval() function in typeset.c, implementing 'typeset -p', turns on NV_EXPORT for compound variables to inhibit indentation. But it then does not bother to turn it off, which causes this bug. src/cmd/ksh93/bltins/typeset.c: print_namval(): - When printing compound variables, only turn on NV_EXPORT temporarily. Resolves: https://github.com/ksh93/ksh/issues/456	2022-06-07 04:27:54 +01:00
Martijn Dekker	7a5423dfb6	Fix more spurious comsub execution in tab completion (re: `7a2d3564`) Comsubs were either executed or caused a syntax error when attempting to complete them within single quotes. Since single quotes do not expand anything, no such completion should take place. $ '`/de<TAB>-ksh: /dev/: cannot execute [Is a directory] $ '$(/de<TAB>-ksh: syntax error: `end of file' unexpected src/cmd/ksh93/edit/completion.c: - find_begin(): - Remove recursive handling for '`' comsubs from `7a2d3564`; it is sufficient to set the return pointer to the current location cp (the character following '`') if we're not in single quotes. - For '$' and '`', if we are within single quotes, set type '\'' and set the return pointer bp to the location of the '$' or '`'. - ed_expand(): If find_begin() sets type '\'' and the current begin character is $ or `, refuse to attempt completion; return -1 to cause a terminal beep. Related: https://github.com/ksh93/ksh/issues/268 https://github.com/ksh93/ksh/issues/462#issuecomment-1038482307	2022-06-06 03:12:57 +01:00
Martijn Dekker	e3aa32a129	Add --functrace shell option (re: `2a835a2d`) A side effect of the bug fixed in `2a835a2d` caused the DEBUG trap action to appear to be inherited by subshells, but in a buggy way that could crash the shell. After the fix, the trap is reset in subshells along with all the others, as it should be. Nonetheless, as that bug was present for years, some have come to rely on it. This commit implements that functionality properly. When the new --functrace option is turned on, DEBUG trap actions are now inherited by subshells as well as ksh function scopes. In addition, since it makes logical sense, the new option also causes the -x/--xtrace option's state to be inherited by ksh function scopes. Note that changes made within the scope do not propagate upwards; this is different from bash. (I've opted against adding a -T short-form equivalent as on bash, because -T was formerly a different option on 93u+ (see 63c55ad7) and on mksh it has yet anohter a different meaning. To minimise confusion, I think it's best to have the long-form name only.) src/cmd/ksh93/include/shell.h, src/cmd/ksh93/data/options.c: - Add new "functrace" (SH_FUNCTRACE) long-form shell option. src/cmd/ksh93/sh/subshell.c: sh_subshell(): - When functrace is on, copy the parent's DEBUG trap action into the virtual subshell scope after resetting the trap actions. src/cmd/ksh93/sh/xec.c: sh_funscope(): - When functrace is on and xtrace is on in the parent scope, turn it on in the child scope. - Same DEBUG trap action handling as in sh_subshell(). Resolves: https://github.com/ksh93/ksh/issues/162	2022-06-04 17:27:27 +01:00
Martijn Dekker	2ecc2575d5	Fix import of float attribute/value from environment (re: `960a1a99`) Bug 1: as of `960a1a99`, floating point literals were no longer recognised when importing variables from the environment. The attribute was still imported but the value reverted to zero: $ (export LC_NUMERIC=C; typeset -xF5 num=7.75; \ ksh -c 'typeset -p num') typeset -x -F 5 num=0.00000 Bug 2 (inherited from 93u+): The code that imported variable attributes from the environment only checked '.' to determine whether the float attribute should be set. It should check the current radix point instead. $ (export LC_NUMERIC=debug; typeset -xF5 num=7,75; \ ksh -c 'typeset -p num') typeset -x -i num=0 ...or, after fixing bug 1 only, the output is: typeset -x -i num=75000 src/cmd/ksh93/sh/arith.c: sh_strnum(): - When importing untrusted env vars at init time, handle not only "base#value" literals using strtonll, but also floating point literals using strtold. This fixes the bug without reallowing arbitary expressions. (re: `960a1a99`) - When not initialising, use sh.radixpoint (see `f0386a87`) instead of '.' to help decide whether to evaluate an arith expression. src/cmd/ksh93/sh/init.c: env_import_attributes(): - Use sh.radixpoint instead of '.' to check for a decimal fraction. (This code is needed because doubles are exported as integers for ksh88 compatibility; see attstore() in name.c.)	2022-06-03 12:18:54 +01:00
Martijn Dekker	ea300089a1	New feature: 'typeset -g' as in bash 4.2+ typeset -g allows directly manipulating the attributes of variables at the global level from any context. This feature already exists on bash 4.2 and later. mksh (R50+), yash and zsh have this flag as well, but it's slightly different: it ignores the current local scope, but a parent local scope from a calling function may still be used -- whereas on bash, '-g' always refers to the global scope. Since ksh93 uses static scoping (see III.Q28 at <http://kornshell.com/doc/faq.html>), only the bash behaviour makes sense here. Note that the implementation needs to be done both in nv_setlist() (name.c) and in b_typeset() (typeset.c) because assignments are executed before the typeset built-in itself. Hence also the pre-parsing of typeset options in sh_exec(). src/cmd/ksh93/include/nval.h: - Add new NV_GLOBAL bit flag, using a previously unused bit that still falls within the 32-bit integer range. src/cmd/ksh93/sh/xec.c: sh_exec(): - When pre-parsing typeset flags, make -g pass the NV_GLOBAL flag to the nv_setlist() call that processes shell assignments prior to running the command. src/cmd/ksh93/sh/name.c: nv_setlist(): - When the NV_GLOBAL bit flag is passed, save the current variable tree pointer (sh.var_tree) as well as the current namespace (sh.namespace) and temporarily set the former to the global variable tree (sh.var_base) and the latter to NULL. This makes assignments global and ignores namesapces. src/cmd/ksh93/bltins/typeset.c: - b_typeset(): - Use NV_GLOBAL bit flag for -g. - Allow combining -n with -g, permitting 'typeset -gn var' or 'nameref -g var' to create a global nameref from a function. - Do not allow a nonsensical use of -g when using nested typeset to define member variables of 'typeset -T' types. (Type method functions can still use it as normal.) - setall(): - If NV_GLOBAL is passed, use sh.var_base and deactivate sh.namespace as in nv_setlist(). This allows attributes to be set correctly for global variables. src/cmd/ksh93/tests/{functions,namespace}.sh: - Add regression tests based on reproducers for problems found by @hyenias in preliminary versions of this feature. Resolves: https://github.com/ksh93/ksh/issues/479	2022-06-01 21:07:01 +01:00
Martijn Dekker	f73b8617dd	Restore namespace's parent scope when exiting due to error Reproducer: $ namespace test { x=123; typeset -g x=456; } $ echo $x ${.test.x} 456 123 $ namespace test { typeset -Q; } arch/darwin.i386-64/bin/ksh: typeset: -Q: unknown option [usage message snipped for brevity] $ echo $x ${.test.x} 123 123 <== expected: 123 456 $ x=789 $ echo $x ${.test.x} 789 789 <== expected: 789 456 $ # look at that, we never left the namespace... When prefixing the erroneous 'typeset' with 'command', the problem does not occur. 'command' disables the properties of special built-ins such as exit on error. So, when a special built-in exits on error, the parent scope is not properly resotred. This bug exists in every ksh93 version with SHOPT_NAMESPACE so far. src/cmd/ksh93/sh/xec.c: sh_exec(): - Before entering a namespace, use sh_pushcontext and sigsetjmp to make sure we return here if sh_exit() is called, e.g. when a special builtin throws an error, to ensure the parent scope (oldnspace) is restored. Thanks to @hyenias for making me aware of this bug. Discussion: https://github.com/ksh93/ksh/issues/479#issuecomment-1140468965	2022-05-29 23:05:03 +01:00
Martijn Dekker	8f14514661	set --default: properly restore ksh IFS behaviour (re: `9e2a8c69`) Reproducer: $ (IFS=$'\t\t'; val=$'\tone\t\ttwo\t'; set --posix; \ set -- $val; echo $#; set --noposix; set -- $val; echo $#) 2 4 <== OK $ (IFS=$'\t\t'; val=$'\tone\t\ttwo\t'; set --posix; \ set -- $val; echo $#; set --default; set -- $val; echo $#) 2 2 <== bug The output of the seconnd command line should be like the first. When POSIX mode is turned off using 'set --noposix' (or 'set +o posix'), sh.ifstable is invalidated as it needs to be repopulated on the next field split to restore ksh-specific special handling of a repeated $IFS whitespace character as non-whitespace. However, when 'set --default' is used, this does not happen, which is a bug. src/cmd/ksh93/sh/args.c: sh_argopts(): - While processing --default, when turning off SH_POSIX, call sh_invalidate_ifs() to invalidate sh.ifstable.	2022-05-28 00:13:46 +01:00
Martijn Dekker	83baa27ef9	Fix incorrect typeset -L/-R/-Z on input with spaces (re: `bdb99741`) The typeset output for -L/-R/-Z seems to be wrong when the input has leading/trailing spaces. This started occurring after the dynamic buffer size changes introduced in name.c as part of the fix for <https://github.com/ksh93/ksh/issues/142>. Test script: typeset -L8 s_date1=" 22/02/09 08:25:01"; echo "$s_date1" typeset -R10 s_date1="22/02/09 08:25:01 "; echo "$s_date1" typeset -Z10 s_date1="22/02/09 08:25:01 "; echo "$s_date1" Actual output: 22/02/0 08:25:01 0008:25:01 Expected output: 22/02/09 9 08:25:01 9 08:25:01 src/cmd/ksh93/sh/name.c: nv_newattr(): - Simplify allocation code, replacing the earlier dynamic buffer size calculation with just the greater of the strlen and size. Resolves: https://github.com/ksh93/ksh/issues/476 Co-authored-by: George Lijo <george.lijo@gmail.com>	2022-05-26 00:08:45 +01:00
atheik	9bed28c3f9	Fix line continuation within command substitutions In command substitutions of the $(standard) and ${ shared state; } form, backslash line continuation is broken. Reproducer: echo $( echo one two\ three ) Actual output (ksh93, all versions): one two\ three Expected output (every other shell, POSIX spec): one twothree src/cmd/ksh93/sh/lex.c: sh_lex(): case S_REG: - Do not skip new-line joining if we're currently processing a command substitution of one of these forms (i.e., if the lp->lexd.dolparen level is > 0). Background info/analysis: comsub() is called from sh_lex() when S_PAR is the current state. In src/cmd/ksh93/data/lexstates.c, we see that S_PAR is reached in the ST_DOL state table at index 40. Decimal 40 is ( in ASCII. So, the previous skipping of characters was done according to the ST_DOL state table, and the character that stopped it was (. This means we have $(. Alternatively, comsub() may be called from sh_lex() by jumping to the do_comsub label. In brief, that is the case when we have ${. Regardless of which it is from the two, comsub() is now called from sh_lex(). In comsub(), lp->lexd.dolparen is incremented at the beginning and decremented at the end. Between them, we see that sh_lex() is called. So, lp->lexd.dolparen in sh_lex() indicates the depth of nesting $( or ${ statements we're in. Thus, it is also the number of comsub() invocations seen in a backtrace taken in sh_lex(). The codepath for `...` is different (and never had this bug). Co-authored by: Martijn Dekker <martijn@inlv.org> Resolves: https://github.com/ksh93/ksh/issues/367	2022-05-22 00:23:54 +01:00
atheik	40a5c45b48	Allow double quotes within backtick comsub within double quotes The following reproducer causes a spurious syntax error: foo="`: "("`" The nested double quotes are not recognised correctly, causing a syntax error at the '('. Removing the outer double quotes (which are unnecessary) is a workaround, but it's still a bug as every other shell accepts this. This bug has been present since the original Bourne shell. src/cmd/ksh93/sh/lex.c: sh_lex(): case S_QUOTE: - If the current character is '"' and we're in a `...` command substitution (ingrave is true), then do not switch to the old mode but keep using the ST_QUOTE state table. Thanks to @JohnoKing for the report and to @atheik for the fix. Co-authored by: Martijn Dekker <martijn@inlv.org> Resolves: https://github.com/ksh93/ksh/issues/352	2022-05-20 22:48:47 +01:00
Martijn Dekker	fd28da31da	Fix another test/[ corner case bug; add --posix test script This fixes another corner case bug in the horror show that is the test/[ comand. Reproducer: $ ksh --posix -c 'test X -a -n' ksh: test: argument expected Every other shell returns 0 (success) as, POSIXly, this is a test for the strings 'X' and '-n' both being non-empty, combined with the binary -a (logical and) operator. Instead, '-n' was taken as a unary primary operator with a missing argument, which is incorrect. POSIX reference: https://pubs.opengroup.org/onlinepubs/9699919799/utilities/test.html > 3 arguments: > * If $2 is a binary primary, perform the binary test of $1 and $3. src/cmd/ksh93/bltins/test.c: - e3(): If the final argument begins with a dash, always treat it as a test for a non-empty string, therefore return true. Do not limit this to "new flags" only. src/cmd/ksh93/tests/posix.sh: - Added. These are tests for every aspect of the POSIX mode.	2022-03-11 21:23:45 +01:00
Martijn Dekker	fae1932e62	enum: remove arbitrary one-argument limitation b_enum() contains a check that exactly one argument is given: 237: if (error_info.errors \|\| !argv \|\| (argv + 1)) But the subsequent argument handling loop will happily deal with multiple arguments: 246: while(cp = argv++) Every other declaration command supports multiple arguments and I see no reason why enum shouldn't. Simply removing the '(argv + 1)' check allows 'enum' to create more than one type per invocation. src/cmd/ksh93/bltins/enum.c: - b_enum(): Remove check for >1 args as described above. - Update documentation to describe the behaviour of enumeration types in arithmetic expressions and to add an example: a bool type with two enumeration values 'false' (0) and 'true' (1). That type is predefined in ksh 93v- and 2020. We're not going to do that in 93u+m but it's good to document the possibility. src/cmd/ksh93/sh.1: - Make changes parallel to the enum.c self-doc update.	2022-03-11 21:21:23 +01:00
atheik	2e5fd4d4c1	slowread(): Turn off O_NONBLOCK for stdin if it is on (#471 ) This change turns off O_NONBLOCK for stdin if a previously ran program left it on so that interactive programs that expect it to be off work properly. src/cmd/ksh93/sh/io.c: slowread(): - Turn off O_NONBLOCK for stdin if it is on. Fixes: https://github.com/ksh93/ksh/issues/469	2022-03-11 21:10:59 +01:00
Johnothan King	e87dbebebd	Fix use after free bug when using += (re: `75796a9c`) (#466 ) The previous fix for the += operator introduced a use-after-free bug that could result in a variable pointing to random garbage: $ foo=bar $ foo+=_foo true $ typeset -p foo foo=V V The use after free issue occurs because when nv_clone creates a copy of $foo in the true command's invocation-local scope, it does not duplicate the string $foo points to. As a result, the $foo variable in the parent scope points to the same string as $foo in the invocation-local scope, which causes the use after free bug when cloned $foo variable is freed from memory. src/cmd/ksh93/sh/nvdisc.c: - To fix the use after free bug, allow nv_clone to duplicate the string with memdup or strdup when no flags are passed. src/cmd/ksh93/tests/variables.sh: - Add a regression test for using the += operator with regular commands. src/cmd/ksh93/tests/leaks.sh: - Add a regression test to ensure the bugfix doesn't introduce any memory leaks.	2022-03-11 21:08:57 +01:00
Martijn Dekker	11177d448d	Fix crash on cd in subshell with PWD unset (re: `5ee290c`) Reproducer: $ ksh -c 'unset PWD; (cd /); :' Memory fault The shell crashes because b_cd() is testing the value of the PWD variable without checking if there is one. src/cmd/ksh93/sh/path.c: path_pwd(): - Never return an unfreeable pointer to e_dot; always return a freeable pointer. This fixes another corner-case crashing bug. - Make sure the PWD variable gets assigned a value if it doesn't have one, even if it's the "." fallback. However, if the PWD is inaccessible but we did inherit a $PWD value that starts with a /, then use the existing $PWD value as this will help the shell fail gracefully. src/cmd/ksh93/bltins/cd_pwd.c: - b_cd(): When checking if the PWD is valid, use the sh.pwd copy instead of the PWD variable. This fixes the crash above. - b_cd(): Since path_pwd() now always returns a freeable value, free sh.pwd unconditionally before setting the new value. - b_pwd(): Not only check that path_pwd() returns a value starting with a slash, but also verify it with test_inode() and error out if it's wrong. This makes the 'pwd' command useful for checking that the PWD is currently accessible. src/cmd/ksh93/data/msg.c: - Change e_pwd error message for accuracy and clarity.	2022-02-17 19:45:37 +00:00
Martijn Dekker	d55e9686d7	Backport 'read -a' and 'read -p' from ksh 93v-/2020 This backports two minor additions to the 'read' built-in from ksh 93v-: '-a' is now the same as '-A' and '-u p' is the same as '-p'. This is for compatibility with some 93v- or ksh2020 scripts. Note that their change to the '-p' option to support both prompts and reading from the coprocess was not backported because we found it to be broken and unfixable. Discussoin at: https://github.com/ksh93/ksh/issues/463 src/cmd/ksh93/bltins/read.c: b_read(): - Backport as described above. - Rename the misleadingly named 'name' variable to 'prompt'. It points to the prompt string, not to a variable name. src/cmd/ksh93/data/builtins.c: sh_optpwd[]: - Add -a as an alterative to -A. All that is needed is adding '\|a' and optget(3) will automatically convert it to 'A'. - Change -u from a '#' (numeric) to ':' option to support 'p'. Note that b_read() now needs a corresponding strtol() to convert file descriptor strings to numbers where applicable. - Tweaks. src/cmd/ksh93/sh.1: - Update accordingly. - Tidy up the unreadable mess that was the 'read' documentation. The options are now shown in a list.	2022-02-17 19:44:54 +00:00
Martijn Dekker	95d695cb5a	Improve and document fast filescan loops (SHOPT_FILESCAN) From README: FILESCAN on Experimental option that allows fast reading of files using while < file;do ...; done and allowing fields in each line to be accessed as positional parameters. As SHOPT_FILESCAN has been enabled by default since ksh 93l 2001-06-01, the filescan loop is now documented in the manual page and the compile-time option is no longer considered experimental. We must disable this at runtime if --posix is active because it breaks a portable use case: POSIXly, 'while <file; do stuff; done' repeatedly excutes 'stuff' while 'file' can successfully be opened for reading, without actually reading from 'file'. This also backports a bugfix from the 93v- beta. Reproducer: $ echo 'one two three' >foo $ while <foo; do printf '[%s] ' "$@"; echo; done [one two three] Expected output: [one] [two] [three] The bug is that "$@" acts like "$*", joining all the positional parameters into one word though it should be generating one word for each. src/cmd/ksh93/sh/macro.c: varsub(): - Backport fix for the bug described above. I do not understand the opaque macro.c code well enough yet to usefully describe the fix. src/cmd/ksh93/sh/xec.c: sh_exec(): - Improved sanity check for filescan loop: do not recognise it if the simple command includes variable assignments, more than one redirection, or an output or append redirection. - Disable filescan loops if --posix is active. - Another 93v- fix: handle interrupts (errno==EINTR) when closing the input file.	2022-02-17 19:43:36 +00:00
Martijn Dekker	4886463bb6	Disable broken KEYBD trap for multibyte characters In UTF-8 locales, ksh breaks when a KEYBD trap is active, even a dummy no-op one like 'trap : KEYBD'. Entering multi-byte characters fails (the input is interrupted and a new prompt is displayed) and pasting content with multi-byte characters produces corrupted results. The cause is that the KEYBD trap code is not multibyte-ready. Unfortunately nobody yet understands the edit.c code well enough to implement a proper fix. Pending that, this commit implements a workaround that at least avoids breaking the shell. src/cmd/ksh93/edit/edit.c: ed_getchar(): - When a multi-byte locale is active, do not trigger the the KEYBD trap except for ASCII characters (1-127). Resolves: https://github.com/ksh93/ksh/issues/307	2022-02-17 19:39:42 +00:00
Martijn Dekker	6304dfce41	Fix corner-case >&- redirection leak out of subshell Reproducer: exec 9>&1 ( { exec 9>&1; } 9>&- ) echo "test" >&9 # => 9: cannot open [Bad file descriptor] The 9>&- incorrectly persists beyond the { } block that it was attached to and beyond the ( ) subshell. This is yet another bug with non-forking subshells; forking it with something like 'ulimit -t unlimited' works around the bug. In over a year we have not been able to find a real fix, but I came up with a workaround that forks a virtual subshell whenever it executes a code block with a >&- or <&- redirection attached. That use case is obscure enough that it should not cause any performance regression except in very rare corner cases. src/cmd/ksh93/sh/xec.c: sh_exec(): TSETIO: - This is where redirections attached to code blocks are handled. Check for a >&- or <&- redirection using bit flaggery from shnodes.h and fork if we're executing such in a virtual subshell. Resolves: https://github.com/ksh93/ksh/issues/161 Thanks to @ko1nksm for the bug report.	2022-02-17 19:35:47 +00:00

1 2 3 4 5

233 commits