external/cde - Personal Git space

mirror of git://git.code.sf.net/p/cdesktopenv/code synced 2025-03-09 15:50:02 +00:00

Author	SHA1	Message	Date
Johnothan King	1d9093e603	Add support for process substitutions to the deparser (#288 ) Like tdump() and trestore() before commit `32d1abb1`, sh_deparse() fails to handle process substitutions correctly. This limitation of the shell deparser is rather minor since it's unused. However, seeing as the deparser was left in the code base intentionally it should at least function properly. src/cmd/ksh93/sh/deparse.c: - Add a PROCSUBST flag for handling process substitutions in sh_deparse(). - If we're handling a process substitution, add an ending ')' without an extra newline. - Avoid adding an extra ' &' to commands inside of a process substitution. An extra ' &' is only added if the FAMP and FINT flags are set, which indicates the command was spawned as a separate job with '&'. - Add process substitution handling to 'p_redirect' by calling p_tree() when encountering a process substitution.	2021-04-26 00:12:29 +01:00
Martijn Dekker	13c57e4b58	Fix 'unset -f' to work in subshells without forking (re: `047cb330`) This commit implements unsetting functions in virtual subshells, removing the need for the forking workaround. This is done by either invalidating the function found in the current subshell function tree by unsetting its NV_FUNCTION attribute bits (which will cause sh_exec() to skip it) or, if the function exists in a parent shell, by creating an empty dummy subshell node in the current function tree without that attribute. As a beneficial side effect, it seems that bug 228 (unset -f fails in forked subshells if a function is defined before forking) is now also fixed. src/cmd/ksh93/include/defs.h, src/cmd/ksh93/sh/init.c: - Add sh.fun_base for a saved pointer to the main shell's function tree for checking when in a subshell, analogous to sh.var_base. src/cmd/ksh93/bltins/typeset.c: unall(): - Remove the fork workaround. - When unsetting a function found in the current function tree (troot) and that tree is not sh.var_base (which checks if we're in a virtual subshell in a way that handles shared-state command substitutions correctly), then do not delete the function but invalidate it by unsetting its NV_FUNCTION attribute bits. - When unsetting a function not found in the current function tree, search for it in sh.fun_base and if found, add an empty dummy node to mask the parent shell environment's function. The dummy node will not have NV_FUNCTION set, so sh_exec() will skip it. src/cmd/ksh93/sh/subshell.c: - sh_subfuntree(): For 'unset -f' to work correctly with shared-state command substitutions (subshares), this function needs a fix similar to the one applied to sh_assignok() for variables in commit `911d6b06`. Walk up on the subshells tree until we find a non-subshare. - sh_subtracktree(): Apply the same fix for the hash table. - Remove table_unset() and incorporate an updated version of its code in sh_subshell(). As of `ec888867`, this function was only used to clean up the subshell function table as the alias table no longer exists. - sh_subshell(): * Simplify the loop to free the subshell hash table. * Add table_unset() code, slightly refactored for readability. Treat dummy nodes now created by unall() separately to avoid a memory leak; they must be nv_delete()d without passing the NV_FUNCTION bits. For non-dummy nodes, turn on the NV_FUNCTION attribute in case they were invalidated by unall(); this is needed for _nv_unset() to free the function definition. src/cmd/ksh93/tests/subshell.sh: - Update the test for multiple levels of subshell functions to test a subshare as well. While we're add it, add a very similar test for multiple levels of subshell variables that was missing. - Add @JohnoKing's reproducer from #228. src/cmd/ksh93/tests/leaks.sh: - Add leak tests for unsetting functions in a virtual subshell. Test both the simple unset case (unall() creates a dummy node) and the define/unset case (unall() invalidates existing node). Resolves: https://github.com/ksh93/ksh/issues/228	2021-04-24 06:57:49 +01:00
Johnothan King	086d504393	Lots of man page fixes and some other minor fixes (#284 ) Noteworthy changes: - The man pages have been updated to fix a ton of instances of runaway underlining (this was done with `sed -i 's/\\f5/\\f3/g'` commands). This commit dramatically increased in size because of this change. - The documentation for spawnveg(3) has been extended with information about its usage of posix_spawn(3) and vfork(2). - The documentation for tmfmt(3) has been updated with the changes previously made to the man pages for the printf and date builtins (though the latter builtin is disabled by default). - The shell's tracked alias tree (hash table) is now documented in the shell(3) man page. - Removed the commented out regression test for an ERRNO variable as the COMPATIBILITY file states it was removed in ksh93.	2021-04-23 22:02:30 +01:00
Johnothan King	2c22ace1e6	Fix LINENO after unsetting it a virtual subshell (#283 ) There is a TODO note in variables.sh that notes the value of LINENO is wrong after a virtual subshell. The following script should print '6', but the bug causes it to print '1' instead: $ cat /tmp/lineno #!/bin/ksh ( unset LINENO : ) echo $LINENO This bug started to occur after the bugfix applied in `7b994b6a`. However, that commit is not where the cause of bug was (when that bugfix is applied to ksh versions 2008-07-25 through 2012-01-01, $LINENO works fine). Rather, the cause of this bug was introduced in 93u+ 2012-02-29. In that version, the mp->nvfun pointer was only copied from np->nvfun if the variable can be freed from memory. This is what caused `7b994b6a` to break $LINENO in subshells, so to fix this bug the mp->nvfun and np->nvfun must point to the same object, even when the variable isn't freed from memory. src/cmd/ksh93/sh/subshell.c: nv_restore(): - Always copy the np->nvfun pointer to mp->nvfun. To prevent crashes, the value of np->nvfun->nofree is set to the value given by the nofree variable, which is set before _nv_unset. See also commit `7e7f1372`, which fixed a crash that happened because _nv_unset discards the NV_NOFREE flag. src/cmd/ksh93/tests/variables.sh: - Remove the workaround for LINENO after a virtual subshell. - Add a regression test for the value of LINENO when unset in a virtual subshell, then used after the subshell. Note that before commit `997ad43b` LINENO's value was corrupted after being unset in a subshell, so the test checks for corruption of the LINENO variable (in prior commits LINENO was set to '49' because of the previous bug).	2021-04-22 19:16:25 +01:00
Johnothan King	01c01fe8f6	Fix buffer overflows and memory leaks caught by ASAN (#282 ) The changes in this commit allow ksh to be built and run with ASan[], although for now it only works under vmalloc. Example command to build ksh with ASan: $ bin/package make CCFLAGS='-O0 -g -fsanitize=address' [] https://en.wikipedia.org/wiki/AddressSanitizer src/cmd/INIT/mamake.c: - Fix a few memory leaks in mamake. This doesn't fix all of the memory leaks ASan complains about (there is one remaining in the view() function), but it's enough to get ksh to build under ASan. src/lib/libast/features/map.c, src/lib/libast/misc/glob.c: - Rename the ast globbing functions to _ast_glob() and _ast_globfree(). Without this change the globbing tests fail under ASan. See: `2c49eb6e` src/cmd/ksh93/sh/{init,io,nvtree,subshell}.c: - Fix buffer overflows by using strncmp(3) instead of memcmp(3). src/cmd/ksh93/sh/name.c: - Fix another invalid usage of memcmp by using strncmp instead. This change is also in one of Red Hat's patches: https://git.centos.org/rpms/ksh/blob/c8s/f/SOURCES/ksh-20120801-nv_open-memcmp.patch Resolves: https://github.com/ksh93/ksh/issues/230	2021-04-22 18:13:12 +01:00
Martijn Dekker	32d1abb1ba	shcomp: fix redirection with process substitution The commands within a process substitution used as an argument to a redirection (e.g. < <(...) or > >(...)) are simply not included in parse trees dumped by shcomp. This can be verified with a command like hexdump -C. As a result, these process substitutions do not work when running a bytecode-compiled shell script. The fix is surprisingly simple. A process substitution is encoded as a complete parse tree. When used with a redirection, that parse tree is used as the file name for the redirection. All we need to do is treat the "file name" as a parse tree instead of a string if flags indicate a process substitution. A process substitution is detected by the struct ionod field 'iofile'. Checking the IOPROCSUB bit flag is not enough. We also need to exclude the IOLSEEK flag as that form of redirection may use the IOARITH flag which has the same bit value as IOPROCSUB (see include/shnodes.h). src/cmd/ksh93/sh/tdump.c: p_redirect(): - Call p_tree() instead of p_string() for a process substitution. src/cmd/ksh93/sh/trestore.c: r_redirect(): - Call r_tree() instead of r_string() for a process substitution. src/cmd/ksh93/include/version.h: - Bump the shcomp binary header version as this change is not backwards compatible; previous trestore.c versions don't know how to read the newly compiled process substitutions and would crash. src/cmd/ksh93/tests/io.sh: - Add test. src/cmd/ksh93/tests/builtins.sh, src/cmd/ksh93/tests/options.sh: - Revert shcomp workarounds. (re: `6701bb30`) Resolves: https://github.com/ksh93/ksh/issues/165	2021-04-22 03:25:24 +01:00
Martijn Dekker	b7dde4e747	Fix ksh exit on syntax error in profile (re: `cb67a01b`, `ceb77b13`) Johnothan King writes: > There are two regressions related to how ksh handles syntax > errors in the .kshrc file. If ~/.kshrc or the file pointed to by > $ENV have a syntax error, ksh exits during startup. Additionally, > the error message printed is incorrect: > > $ cat /tmp/synerror > (( > echo foo > > # ksh93u+m > $ ENV=/tmp/synerror arch/*/bin/ksh -ic 'echo ${.sh.version}' > /tmp/synerror: syntax error: `/t/tmp/synerror' unmatched > > # ksh93u+ > $ ENV=/tmp/synerror ksh93u -ic 'echo ${.sh.version}' > /tmp/synerror: syntax error: `(' unmatched > Version AJM 93u+ 2012-08-01 > > The regression that causes the incorrect error message was > introduced by commit `cb67a01`. The other bug that causes ksh to > exit on startup was introduced by commit `ceb77b1`. src/cmd/ksh93/sh/lex.c: fmttoken(): - Call stakfreeze(0) to terminate a possible unterminated previous stack item before writing the token string onto the stack. This fixes the bug with garbage in a syntax error message. src/cmd/ksh93/sh/main.c: exfile(): - Revert Red Hat's ksh-20140801-diskfull.patch applied in `ceb77b13`. This fixes the bug with interactive ksh exiting on syntax error in a profile script. Testing by @JohnoKing showed the patch is no longer necessary to fix a login crash on disk full, as commit `970069a6` (which applied Red Hat patches ksh-20120801-macro.patch and ksh-20120801-fd2lost.patch) also fixes that crash. src/cmd/ksh93/README: - Fix typos. (re: `fdc08b23`) Co-authored-by: Johnothan King <johnothanking@protonmail.com> Resolves: https://github.com/ksh93/ksh/issues/281	2021-04-21 19:42:24 +01:00
Martijn Dekker	7954855f21	Don't import/export readonly attribute via magic A__z env var While automagically importing/exporting ksh variable attributes via the environment is probably a misfeature in general (now disabled for POSIX standard mode), doing so with the readonly attribute is particularly problematic. Scripts can take into account the possibility of importing unwanted attributes by unsetting or typesetting variables before using them. But there is no way for a script to get rid of an unwanted imported readonly variable. This is a possible attack vector with no possible mitigation. This commit blocks both the import and the export of the readonly attribute through the environment. I consider it a security fix. src/cmd/ksh93/sh/init.c: env_import_attributes(): - Clear NV_RDONLY from imported attributes before applying them. src/cmd/ksh93/sh/name.c: sh_envgen(): - Remove NV_RDONLY from bitmask defining attributes to export.	2021-04-21 04:11:55 +01:00
Martijn Dekker	61e0f90460	Yet more fixes for subshell directory handling (re: `feaf718f`) There were still problems left after the previous commit. On at least one system (QNX i386), the following regression test crashed: src/cmd/ksh93/test/subshell.c 900 got=$( { "$SHELL" -c '(cd /; (cd /)); print -r -- "PWD=$PWD"'; } 2>&1 ) A backtrace done on the core dunp pointed to the free() call here: src/cmd/ksh93/bltins/cd_pwd.c 90 if(oldpwd && oldpwd!=shp->pwd && oldpwd!=e_dot) 91 free(oldpwd); Analysis: The interaction between $PWD, sh.pwd aka shp->pwd, and the path_pwd() function is a mess. path_pwd() usually returns a freeable value, but not always. sh.pwd is sometimes a pointer to the value of $PWD, but not always (e.g. when you unset PWD or assign to it). Instead of debugging the exact cause of the crash, I think it is better to make this work in a more consistent way. As of this commit: 1. sh.pwd keeps its own copy of the PWD, independently of the PWD variable. The old value must always be freed immediately before assigning a new one. This is simple and consistent, reducing the chance of bugs at negligible cost. 2. The PWD variable is no longer given the NV_NOFREE attribute because its value no longer points to sh.pwd. It is now a variable like any other. src/cmd/ksh93/sh/path.c: path_pwd(): - Do not give PWDNOD the NV_NOFREE attribute. - Give sh.pwd its own copy of the PWD by strdup'ing PWDNOD's value. src/cmd/ksh93/bltins/cd_pwd.c: b_cd(): - Since sh.pwd is now consistently freed before giving it a new value and at no other time, oldpwd must not be freed any longer and can become a regular non-static variable. - If the PWD needs reinitialising, call path_pwd() to do it. src/cmd/ksh93/sh/subshell.c: sh_subshell(): - Systems with fchdir(2): Always restore the PWD upon exiting a non-subshare subshell. The check to decide whether or not to restore it was unsafe: it was not restored if the current PWD pointer and value was identical to the saved one, but a directory can be deleted and recreated under the same name. - Systems without fchdir(2) (if any exist): . Entry: Fork if the PWD is nonexistent or has no x permission. . Restore: Only chdir back if the subshell PWD was changed. That's probably the best we can do. It remains inherently unsafe. We should probably just require fchdir(2) at some point.	2021-04-20 05:39:10 +01:00
Martijn Dekker	feaf718f16	More fixes for subshell directory handling (re: `7bab9508`, `5ee290c7`) This commit fixes what are hopefully the two final aspects of #153: 1. If the present working directory does not exist (was moved or deleted) upon entering a virtual subshell, no PWD directory path is saved. Since restoring the state after exiting a virtual subshell is contingent on a previous PWD path existing, this resulted in entire aspects of the virtual subshell, such as the subshell function tree, not being cleaned up. 2. A separate problem is that 'cd ..' does not update PWD or OLDPWD when run from a nonexistent directory. A reproducer exposing both problems is: $ mkdir test $ cd test $ ksh -c '(subfn() { BAD; }; cd ..; echo subPWD==$PWD); typeset -f subfn; echo mainPWD==$PWD' subPWD==/usr/local/src/ksh93/ksh/test subfn() { BAD; };mainPWD==/usr/local/src/ksh93/ksh/test Expected output: subPWD==/usr/local/src/ksh93/ksh mainPWD==/usr/local/src/ksh93/ksh/test src/cmd/ksh93/bltins/cd_pwd.c: - If path_pwd() fails to get the PWD (usually it no longer exists), don't set $OLDPWD to '.' as that is pointless; use $PWD instead. After cd'ing from a nonexistent directory, 'cd -' should fail and should not be equivalent to 'cd .'. - Remove a redundant check for (!oldpwd) where it is always set. - Do not prematurely return without setting PWD or OLDPWD if pathcanon() fails to canonicalise a nonexistent directory. Instead, fall back to setting PWD to the result of getcwd(3). src/cmd/ksh93/sh/subshell.c: - Minor stylistic adjustment. Some NULL macros sneaked in. This historic code base does not use them (yet); change to NIL(type*). - sh_subshell(): Fix logic for determining whether to save/restore subshell state. 1. When saving, 'if(!comsub \|\| !shp->subshare)' is redundant; 'if(!shp->subshare)' should be enough. If we're not in a subshare, state should be saved. 2. When restoring, 'if(sp->shpwd)' is just nonsense as there is no guarantee that the PWD exists upon entering a subshell. Simply use the same 'if(!shp->subshare)'. Add an extra check for sp->pwd to avoid a possible segfault. Always restore the PWD on subshell exit and not only if shp->pwd is set. - sh_subshell(): Issue fatal errors in libast's "panic" format. src/cmd/ksh93/tests/builtins.sh: - Adjust a relevant test to run err_exit() outside of the subshell so that any error is counted in the main shell. - Add test for problem 2 described at the top. src/cmd/ksh93/tests/subshell.sh: - Add test for problems 1 and 2 based on reproducer above. Resolves: https://github.com/ksh93/ksh/issues/153	2021-04-19 00:06:43 +01:00
Martijn Dekker	b0a6c1bde5	Further fix '<>;' and fix crash on 32-bit systems (re: `6701bb30`) Accessing t->tre.treio for every sh_exec() run is invalid because 't' is of type Shnode_t, which is a union that can contain many different kinds of structs. As all members of a union occupy the same address space, only one can be used at a time. Which member is valid to access depends on the node type sh_exec() was called with. The invalid access triggered a crash on 32-bit systems when executing an arithmetic command like ((x=1)). The t->tre.treio union member should be accessed for a simple command (case TCOM in sh_exec()). The fix is also needed for redirections attached to blocks (case TSETIO) in which case the union member to use is t->fork.forkio. src/cmd/ksh93/sh/xec.c: - Add check_exec_optimization() function that checks for all the conditions where the exec optimisation should not be done. For redirections we need to loop through the whole list to check for an IOREWRITE (<>;) one. - sh_exec(): case TCOM (simple command): Only bother to call check_exec_optimization() if there are either command arguments or redirections (IOW: don't bother for bare variable assignments), so move it to within the if(io\|\|argn) block. - sh_exec(): case TSETIO: This needs a similar fix. To avoid the optimization breaking again if the last command is a subshell with a <>; redirection attached, we need to not only set execflg to 0 but also clear the SH_NOFORK state bit from the 'flags' variable which is passed on to the recursive sh_exec() call. src/cmd/ksh93/tests/io.sh: - Update and expand tests. Add tests for redirections attached to simple commands (TCOM) and various kinds of code block (TSETIO). Co-authored-by: Johnothan King <johnothanking@protonmail.com> Resolves: https://github.com/ksh93/ksh/issues/278	2021-04-17 21:56:39 +01:00
Johnothan King	6701bb30de	Fix <>; redirection for final command exec optimization (#277 ) The <>; operator doesn't work correctly if it's used as the last command of a -c script. Reproducer: $ echo test > a; ksh -c 'echo x 1<>; a'; cat a x st This bug is caused by ksh running the last command of -c scripts with execve(2) instead of posix_spawn(3) or fork(2). The <>; operator is noted by the man page as being incompatible with the exec builtin (see also the ksh93u+ man page), so it's not surprising this bug occurs when ksh runs a command using execve: > <>;word cannot be used with the exec and redirect built-ins. The ksh2020 fix simply removed the code required for ksh to use this optimization at all. It's not a performance friendly fix and only papers over the bug, so this commit provides a better fix. This bug was first reported at: https://github.com/att/ast/issues/9 In addition, this commit re-enables the execve(2) optimization for the last command for scripts loaded from a file. It was enabled in in older ksh versions, and was only disabled in interactive shells: https://github.com/ksh93/ast-open-history/blob/2011-06-30/src/cmd/ksh93/sh/main.c#L593-L599 It was changed on 2011-12-24 to only be used for -c scripts: https://github.com/ksh93/ast-open-history/blob/2011-12-24/src/cmd/ksh93/sh/main.c#L593-L599 We think there is no good reason why scripts loaded from a file should be optimised less than scripts loaded from a -c argument. They're both scripts; there's no essential difference between them. So this commit reverts that change. If there is a bug left in the optimization after this fix, this revert increases the chance of exposing it so that it can be fixed. src/cmd/ksh93/sh/xec.c: - The IOREWRITE flag is set when handling the <>; operator, so to fix this bug, avoid exec'ing the last command if it uses <>;. See also commit `17ebfbf6`, which fixed another issue related to the execve optimization. src/cmd/ksh93/tests/io.sh: - Enable a regression test that was failing because of this bug. - Add the reproducer from https://github.com/att/ast/issues/9 as a regression test. src/cmd/ksh93/sh/main.c: - Only avoid the non-forking optimization in interactive shells. src/cmd/ksh93/tests/signal.sh: - Add an extra comment to avoid the non-forking optimization in the regression test for rhbz#1469624. - If the regression test for rhbz#1469624 fails, show the incorrect exit status in the error message. src/cmd/ksh93/tests/builtins.sh, src/cmd/ksh93/tests/options.sh: - This bugfix was causing the options regression test to segfault when run under shcomp. The cause is the same as <https://github.com/ksh93/ksh/issues/165>, so as a workaround, avoid parsing process substitutions with shcomp until that is fixed. This workaround should also avoid the other problem detailed in <https://github.com/ksh93/ksh/issues/274>. Resolves: https://github.com/ksh93/ksh/issues/274	2021-04-15 18:29:50 +01:00
Andy Fiddaman	2fdf394b99	nv_disc(NV_LAST) loses trailing shell context from discipline stack (#276 ) This is the underlying cause for the issue worked around in `3654ee73`. The following explanation refers to the current illumos version of ksh93 and shows output from illumos' modular debugger: https://illumos.org/books/dev/debugging.html Each environment variable (name/value pair) has a linked list of disciplines attached to it, and at the end of that list there is optionally a shell context pointer. For example, for the EDITOR variable: > ::bp libshell.so.1`put_ed > ::run $ $ EDITOR=vim > ::stack ! head -1 libshell.so.1`put_ed+0x14(e06208, e01c58, 0, dced90) > e06208::print Namval_t { nvname = 0xfffffbffeec40a0e "EDITOR" nvfun = 0xdced90 nvalue = 0 } > e06208::print Namval_t nvfun \| ::print Namfun_t { disc = libshell.so.1`EDITOR_disc next = libshell.so.1`sh+0x710 } Here, the EDITOR Namval_t has a discipline stack containing EDITOR_disc and &Shell_t.nvfun. The problem arises when a new discipline is pushed onto the stack, such as when using typeset -u to add an upper-case translation discipline. $ typeset -u EDITOR > e06208::print Namval_t { nvname = 0xfffffbffeec40a0e "EDITOR" nvfun = 0xdced90 nvalue = 0xe0fdb0 "vim" } > e06208::print Namval_t nvfun \| ::print Namfun_t { disc = libshell.so.1`EDITOR_disc next = 0xdc27a0 } > e06208::print Namval_t nvfun \| ::print Namfun_t next \| ::print Namfun_t { disc = libshell.so.1`TRANS_disc next = 0 } TRANS_disc has been pushed onto the end of the discipline stack, but the shell handle has been lost. With this change, the attributes and variables tests pass (this is on illumos where this change originates).	2021-04-15 14:41:32 +01:00
Martijn Dekker	519bb08265	Allow invoking path-bound built-in commands by direct path or preceding `PATH` assignment (#275 ) Path-bound builtins on ksh (such as /opt/ast/bin/cat) break some basic assumptions about paths in the shell that should hold true, e.g., that a path output by whence -p or command -v should actually point to an executable command. This commit should fix the following: 1. Path-bound built-ins (such as /opt/ast/bin/cat) can now be executed by invoking the canonical path (independently of the value of $PATH), so the following will now work as expected: $ /opt/ast/bin/cat --version version cat (AT&T Research) 2012-05-31 $ (PATH=/opt/ast/bin:$PATH; "$(whence -p cat)" --version) version cat (AT&T Research) 2012-05-31 In the event an external command by that path exists, the path-bound builtin will now override it when invoked using the canonical path. To invoke a possible external command at that path, you can still use a non-canonical path, e.g.: /opt//ast/bin/cat or /opt/ast/./bin/cat 2. Path-bound built-ins will now also be found on a PATH set locally using an assignment preceding the command, so something like the following will now work as expected: $ PATH=/opt/ast/bin cat --version version cat (AT&T Research) 2012-05-31 The builtin is not found by sh_exec() because the search for builtins happens long before invocation-local preceding assignments are processsed. This only happens in sh_ntfork(), before forking, or in sh_fork(), after forking. Both sh_ntfork() and sh_fork() call path_spawn() to do the actual path search, so a check there will cover both cases. This does mean the builtin will be run in the forked child if sh_fork() is used (which is the case on interactive shells with job.jobcontrol set, or always after compiling with SHOPT_SPAWN disabled). Searching for it before forking would mean fundamentally redesigning that function to be basically like sh_ntfork(), so this is hard to avoid. src/cmd/ksh93/sh/path.c: path_spawn(): - Before doing anything else, check if the passed path appears in the builtins tree as a pathbound builtin. If so, run it. Since a builtin will only be found if a preceding PATH assignment temporarily changed the PATH, and that assignment is currently in effect, we can just sh_run() the builtin so a nested sh_exec() invocation will find and run it. - If 'spawn' is not set (i.e. we must return), set errno to 0 and return -2. See the change to sh_ntfork() below. src/cmd/ksh93/sh/xec.c: - sh_exec(): When searching for built-ins and the restricted option isn't active, also search bltin_tree for names beginning with a slash. - sh_ntfork(): Only throw an error if the PID value returned is exactly -1. This allows path_spawn() to return -2 after running a built-in to tell sh_ntfork() to do the right things to restore state. src/cmd/ksh93/sh/parse.c: simple(): - When searching for built-ins at parse time, only exclude names containing a slash if the restricted option is active. This allows finding pointers to built-ins invoked by literal path like /opt/ast/bin/cat, as long as that does not result from an expansion. This is not actually necessary as sh_exec() will also cover this case, but it is an optimisation. src/lib/libcmd/getconf.c: - Replace convoluted deferral to external command by a simple invocation of the path to the native getconf command determined at compile time (by src/lib/libast/comp/conf.sh). Based on: https://github.com/ksh93/ksh/issues/138#issuecomment-816384871 If there is ever a system that has /opt/ast/bin/getconf as its default native external 'getconf', then there would still be an infinite recursion crash, but this seems extremely unlikely. Resolves: https://github.com/ksh93/ksh/issues/138	2021-04-15 04:08:12 +01:00
Johnothan King	2c38fb93fd	Fix the exit status returned when a command isn't executable (#273 ) Previous discussion: https://github.com/att/ast/issues/485 If ksh attempts to execute a non-executable command found in the PATH, in some instances the error message and return status are incorrect. In the example below, ksh returns with exit status 126 when using the -c execve(2) optimization or when using fork(2) in an interactive shell. However, using posix_spawn(3) causes the exit status to change: $ echo 'print cannot execute' > /tmp/x # Runs command with spawnveg (i.e., posix_spawn or vfork) $ ksh -c 'PATH=/tmp; x; echo $?' ksh: x: not found 127 # Runs command with execve $ ksh -c 'PATH=/tmp; x'; echo $? ksh: x: cannot execute [Permission denied] 126 # Runs command with fork $ ksh -ic 'PATH=/tmp; x; echo $?' ksh: x: cannot execute [Permission denied] 126 Since 'x' is in the PATH but can't be executed, the correct exit status is 126, not 127. It's worth noting this bug doesn't cause the regression tests to fail with ksh93u+m, but it does cause one test to fail when run under dtksh: path.sh[706]: Long nonexistent command name: got status 126, '' This commit backports various fixes for this bug from ksh2020, with additional fixes applied (since there were still some additional issues the ksh2020 patch didn't fix). The lacking regression test for exit status 126 in path.sh has been rewritten to test for more scenarios where ksh failed to return the correct error message and/or exit status. I can also confirm with this patch applied the path.sh regression tests now pass when run under dtksh. src/cmd/ksh93/sh/path.c: - Add a comment to path_absolute() describing 'oldpp' is the current pointer in the while loop and 'pp' is the next pointer. Backported from: `a6cad450` - The patch from ksh2020 didn't fix this bug in the SHOPT_SPAWN code (because ksh2020 prefers fork(2)), so issues with the exit status could still occur when using spawnveg. To fix this, always set 'noexec' to the value of errno if can_execute fails. Before this fix, errno was discarded if 'pp' was a null pointer and can_execute failed. - If a command couldn't be executed and the error wasn't ENOENT, save errno in a 'not_executable' variable. If an executable command couldn't be found in the PATH, exit with status 126 and set errno to the saved value. This was based on a ksh2020 bugfix, but it has been reworked a little bit to fix a bug that caused a mismatch between the error message shown and errno. Example with a non-executable file in PATH: $ nonexec ksh2020: nonexec: cannot execute [No such file or directory] The ksh2020 patch: <https://github.com/att/ast/pull/493> - Backport a ksh2020 bugfix for directories in the PATH when running one of the added regression tests on OpenBSD: https://github.com/att/ast/pull/767 src/cmd/ksh93/data/msg.c, src/cmd/ksh93/include/shell.h, src/cmd/ksh93/sh/{path,xec}.c: - If a command name is too long (ENAMETOOLONG), then it wasn't found in the PATH. For that case return exit status 127, like for ENOENT. src/cmd/ksh93/tests/path.sh: - Replace the old test with a new set of more extensive tests. These tests check the error message and exit status when ksh attempts to run a command using any of the following: - execve(2), used with the last command run with -c (A tests). - posix_spawn(3)/vfork(2), used in noninteractive scripts (B tests). - fork(2), used in interactive shells with job control (C tests). - command -x (D tests). - exec(1) (E tests). - Add a regression test from ksh2020 for attempting to execute a directory: https://github.com/att/ast/pull/758 src/lib/libast/include/ast.h, src/lib/libast/include/wait.h: - Avoid bitshifts in macros for static error codes. The return values of command not found and exec related errors are static values and should not require any macro magic for calculation. Backported from: `c073b102` - Simplify EXIT_ and W* macros to use 8 bits.	2021-04-15 03:37:57 +01:00
hyenias	d6ddd89053	Correct memory fault when removing default nameref KSH_VERSION (#271 ) This commit fixes a segmentation fault when an attempt was made to unset the default KSH_VERSION variable prior any other nameref activity such as creating another nameref or even reassigning the nameref KSH_VERSION to something else. (new shell without prior nameref activity) $ nameref KSH_VERSION=.sh.version $ unset -n KSH_VERSION Memory fault src/cmd/ksh93/sh/name.c: _nv_unset(): - Add a 'Refdict' check before attempting to remove a value from it as apparently one does not exist until some sort of nameref activity occurs after shell startup as the default nameref of 'KSH_VERSION=.sh.version' does not create one.	2021-04-13 03:15:34 +01:00
Johnothan King	75796a9c75	Fix += operator regressions (re: `fae8862c`) (#270 ) The bugfix for BUG_CMDSPASGN backported in commit `fae8862c` caused two regressions with the += operator: 1. The += operator did not append to variables. Reproducer: $ integer foo=3 $ foo+=2 command eval 'echo $foo' 2 2. The += operator ignored the readonly attribute, modifying readonly variables in the same manner as above. Reproducer $ readonly bar=str $ bar+=ing command eval 'echo $bar' ing Both of the regressions above were caused by nv_putval() failing to clone the variable from the previous scope into the invocation-local scope. As a result, 'foo+=2' was effectively 0 + 2 (since ksh didn't clone 3). The first regression was noticed during the development of ksh93v-, so to fix both bugs I've backported the bugfix for the regression from the ksh93v- 2013-10-10 alpha version: https://www.mail-archive.com/ast-users@lists.research.att.com/msg00369.html src/cmd/ksh93/sh/name.c: - To fix both of the bugs above, find the variable to modify with nv_search(), then clone it into the invocation local scope. To fix the readonly bug as well, this is done before the NV_RDONLY check (otherwise np will be missing that attribute and be incorrectly modified in the invocation-local scope). - Update a nearby comment describing what sh_assignok() does (per this comment: https://github.com/ksh93/ksh/pull/249#issuecomment-811381759) src/cmd/ksh93/tests/builtins.sh: - Add regression tests for both of the now fixed regressions, loosely based on the regression tests in ksh93v-.	2021-04-12 01:24:33 +01:00
Martijn Dekker	d50d3d7c4c	Reset arithmetic recursion level on all errors (re: `264ba48b`) The recursion level for arithmetic expressions is kept track of in a static 'level' variable in streval.c. It is reset when arithmetic expressions throw an error. But an error for an arithmetic expression may also occur elsewhere -- at least in one case: when an arithmetic expression attempts to change a read-only variable. In that case, the recursion level is never reset because that code does not have access to the static 'level' variable. If many such conditions occur (as in the new readonly.sh regression tests), an arithmetic command like 'i++' may eventually fail with a 'recursion too deep' error. To mitigate the problem, MAXLEVEL in streval.c was changed from 9 to 1024 in `264ba48b` (as in the ksh 93v- beta). This commit leaves that increase, but adds a proper fix. src/cmd/ksh93/include/defs.h: - Add global sh.arithrecursion (a.k.a. shp->arithrecursion) variable to keep track of the arithmetic recursion level, replacing the static 'level' variable in streval.c. src/cmd/ksh93/sh/xec.c: sh_exec(): - Reset sh.arithrecursion before starting a new simple command (TCOM), a new subshell with parentheses (TPAR), a new pipe (TFIL), or a new [[ ... ]] command (TTST). These are the same places where 'echeck' is set to 1 for --errexit and ERR trap checks, so it should cover everything. src/cmd/ksh93/sh/streval.c: - Change all uses of 'level' to sh.arithrecursion. - _seterror, aritherror(): No longer bother to reset the level to zero here; xec.c should have this covered for all cases now. src/cmd/ksh93/tests/arith.sh: - Add tests for main shell and subshell.	2021-04-11 01:25:19 +01:00
Martijn Dekker	f6bc5c03ca	nv_setlist(): add check for readonly (re: `264ba48b`) One area where readonly is still ineffective is the local environment list for a command (preceding assignments) if that command is not executed using exec(3) after fork(2). Builtin commands are one example. The following succeeds but should fail: (readonly v=1; v=2 true) # succeeds, but should fail If the shell is compiled with SHOPT_SPAWN (the default) then this also applies to external commands invoked with sh_ntfork(): (readonly v=1; v=2 env) # succeeds if SHOPT_SPAWN This presents to the user as inconsitent behaviour because external commands may be fork()ed under certain circumstances but not others, depending on complex optimisations. One example is: $ ksh -c 'readonly v=1; v=2 env' ksh: v: is read only $ ksh -c 'readonly v=1; v=2 env; :' (bad: environment list is output, including 'v=2') In the first command above, where 'v2=env' is the last command in the -c script, the optimisation skips creating a scope and assigns the environment list in the current scope. src/cmd/ksh93/sh/name.c: nv_setlist(): - Add check for readonly. This requires searching for the variable in the main tree using nv_search() before a locally scoped one is added using nv_open(). Since nv_search() only works with plain variable names, temporarily end the string at '='. src/cmd/ksh93/tests/readonly.sh: - Add version check and fork the test command substitution subshell on older versions that would otherwise abort the tests due to the combination of an excessively low arithmetic recursion tolerance and a bug that sometimes fails to restore the shell's arithmetic recursion level.	2021-04-10 23:12:18 +01:00
Martijn Dekker	66c37202fd	SHOPT_SPAWN: rm unused job control code (re: `f207cd57`, `41ebb55a`) Since `f207cd57`, sh_ntfork() is never called if job.jobcontrol is set (i.e. if job control is active on an interactive shell), so the code that is only run if job.jobcontrol is set should be removed. src/cmd/ksh93/sh/xec.c: - Remove spawnveg() define that is unused as of `7b0e0776`. - sh_exec(): Simplify SHOPT_SPAWN preprocessor logic. As sh_fork() never returns a negative value, only run the parent<0 check after running sh_ntfork() -- that check already didn't happen when compiling ksh with SHOPT_SPAWN disabled. - sh_ntfork(): Remove signal and terminal handling (with race condition) that was only run with job.jobcontrol set.	2021-04-10 18:10:27 +01:00
Johnothan King	5461f11968	Fix handling of '--posix' and '--default' (#265 ) src/cmd/ksh93/sh/args.c: sh_argopts(): - Remove special-casing for --posix (see also data/builtins.c) and move the case -5: to the case ':' instead, so this option is handled like all other long options. This change fixes two bugs: 1. 'set --posix' had no effect on the letoctal or braceexpand options. Reproducer: $ set --posix $ [[ -o braceexpand ]]; echo $? 0 $ [[ -o letoctal ]]; echo $? 1 2. 'ksh --posix' could not run scripts correctly because it wrongly enabled '-c'. Reproducer: $ ksh --posix < <(echo 'exit 0') ksh: -c requires argument Usage: ksh [--posix] [arg ...] Help: ksh [ --help \| --man ] 2>&1 - Don't allow 'set --default' to unset the restricted option. src/cmd/ksh93/tests/options.sh: - Add regression tests for the bugs described above, using -o posix and --posix. src/cmd/ksh93/tests/restricted.sh: - Add a regression test for 'set --default' in rksh. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-04-09 23:26:07 +01:00
Martijn Dekker	cb67a01b45	lex.c: simplify fmttoken() by using the stack (re: `3255aed2`) Using the stack makes it impossible for future buffer overflows to occur. It also simplifies fmttoken() by eliminating the need to declare a local buffer and pass a pointer to that as an argument. For info: man src/lib/libast/man/stak.3	2021-04-09 17:36:29 +01:00
hyenias	3255aed2c4	lex.c: Fix buffer overflow in debug sh_lex and sh_syntax (#262 ) fmttoken() needs a minimal char[4] token buffer passed to it. Originally reported by: Jakub Wilk <jwilk@jwilk.net> Original bug report: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=879464 The following code lines from fmttoken() yield a n=3 for SYMSEMI as n=1 from the start, e.g. 'for <>;'. case SYMSEMI: if(tok[0]=='<') tok[n++] = '>'; sym = ';'; break; default: sym = 0; } tok[n++] = sym; } tok[n] = 0; n[0]='<' n[1]='>' n[2]=';' n[3]=0 # <-- BUFFER overflow as the passed character buffers have a size of 3 src/cmd/ksh93/sh/lex.c: - DBUG: sh_lex(): Adjust char tokstr[3] to char tokstr[4] - sh_syntax(): Adjust char tokbuf[3] to char tokbuf[4]	2021-04-09 02:47:21 +01:00
Johnothan King	a065558291	Fix more compiler warnings, typos and other minor issues (#260 ) Many of these changes are minor typo fixes. The other changes (which are mostly compiler warning fixes) are: NEWS: - The --globcasedetect shell option works on older Linux kernels when used with FAT32/VFAT file systems, so remove the note about it only working with 5.2+ kernels. src/cmd/ksh93/COMPATIBILITY: - Update the documentation on function scoping with an addition from ksh93v- (this does apply to ksh93u+). src/cmd/ksh93/edit/emacs.c: - Check for '_AST_ksh_release', not 'AST_ksh_release'. src/cmd/INIT/mamake.c, src/cmd/INIT/ratz.c, src/cmd/INIT/release.c, src/cmd/builtin/pty.c: - Add more uses of UNREACHABLE() and noreturn, this time for the build system and pty. src/cmd/builtin/pty.c, src/cmd/builtin/array.c, src/cmd/ksh93/sh/name.c, src/cmd/ksh93/sh/nvtype.c, src/cmd/ksh93/sh/suid_exec.c: - Fix six -Wunused-variable warnings (the name.c nv_arrayptr() fixes are also in ksh93v-). - Remove the unused 'tableval' function to fix a -Wunused-function warning. src/cmd/ksh93/sh/lex.c: - Remove unused 'SHOPT_DOS' code, which isn't enabled anywhere. https://github.com/att/ast/issues/272#issuecomment-354363112 src/cmd/ksh93/bltins/misc.c, src/cmd/ksh93/bltins/trap.c, src/cmd/ksh93/bltins/typeset.c: - Add dictionary generator function declarations for former aliases that are now builtins (re: `1fbbeaa1`, `ef1621c1`, `3ba4900e`). - For consistency with the rest of the codebase, use '(void)' instead of '()' for print_cpu_times. src/cmd/ksh93/sh/init.c, src/lib/libast/path/pathshell.c: - Move the otherwise unused EXE macro to pathshell() and only search for 'sh.exe' on Windows. src/cmd/ksh93/sh/xec.c, src/lib/libast/include/ast.h: - Add an empty definition for inline when compiling with C89. This allows the timeval_to_double() function to be inlined. src/cmd/ksh93/include/shlex.h: - Remove the unused 'PIPESYM2' macro. src/cmd/ksh93/tests/pty.sh: - Add '# err_exit #' to count the regression test added in commit `113a9392`. src/lib/libast/disc/sfdcdio.c: - Move diordwr, dioread, diowrite and dioexcept behind '#ifdef F_DIOINFO' to fix one -Wunused-variable warning and multiple -Wunused-function warnings (sfdcdio() only uses these functions when F_DIOINFO is defined). src/lib/libast/string/fmtdev.c: - Fix two -Wimplicit-function-declaration warnings on Linux by including sys/sysmacros.h in fmtdev().	2021-04-08 19:58:07 +01:00
Martijn Dekker	ecf260c282	SHOPT_SPAWN: Fix 'not found' error message inconsistency There's an annoying inconsistency in error messages if ksh is compiled with SHOPT_SPAWN. One way to trigger it: $ /usr/local/bin/ksh -c '/tmp/nonexistent' /usr/local/bin/ksh: /tmp/nonexistent: not found $ /usr/local/bin/ksh -c '/tmp/nonexistent; :' /usr/local/bin/ksh: /tmp/nonexistent: not found [No such file or directory] In the first variant, as an optimisation, ksh went straight to exec'ing the command without forking first. In the second variant, sh_ntfork() was used. The first variant is done in path_exec(), path.c, line 1049: errormsg(SH_DICT,ERROR_exit(ERROR_NOENT),e_found,arg0); The second one is in sh_ntfork(), xec.c, line 3654: errormsg(SH_DICT,ERROR_system(ERROR_NOENT),e_found+4); In both cases, the e_found message is only used if errno==ENOENT, so the extra '[No such file or directory]' message generated by ERROR_system() is pointless as that will never change for that message. src/cmd/ksh93/sh/xec.c: sh_ntfork(): - Use ERROR_exit() instead of ERROR_system() for the e_found message to avoid the superfluous addition.	2021-04-08 16:46:47 +01:00
Martijn Dekker	2e5b625915	Allow path-bound builtins on restricted shells If a system administrator prefixes /opt/ast/bin to the path and then invokes the shell in restricted mode, they clearly intend for the user to run those AST utilities. Similarly, if a system administrator sets a PATH for a restricted shell that includes libraries listed in the .paths file, they must have intended for the user to use those loadable built-ins, as they will be associated with the pathnames of their respective libraries. Since the user cannot change PATH or use the builtin command, they still cannot load just any built-in they choose. src/cmd/ksh93/sh/path.c: - Remove SH_RESTRICTED check when handling path-bound builtins or dynamic libaries containining builtins in $PATH. src/cmd/ksh93/tests/builtins.sh: - Add test verifying a restricted user can use /opt/ast/bin/cat via a PATH search. Progresses: https://github.com/ksh93/ksh/issues/138	2021-04-08 14:48:29 +01:00
Johnothan King	0cd8646361	Backport bugfix for BUG_CSUBSTDO from ksh93v- 2012-08-24 (#259 ) This commit fixes BUG_CSUBSTDO, which could break stdout inside of non-forking command substitutions. The breakage only occurred when stdout was closed outside of the command substitution and a file descriptor other than stdout was redirected in the command substitution (such as stderr). Thanks to the ast-open-history repo, I was able to identify and backport the bugfix from ksh93v- 2012-08-24. This backport may fix other bugs as well. On 93v- 2012-08-24 it fixed the regression below, though it was not triggered on 93u+(m). src/cmd/ksh93/tests/heredoc.sh 487 print foo > $tmp/foofile 488 x=$( $SHELL 2> /dev/null 'read <<< $(<'"$tmp"'/foofile) 2> /dev/null;print -r "$REPLY"') 489 [[ $x == foo ]] \|\| err_exit '<<< $(<file) not working' src/cmd/ksh93/sh/io.c: sh_open(): - If the just-opened file descriptor exists in sftable and is flagged with SF_STRING (as in non-forking command substitutions, among other situations), then move the file descriptor to a number >= 10. src/cmd/ksh93/tests/io.sh: - Add a regression test for BUG_CSUBSTDO, adapted from the one in modernish.	2021-04-08 13:24:17 +01:00
Johnothan King	b2a7ec032f	Add LC_TIME to the supported locale variables (#257 ) The current version of 93u+m does not have proper support for the LC_TIME variable. Setting LC_TIME has no effect on printf %T, and if the locale is invalid no error message is shown: $ LC_TIME=ja_JP.UTF-8 $ printf '%T\n' now Wed Apr 7 15:18:13 PDT 2021 $ LC_TIME=invalid.locale $ # No error message src/cmd/ksh93/data/variables.c, src/cmd/ksh93/include/variables.h, src/cmd/ksh93/sh/init.c: - Add support for the $LC_TIME variable. ksh93v- attempted to add support for LC_TIME, but the patch from that version was extended because the variable still didn't function correctly. src/cmd/ksh93/tests/variables.sh: - Add LC_TIME to the regression tests for LC_* variables.	2021-04-08 13:06:22 +01:00
Martijn Dekker	d0a5cab1ab	cleanup: remove another old and unused experiment This experiment, the initialisation of which was disabled with '#if 0', defines a bunch of integer type commands as special builtins. Most are boring as they define variables just like normal integers: pid_t, size_t, etc. One is interesting: mode_t is a type that automatically converts from a octal permission bits (e.g. 755) to a mode string like u+rwx,g+rw,o+rw. That's not a compelling enough use case to permanently define a special and immutable builtin though. stat_t is odd: it takes a file name as an argument and fills the variable with stat information, but it is base64 encoded binary data and there doesn't seem to be anything that can parse it. Anyway, none of this is going to be enabled, so we should get rid.	2021-04-08 05:28:20 +01:00
Martijn Dekker	997ad43bbf	Properly fix $LINENO crash on ARM (re: `23b7a163`) and other bugs The typecast fix was insufficient, avoiding the crash only when compiling with optimisation disabled. The real problem is that put_lineno() was passed a misaligned pointer, and that the value didn't actually contain a double but a string. The bug occurred when restoring the LINENO value upon exiting a virtual subshell. Thanks to Harald van Dijk for figuring out the fix. src/cmd/ksh93/sh/subshell.c: nv_restore(): - When restoring a special variable as defined by nv_cover(), do not pass either the np->nvflag bits or NV_NOFREE. Why? * The np->nvflag bits are not needed. They are also harmful because they may include the NV_INTEGER bit. This is set when the value is numeric. However, nv_getval() always returns the value in string form, converting it if it is numeric. So the NV_INTEGER flag should never be passed to nv_putval() when it uses the result of nv_getval(). * According to nval.3, the NV_NOFREE flag stops nv_putval() from creating a copy of the value. But this should be unnecessary because the earlier _nv_unset(mp,NV_RDONLY\|NV_CLONE) should ensure there is no previous value. In addition, the NV_NOFREE flag triggered another bug that caused the value of SECONDS to be corrupted upon restoring it when exiting a virtual subshell. - When restoring a regular variable, copy the entire nvalue union and not just the 'cp' member. In practice this worked because no current member of the nvalue union is larger than a pointer. However, there is no guarantee it will stay that way. src/cmd/ksh93/tests/leaks.sh: - Add disabled test for a memory leak that was discovered in the course of dealing with this bug. The fix doesn't introduce or influence it. It will have to be dealt with later. src/cmd/ksh93/tests/locale.sh: - Add test for restoring locale on leaving virtual subshell. https://github.com/ksh93/ksh/issues/253#issuecomment-815290154 src/cmd/ksh93/tests/variables.sh: - Test against corruption of SECONDS on leaving virtual subshell. https://github.com/ksh93/ksh/issues/253#issuecomment-815191052 Co-authored-by: Harald van Dijk <harald@gigawatt.nl> Progresses: https://github.com/ksh93/ksh/issues/253	2021-04-08 00:56:09 +01:00
Martijn Dekker	23b7a163f7	Fix implicit typecast mess in $LINENO discipline functions On Ubuntu arm7, two variables.sh regression tests crashed with a bus error (SIGBUS) in init.c on line 720 while testing $LINENO: 707 static void put_lineno(Namval_t* np,const char val,int flags,Namfun_t fp) 708 { 709 register long n; 710 Shell_t shp = sh_getinterp(); 711 if(!val) 712 { 713 fp = nv_stack(np, NIL(Namfun_t)); 714 if(fp && !fp->nofree) 715 free((void)fp); 716 _nv_unset(np,NV_RDONLY); 717 return; 718 } 719 if(flags&NV_INTEGER) 720 n = (double*)val; 721 else 722 n = sh_arith(shp,val); 723 shp->st.firstline += nget_lineno(np,fp)+1-n; 724 } Apparently, gcc on arm7 doesn't like the implicit typecast from double to long. Those three $LINENO discipline functions are generally a mess of implicit typecasts between Sfdouble_t, double, long and int. Line numbers are internally stored as int. The discipline functions need to use Sfdouble_t for API compatibility. src/cmd/ksh93/sh/init.c: nget_lineno(), put_lineno(), get_lineno(): - Get rid of unnecessary implicit typecasts by adjusting the types of local variables. - Make the typecasts that are done explicit. Progresses: https://github.com/ksh93/ksh/issues/253	2021-04-07 15:53:23 +01:00
Martijn Dekker	6b9703ffdd	Backport bugfixes for arrays of 'enum' types from ksh 93v- beta These fixes are applied rather blindly as no one has yet managed to understand the almost entirely uncommented arrays and variables handling code (arrays.c, name.c, nvdisc.c, nvtree.c, nvtype.c). Hopefully we'll figure all that out at some point. In the meantime these backported fixes appear to work fine, and these bugs impact the usability of 'enum', so I'm just going to have to violate my own policy and backport these fixes without understanding them. Thanks to @JohnoKing for putting in a lot of work tracing these. Further discussion at: https://github.com/ksh93/ksh/issues/87 src/cmd/ksh93/sh/array.c: - nv_arraysettype(): * Further simplify the function. After my initial simplification of it (re: `5491fe97`), I don't believe there's actually a need to save a duplicate copy of the value. Use the pointer returned by nv_getval() directly to restore the value. * Cope with a null value (nv_getval() returning a NULL pointer). This is needed for compatibility with the backported fix in nvtype.c (below). - array_putval(): If the array's value pointer (up->cp) is a pointer to the empty string, it is set to NULL before calling nv_putv() to prevent an empty string from being deleted. Backport a fix from 93v- that restores the pointer to the empty string if the NV_NOFREE attribute is set. Removing it somehow causes these regressions: enum.sh[86]: ${array[@]} doesn't yield all values for associative enum arrays (expected 'green blue blue red yellow green red orange'; got 'green blue blue yellow green orange') enum.sh[94]: unsetting associative enum array does not work (got 'Color_t -A Colors=([foo]=red [rood]=red)') enum.sh[116]: assigning first enum element to indexed array failed (expected 'red red'; got 'BUG BUG') - nv_associative(): Do not increase the 'nelem' (number of elements) value of the array's 'header' struct if the array is associative and of an enum type. The original 93v- fix only checked for the NV_INTEGER attribute, but backporting that caused several regressions. Using a debug output command I've determined that the exact value of 'type' is somehow consistently set to 0x26 if the array is associative and of an enum type, which is NV_INTEGER \| NV_LTOU \| NV_RJUST as defined in include/nval.h. I cannot find where/how that value is determined. In any case this fix, based on but more specific than the 93v- one, appears to work fine. Removing it somehow causes this regression: enum.sh[94]: unsetting associative enum array does not work (got 'Color_t -A Colors=()') src/cmd/ksh93/sh/nvtype.c: nv_settype(): - Another fix backported from 93v-. If the variable is an array, also set the type of element 0 of that array using a call to nv_arraysettype(). The value may be null. Removing this somehow causes this regression: enum.sh[94]: unsetting associative enum array does not work (got 'Color_t -A Colors=()') src/cmd/ksh93/tests/enum.sh: - Add tests for all the bugs fixed here, plus some hypothetical bugs (e.g., do the same tests for indexed enum type arrays as for associative enum type arrays, even though indexed enum type arrays didn't have all the same problems). Co-authored-by: Johnothan King <johnothanking@protonmail.com> Resolves: https://github.com/ksh93/ksh/issues/87	2021-04-06 06:33:32 +01:00
Martijn Dekker	db2b1affdf	Fix unsetting array element after expanding array subscript range Simple reproducer: set -A arr a b c d; : ${arr[1..2]}; unset arr[1]; echo ${arr[@]} Output: a Expected output: a c d The ${arr[1..2]} expansion broke the subsequent 'unset' command so that it unsets element 1 and on, instead of only 1. This regression was introduced in nv_endsubscript() on 2009-07-31: `c47896b4/src/cmd/ksh93/sh/array.c` That change checks for the ARRAY_SCAN attribute which enables processing ranges of array elements instead of single array elements, and restores it after. That restore is evidently not correct as it causes the subsequent unset command to malfunction. If we revert that change, the bug disappears and the regression tests show no failures. However, I don't know what this was meant to accomplish and what other bug we might introduce by reverting this. However, no corresponding regression test was added along with the 2009-07-31 change, nor is there any corresponding message in the changelog. So this looks to be one of those mystery changes that we'll never know the reason for. Since we currently have proof that this change causes breakage and no evidence that it fixes anything, I'll go ahead and revert it (and add a regression test, of course). If that causes another regression, hopefully someone will find it at some point. src/cmd/ksh93/sh/array.c: nv_endsubscript(): - Revert the 2009-07-31 change that saves/restores the ARRAY_SCAN attribute. - Keep the 'ap' pointer as it is now used by newer code. Move the declaration up to the beginning of the block, as is customary. src/cmd/ksh93/sh/init.c: - Cosmetic change: remove an unused array_scan() macro that I found when grepping the code for ARRAY_SCAN. The macro was introduced in version 2001-06-01 but the code that used it was replaced in version 2001-07-04, without removing the macro itself. Resolves: https://github.com/ksh93/ksh/issues/254	2021-04-05 22:16:57 +01:00
hyenias	264ba48bdd	Hardening of readonly variables (#239 ) Ksh currently restricts readonly scalar variables from having their values directly changed via a value assignment. However, since ksh allows variable attributes to be altered, the variable's value can be indirectly altered. For instance, if TMOUT=900 (for a 15 minute idle timeout) was set to readonly, all that is needed to alter the value of TMOUT from 900 to 0 is to issue 'typeset -R1 TMOUT', perhaps followed by a 'typeset -i TMOUT' to turn off the shell's timeout value. In addition, there are problems with arrays. The following is incorrectly allowed: typeset -a arr=((a b c) 1) readonly arr arr[0][1]=d arr=(alphas=(a b c);name=x) readonly arr.alphas arr.alphas[1]=([b]=5) arr=(alphas=(a b c);name=x) readonly arr.alphas arr.alphas[1]=(b) typeset -C arr=(typeset -r -a alphas=(a b c);name=x) arr.alphas[1]=() src/cmd/ksh93/bltins/typeset.c: setall(): - Relocate readonly attribute check higher up the code and widen its application to issue an error message if the pre-existing name-pair has the readonly bit flag set. - To avoid compatibility problems, don't check for readonly if NV_RDONLY is the only attribute set (ignoring NV_NOFREE). This allows 'readonly foo; readonly foo' to keep working. src/cmd/ksh93/sh/array.c: nv_endsubscript(): - Apply a readonly flag check when an array subscript or append assignment occurs, but allow type variables (typeset -T) as they utilize '-r' for 'required' sub-variables. src/cmd/ksh93/tests/readonly.sh: - New file. Create readonly tests that validate the warning message and validate that the readonly variable did not change. src/cmd/ksh93/sh/streval.c: - Bump MAXLEVEL from 9 to 1024 as a workaround for arithmetic expansion, avoiding a spurious error about too much recursion when the readonly.sh tests are run. This change is backported from ksh 93v-. TODO: debug a spurious increase in arithmetic recursion level variable when readonly.sh tests with 'typeset -i' are run. That is a different bug for a different commit. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-04-05 06:43:19 +01:00
Johnothan King	c4f980eb29	Introduce usage of __builtin_unreachable() and noreturn (#248 ) This commit adds an UNREACHABLE() macro that expands to either the __builtin_unreachable() compiler builtin (for release builds) or abort(3) (for development builds). This is used to mark code paths that are never to be reached. It also adds the 'noreturn' attribute to functions that never return: path_exec(), sh_done() and sh_syntax(). The UNREACHABLE() macro is not added after calling these. The purpose of these is: * to slightly improve GCC/Clang compiler optimizations; * to fix a few compiler warnings; * to add code clarity. Changes of note: src/cmd/ksh93/sh/io.c: outexcept(): - Avoid using __builtin_unreachable() here since errormsg can return despite using ERROR_system(1), as shp->jmplist->mode is temporarily set to 0. See: https://github.com/att/ast/issues/1336 src/cmd/ksh93/tests/io.sh: - Add a regression test for the ksh2020 bug referenced above. src/lib/libast/features/common: - Detect the existence of either the C11 stdnoreturn.h header or the GCC noreturn attribute, preferring the former when available. - Test for the existence of __builtin_unreachable(). Use it for release builds. On development builds, use abort() instead, which crahses reliably for debugging when unreachable code is reached. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-04-05 00:28:24 +01:00
Johnothan King	ca2443b58c	`cd -` shouldn't ignore `$OLDPWD` when in a new scope (#249 ) This bug was first reported at <https://github.com/att/ast/issues/8>. The 'cd' command currently takes the value of $OLDPWD from the wrong scope. In the following example 'cd -' will change the directory to /bin instead of /tmp: $ OLDPWD=/bin ksh93 -c 'OLDPWD=/tmp cd -' /bin src/cmd/ksh93/bltins/cd_pwd.c: - Use sh_scoped() to obtain the correct value of $OLDPWD. - Fix a use-after-free bug. Make the 'oldpwd' variable a static char that points to freeable memory. Each time cd is used, this variable is freed if it points to a freeable memory address and isn't also a pointer to shp->pwd. src/cmd/ksh93/sh/path.c: path_pwd(): - Simplify and add comments. - Scope $PWD properly. src/cmd/ksh93/tests/builtins.sh, src/cmd/ksh93/tests/leaks.sh: - Backport the ksh2020 regression tests for 'cd -' when $OLDPWD is set. - Add test for $OLDPWD and $PWD after subshare. - Add test for $PWD after 'cd'. - Add test for possible memory leak. - Add testing for 'unset' on OLDPWD and PWD. src/cmd/ksh93/COMPATIBILITY: - Add compatibility note about changes to $PWD and $OLDPWD. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-04-02 01:19:19 +01:00
Johnothan King	ed478ab7e3	Fix many GCC `-Wimplicit-fallthrough` warnings (#243 ) This commit adds '/* FALLTHROUGH */' comments to fix many GCC warnings when compiling with -Wimplicit-fallthrough. Additionally, the existing fallthrough comments have been changed for consistency.	2021-03-30 21:49:20 +01:00
Martijn Dekker	21d591dbd8	parse.c: rm overlooked SHOPT_BASH stuff (re: `921bbcae`) That bit of code supported bash's redundant 'function foo()' function declaration syntax (with both the 'function' keyword and the '()') which is a syntax error on ksh, as it should be.	2021-03-23 20:03:18 +00:00
Martijn Dekker	71934570bf	Add --globcasedetect shell option for globbing and completion One of the best-kept secrets of libast/ksh93 is that the code includes support for case-insensitive file name generation (a.k.a. pathname expansion, a.k.a. globbing) as well as case-insensitive file name completion on interactive shells, depending on whether the file system is case-insensitive or not. This is transparently determined for each directory, so a path pattern that spans multiple file systems can be part case-sensitive and part case- insensitive. In more precise terms, each slash-separated path name component pattern P is treated as ~(i:P) if its parent directory exists on a case-insensitive file system. I recently discovered this while dealing with <https://github.com/ksh93/ksh/issues/223>. However, that support is dead code on almost all current systems. It depends on pathconf(2) having a _PC_PATH_ATTRIBUTES selector. The 'c' attribute is supposedly returned if the given directory is on a case insensitive file system. There are other attributes as well (at least 'l', see src/lib/libcmd/rm.c). However, I have been unable to find any system, current or otherwise, that has _PC_PATH_ATTRIBUTES. Google and mailing list searches yield no relevant results at all. If anyone knows of such a system, please add a comment to this commit on GitHub, or email me. An exception is Cygwin/Windows, on which the "c" attribute was simply hardcoded, so globbing/completion is always case- insensitive. As of Windows 10, that is wrong, as it added the possibility to mount case-sensitive file systems. On the other hand, this was never activated on the Mac, even though macOS has always used a case-insensitive file like Windows. But, being UNIX, it can also mount case-sensitive file systems. Finally, Linux added the possibility to create individual case- insensitive ext4 directories fairly recently, in version 5.2. https://www.collabora.com/news-and-blog/blog/2020/08/27/using-the-linux-kernel-case-insensitive-feature-in-ext4/ So, since this functionality latently exists in the code base, and three popular OSs now have relevant file system support, we might as well make it usable on those systems. It's a nice idea, as it intuitively makes sense for globbing and completion behaviour to auto-adapt to file system case insensitivity on a per-directory basis. No other shell does this, so it's a nice selling point, too. However, the way it is coded, this is activated unconditionally on supported systems. That is not a good idea. It will surprise users. Since globbing is used with commands like 'rm', we do not want surprises. So this commit makes it conditional upon a new shell option called 'globcasedetect'. This option is only compiled into ksh on systems where we can actually detect FS case insensitivity. To implement this, libast needs some public API additions first. * libast changes * src/lib/libast/features/lib: - Add probes for the linux/fs.h and sys/ioctl.h headers. Linux needs these to use ioctl(2) in pathicase(3) (see below). src/lib/libast/path/pathicase.c, src/lib/libast/include/ast.h, src/lib/libast/man/path.3, src/lib/libast/Mamfile: - Add new pathicase(3) public API function. This uses whatever OS-specific method it can detect at compile time to determine if a particular path is on a case-insensitive file system. If no method is available, it only sets errno to ENOSYS and returns -1. Currently known to work on: macOS, Cygwin, Linux 5.2+, QNX 7.0+. - On systems (if any) that have the mysterious _PC_PATH_ATTRIBUTES selector for pathconf(2), call astconf(3) and check for the 'c' attribute to determine case insensitivity. This should preserve compatibility with any such system. src/lib/libast/port/astconf.c: - dynamic[]: As case-insensitive globbing is now optional on all systems, do not set the 'c' attribute by default on _WINIX (Cygwin/Windows) systems. - format(): On systems that do not have _PC_PATH_ATTRIBUTES, call pathicase(3) to determine the value for the "c" (case insensitive) attribute only. This is for compatibility as it is more efficient to call pathicase(3) directly. src/lib/libast/misc/glob.c, src/lib/libast/include/glob.h: - Add new GLOB_DCASE public API flag to glob(3). This is like GLOB_ICASE (case-insensitive matching) except it only makes the match case-insensitive if the file system for the current pathname component is determined to be case-insensitive. - gl_attr(): For efficiency, call pathicase(3) directly instead of via astconf(3). - glob_dir(): Only call gl_attr() to determine file system case insensitivity if the GLOB_DCASE flag was passed. This makes case insensitive globbing optional on all systems. - glob(): The options bitmask needs to be widened to fit the new GLOB_DCASE option. Define this centrally in a new GLOB_FLAGMASK macro so it is easy to change it along with GLOB_MAGIC (which uses the remaining bits for a sanity check bit pattern). src/lib/libast/path/pathexists.c: - For efficiency, call pathicase(3) directly instead of via astconf(3). * ksh changes * src/cmd/ksh93/features/options, src/cmd/ksh93/SHOPT.sh: - Add new SHOPT_GLOBCASEDET compile-time option. Set it to probe (empty) by default so that the shell option is compiled in on supported systems only, which is determined by new iffe feature test that checks if pathicase(3) returns an ENOSYS error. src/cmd/ksh93/data/options.c, src/cmd/ksh93/include/shell.h: - Add -o globcasedetect shell option if compiling with SHOPT_GLOBCASEDET. src/cmd/ksh93/sh/expand.c: path_expand(): - Pass the new GLOB_DCASE flag to glob(3) if the globcasedetect/SH_GLOBCASEDET shell option is set. src/cmd/ksh93/edit/completion.c: - While file listing/completion is based on globbing and automatically becomes case-insensitive when globbing does, it needs some additional handling to make a string comparison case-insensitive in corresponding cases. Otherwise, partial completions may be deleted from the command line upon pressing tab. This code was already in ksh 93u+ and just needs to be made conditional upon SHOPT_GLOBCASEDET and globcasedetect. - For efficiency, call pathicase(3) directly instead of via astconf(3). src/cmd/ksh93/sh.1: - Document the new globcasedetect shell option.	2021-03-22 18:45:19 +00:00
Johnothan King	814b5c6890	Fix various minor problems and update the documentation (#237 ) These are minor fixes I've accumulated over time. The following changes are somewhat notable: - Added a missing entry for 'typeset -s' to the man page. - Add strftime(3) to the 'see also' section. This and the date(1) addition are meant to add onto the documentation for 'printf %T'. - Removed the man page the entry for ksh reading $PWD/.profile on login. That feature was removed in commit `aa7713c2`. - Added date(1) to the 'see also' section of the man page. - Note that the 'hash' command can be used instead of 'alias -t' to workaround one of the caveats listed in the man page. - Use an 'out of memory' error message rather than 'out of space' when memory allocation fails. - Replaced backticks with quotes in some places for consistency. - Added missing documentation for the %P date format. - Added missing documentation for the printf %Q and %p formats (backported from ksh2020: https://github.com/att/ast/pull/1032). - The comments that show each builtin's options have been updated.	2021-03-21 14:39:03 +00:00
Martijn Dekker	7b0e0776e2	cleanup: remove legacy code for systems without fork(2) In 2021, it seems like it's about time to join the 21st century and officially require fork(2). In practice this was already the case as the legacy code was unmaintained and didn't compile.	2021-03-21 06:39:32 +00:00
Martijn Dekker	38f2b94f55	Some more #ifdef cleanups src/cmd/ksh93/edit/edit.c, src/cmd/ksh93/edit/history.c, src/cmd/ksh93/sh/deparse.c: - Remove experimental code protected by '#ifdef future'. No one is going to do anything with this, it's just clutter. src/lib/libast/sfio/sfcvt.c: - In 2021, it might be time to actually start using some C99 features were available. Change two checks for a _c99_in_the_wild macro to actual checks for C99, enabling the use of fpclassify(). Resolves: https://github.com/ksh93/ksh/issues/219	2021-03-21 06:39:32 +00:00
Martijn Dekker	0b814b53bd	Remove more legacy libast code (re: `f9c127e3`, `651bbd56`) This removes #ifdefs checking for the existence of SH_PLUGIN_VERSION (version check for dynamically loaded builtins) and the SFIO identifiers SF_BUFCONST, SF_CLOSING, SF_APPENDWR, SF_ATEXIT, all of which are defined by the bundled libast.	2021-03-21 06:39:32 +00:00
Martijn Dekker	936a1939a8	Allow proper tilde expansion overrides (#225 ) Until now, when performing any tilde expansion like ~/foo or ~user/foo, ksh added a placeholder built-in command called '.sh.tilde', ostensibly with the intention to allow users to override it with a shell function or custom builtin. The multishell ksh93 repo <https://github.com/multishell/ksh93/> shows this was added sometime between 2002-06-28 and 2004-02-29. However, it has never worked and crashed the shell. This commit replaces that with something that works. Specific tilde expansions can now be overridden using .set or .get discipline functions associated with the .sh.tilde variable (see manual, Discipline Functions). For example, you can use either of: .sh.tilde.set() { case ${.sh.value} in '~tmp') .sh.value=${XDG_RUNTIME_DIR:-${TMPDIR:-/tmp}} ;; '~doc') .sh.value=~/Documents ;; '~ksh') .sh.value=/usr/local/src/ksh93/ksh ;; esac } .sh.tilde.get() { case ${.sh.tilde} in '~tmp') .sh.value=${XDG_RUNTIME_DIR:-${TMPDIR:-/tmp}} ;; '~doc') .sh.value=~/Documents ;; '~ksh') .sh.value=/usr/local/src/ksh93/ksh ;; esac } src/cmd/ksh93/include/variables.h, src/cmd/ksh93/data/variables.c: - Add SH_TILDENOD for a new ${.sh.tilde} predefined variable. It is initially unset. src/cmd/ksh93/sh/macro.c: - sh_btilde(): Removed. - tilde_expand2(): Rewritten. I started out with the tiny version of this function from the 2002-06-28 version of ksh. It uses the stack instead of sfio, which is more efficient. A bugfix for $HOME == '/' was retrofitted so that ~/foo does not become //foo instead of /foo. The rest is entirely new code. To implement the override functionality, it now checks if ${.sh.tilde} has any discipline function associated with it. If it does, it assigns the tilde expression to ${.sh.tilde} using nv_putval(), triggering the .set discipline, and then reads it back using nv_getval(), triggering the .get discipline. The resulting value is used if it is nonempty and does not still start with a tilde. src/cmd/ksh93/bltins/typeset.c, src/cmd/ksh93/tests/builtins.sh: - Since ksh no longer adds a dummy '.sh.tilde' builtin, remove the ad-hoc hack that suppressed it from the output of 'builtin'. src/cmd/ksh93/tests/tilde.sh: - Add tests verifying everything I can think of, as well as tests for bugs found and fixed during this rewrite. src/cmd/ksh93/tests/pty.sh: - Add test verifying that the .sh.tilde.set() discipline does not modify the exit status value ($?) when performing tilde expansion as part of tab completion. src/cmd/ksh93/sh.1: - Instead of "tilde substitution", call the basic mechanism "tilde expansion", which is the term used everywhere else (including the 1995 Bolsky/Korn ksh book). - Document the new override feature. Resolves: https://github.com/ksh93/ksh/issues/217	2021-03-17 21:07:14 +00:00
Martijn Dekker	595a0a5684	Revert "Backport atomic job locking from ksh 93v- beta" (`52067c3d`) That patch broke the build on Cygwin, where gcc apparently doesn't have the required atomic addition/subtraction compiler builtins. The build fails at link time with those functions not found. As far as I know, ksh was actually working fine (after @JohnoKing's gcc workaround in `c258a04f`), so I'll just revert this for now. If a need for it is demonstrated later, we'll have to add a feature test or find some other way to get it working on Cygwin.	2021-03-17 14:35:15 +00:00
Martijn Dekker	44438725b1	sh_done(): fix portable exit status logic (re: `d024d4c8`) "savxit -= SH_EXITSIG + 128;" may have worked accidentally due to subsequent bitmasking, but is blatantly wrong . It subtracts 256 + 128 = 384 from the exit status. Use bitwise logic instead, with an octal literal 0200 instead of 128. This makes more sense in this context.	2021-03-17 09:33:23 +00:00
Johnothan King	14352ba0a7	Save $? when discipline triggered without command (#226 ) A discipline function could incorrectly influence the value of $? (exit status of last command) outside its context if it was triggered without another command being run, e.g. when a prompt variable is read, or COLUMNS or LINES is set. Reproducers include: PS1 prompt: $ PS1.get() { true; } $ false $ echo $? 0 PS2 prompt: $ PS2.get() { return 13; } $ \ > $ echo $? 13 The set discipline is affected too, e.g. COLUMNS and LINES: $ COLUMNS.set() { return 13; } $ true $ (press return) $ echo $? 13 There are probably other contexts where the shell reads or changes variables without running commands, allowing their get or set disciplines to influence $?. So this commit makes ksh save $? for all .get, .set, .append, and .unset discipline calls. src/cmd/ksh93/sh/nvdisc.c: - assign(): Save/restore $? when running a .set/.append/.unset discipline function. - lookup(): Save/restore $? when running a .get discipline. src/cmd/ksh93/tests/pty.sh: - Add a regression test for $? after displaying a prompt and when setting a LINES.set discipline function. src/cmd/ksh93/tests/return.sh: - The above test fails in script form on ksh93u+ and ksh2020, as it exposes another form of #117 that occurs after running a subshell. Add the above regression test here as well (re: `092b90da`). Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-03-16 16:13:13 +00:00
Martijn Dekker	1df6a82a8a	Make ~ expand to home directory after unsetting HOME There was an issue with tilde expansion if the HOME var is unset. $ unset HOME $ echo ~ martijn Only the username is returned. Users are more likely to expect the current user's home directory as configured in the OS. POSIXly, the expansion of ~ is based on the value of HOME. If HOME is unset, the results are unspecified. After unsetting HOME, in bash, ~ returns the user's home directory as specified by the OS, whereas in all other shells, ~ expands to the empty string. Only ksh93 returns the username. The behaviour of bash is more useful. Discussion: https://github.com/ksh93/ksh/pull/225#issuecomment-799074107 src/cmd/ksh93/sh/macro.c, src/cmd/ksh93/tests/tilde.sh: - sh_tilde(): Backport fix by Mike Gilbert from ksh2020. See: https://github.com/att/ast/issues/1391 https://github.com/att/ast/pull/1396 `070d365d` - Add test. src/cmd/ksh93/COMPATIBILITY: - Note this change.	2021-03-15 21:49:02 +00:00
Johnothan King	6d63b57dd3	Re-enable SHOPT_DEVFD, fixing process substitution fd leaks (#218 ) This commit fixes a long-standing bug (present since at least ksh93r) that caused a file descriptor leak when passing a process substitution to a function, or (if compiled with SHOPT_SPAWN) to a nonexistent command. The leaks only occurred when ksh was compiled with SHOPT_DEVFD; the FIFO method was unaffected. src/cmd/ksh93/sh/xec.c: sh_exec(): - When a process substitution is passed to a built-in, the remaining file descriptor is closed with sh_iorestore. Do the same thing when passing a process substitution to a function. This is done by delaying the sh_iorestore() call to 'setexit:' where both built-ins and functions terminate and set the exit status ($?). This means that call now will not be executed if a longjmp is done, e.g. due to an error in a special built-in. However, there is already another sh_iorestore() call in main.c, exfile(), line 418, that handles that scenario. - sh_ntfork() can fail, so rather than assume it will succeed, handle a failure by closing extra file descriptors with sh_iorestore(). This fixes the leak on command not found with SHOPT_SPAWN. src/cmd/ksh93/include/defs.h: - Since the file descriptor leaks are now fixed, remove the workaround that forced ksh to use the FIFO method. src/cmd/ksh93/SHOPT.sh: - Add SHOPT_DEVFD as a configurable option (default: probe). src/cmd/ksh93/tests/io.sh: - Add a regression test for the 'not found' file descriptor leak. - Add a test to ensure it keeps working with 'command'. Fixes: https://github.com/ksh93/ksh/issues/67	2021-03-13 13:46:42 +00:00
Johnothan King	c3eac977ea	Fix unused process substitutions hanging (#214 ) On systems where ksh needs to use the older and less secure FIFO method for process substitutions (which is currently all of them as the more modern and solid /dev/fd method is still broken, see #67), process substitutions could leave background processes hanging in these two scenarios: 1. If the parent process exits without opening a pipe to the child process forked by the process substitution. The fifo_check() function in xec.c, which is periodically called to check if the parent process still exists while waiting for it to open the FIFO, verified the parent process's existence by checking if the PPID had reverted to 1, the traditional PID of init. However, POSIX specifies that the PPID can revert to any implementation- defined system process in that case. So this breaks on certain systems, causing unused process substitutions to hang around forever as they never detect that the parent disappeared. The fix is to save the current PID before forking and having the child check if the PPID has changed from that saved PID. 2. If command invoked from the main shell is passed a process substitution, but terminates without opening the pipe to the process substitution. In that case, the parent process never disappears in the first place, because the parent process is the main shell. So the same infinite wait occurs in unused process substitutions, even after correcting problem 1. The fix is to remember all FIFOs created for any number of process substitutions passed to a single command, and unlink any remaining FIFOs as they represent unused command substitutions. Unlinking them FIFOs causes sh_open() in the child to fail with ENOENT on the next periodic check, which can easily be handled. Fixing these problems causes the FIFO method to act identically to the /dev/fd method, which is good for compatibility. Even when #67 is fixed this will still be important, as ksh also runs on systems that do not have /dev/fd (such as AIX, HP-UX, and QNX), so will fall back to using FIFOs. --- Fix problem 1 --- src/cmd/ksh93/sh/xec.c: - Add new static fifo_save_ppid variable. - sh_exec(): If a FIFO is defined, save the current PID in fifo_save_ppid for the forked child to use. - fifo_check(): Compare PPID against the saved value instead of 1. --- Fix problem 2 --- To keep things simple I'm abusing the name-value pair routines used for variables for this purpose. The overhead is negligible. A more elegant solution is possible but would involve adding more code. src/cmd/ksh93/include/defs.h: _SH_PRIVATE: - Define new sh.fifo_tree pointer to a new FIFO cleanup tree. src/cmd/ksh93/sh/args.c: sh_argprocsubs(): - After launching a process substitution in the background, add the FIFO to the cleanup list before freeing it. src/cmd/ksh93/sh/xec.c: - Add fifo_cleanup() that unlinks all FIFOs in the cleanup list and clears/closes the list. They should only still exist if the command never used them, however, just run 'unlink' and don't check for existence first as that would only add overhead. - sh_exec(): * Call fifo_cleanup() on finishing all simple commands (when setting $?) or when a special builtin fails. * When forking, clear/close the cleanup list; we do not want children doing duplicate cleanup, particularly as this can interfere when using multiple process substitutions in one command. * Process substitution handling: > Change FIFO check frequency from 500ms to 50ms. Note that each check sends a signal that interrupts open(2), causing sh_open() to reinvoke it. This causes sh_open() to fail with ENOENT on the next check when the FIFO no longer exists, so we do not need to add an additional check for existence to fifo_check(). Unused process substitutions now linger for a maximum of 50ms. > Do not issue an error message if errno == ENOENT. - sh_funct(): Process substitutions can be passed to functions as well, and we do not want commands within the function to clean up the FIFOs for the process substitutions passed to it from the outside. The problem is solved by simply saving fifo_tree in a local variable, setting it to null before running the function, and cleaning it up before restoring the parent one at the end. Since sh_funct() is called recursively for multiple-level function calls, this correctly gives each function a locally scoped fifo_tree. --- Tests --- src/cmd/ksh93/tests/io.sh: - Add tests covering the failing scenarios. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-03-12 11:43:23 +00:00

1 2 3 4 5 ...

318 commits