external/cde - Personal Git space

mirror of git://git.code.sf.net/p/cdesktopenv/code synced 2025-03-09 15:50:02 +00:00

Author	SHA1	Message	Date
Martijn Dekker	cb67a01b45	lex.c: simplify fmttoken() by using the stack (re: `3255aed2`) Using the stack makes it impossible for future buffer overflows to occur. It also simplifies fmttoken() by eliminating the need to declare a local buffer and pass a pointer to that as an argument. For info: man src/lib/libast/man/stak.3	2021-04-09 17:36:29 +01:00
hyenias	3255aed2c4	lex.c: Fix buffer overflow in debug sh_lex and sh_syntax (#262 ) fmttoken() needs a minimal char[4] token buffer passed to it. Originally reported by: Jakub Wilk <jwilk@jwilk.net> Original bug report: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=879464 The following code lines from fmttoken() yield a n=3 for SYMSEMI as n=1 from the start, e.g. 'for <>;'. case SYMSEMI: if(tok[0]=='<') tok[n++] = '>'; sym = ';'; break; default: sym = 0; } tok[n++] = sym; } tok[n] = 0; n[0]='<' n[1]='>' n[2]=';' n[3]=0 # <-- BUFFER overflow as the passed character buffers have a size of 3 src/cmd/ksh93/sh/lex.c: - DBUG: sh_lex(): Adjust char tokstr[3] to char tokstr[4] - sh_syntax(): Adjust char tokbuf[3] to char tokbuf[4]	2021-04-09 02:47:21 +01:00
Johnothan King	a065558291	Fix more compiler warnings, typos and other minor issues (#260 ) Many of these changes are minor typo fixes. The other changes (which are mostly compiler warning fixes) are: NEWS: - The --globcasedetect shell option works on older Linux kernels when used with FAT32/VFAT file systems, so remove the note about it only working with 5.2+ kernels. src/cmd/ksh93/COMPATIBILITY: - Update the documentation on function scoping with an addition from ksh93v- (this does apply to ksh93u+). src/cmd/ksh93/edit/emacs.c: - Check for '_AST_ksh_release', not 'AST_ksh_release'. src/cmd/INIT/mamake.c, src/cmd/INIT/ratz.c, src/cmd/INIT/release.c, src/cmd/builtin/pty.c: - Add more uses of UNREACHABLE() and noreturn, this time for the build system and pty. src/cmd/builtin/pty.c, src/cmd/builtin/array.c, src/cmd/ksh93/sh/name.c, src/cmd/ksh93/sh/nvtype.c, src/cmd/ksh93/sh/suid_exec.c: - Fix six -Wunused-variable warnings (the name.c nv_arrayptr() fixes are also in ksh93v-). - Remove the unused 'tableval' function to fix a -Wunused-function warning. src/cmd/ksh93/sh/lex.c: - Remove unused 'SHOPT_DOS' code, which isn't enabled anywhere. https://github.com/att/ast/issues/272#issuecomment-354363112 src/cmd/ksh93/bltins/misc.c, src/cmd/ksh93/bltins/trap.c, src/cmd/ksh93/bltins/typeset.c: - Add dictionary generator function declarations for former aliases that are now builtins (re: `1fbbeaa1`, `ef1621c1`, `3ba4900e`). - For consistency with the rest of the codebase, use '(void)' instead of '()' for print_cpu_times. src/cmd/ksh93/sh/init.c, src/lib/libast/path/pathshell.c: - Move the otherwise unused EXE macro to pathshell() and only search for 'sh.exe' on Windows. src/cmd/ksh93/sh/xec.c, src/lib/libast/include/ast.h: - Add an empty definition for inline when compiling with C89. This allows the timeval_to_double() function to be inlined. src/cmd/ksh93/include/shlex.h: - Remove the unused 'PIPESYM2' macro. src/cmd/ksh93/tests/pty.sh: - Add '# err_exit #' to count the regression test added in commit `113a9392`. src/lib/libast/disc/sfdcdio.c: - Move diordwr, dioread, diowrite and dioexcept behind '#ifdef F_DIOINFO' to fix one -Wunused-variable warning and multiple -Wunused-function warnings (sfdcdio() only uses these functions when F_DIOINFO is defined). src/lib/libast/string/fmtdev.c: - Fix two -Wimplicit-function-declaration warnings on Linux by including sys/sysmacros.h in fmtdev().	2021-04-08 19:58:07 +01:00
Martijn Dekker	ecf260c282	SHOPT_SPAWN: Fix 'not found' error message inconsistency There's an annoying inconsistency in error messages if ksh is compiled with SHOPT_SPAWN. One way to trigger it: $ /usr/local/bin/ksh -c '/tmp/nonexistent' /usr/local/bin/ksh: /tmp/nonexistent: not found $ /usr/local/bin/ksh -c '/tmp/nonexistent; :' /usr/local/bin/ksh: /tmp/nonexistent: not found [No such file or directory] In the first variant, as an optimisation, ksh went straight to exec'ing the command without forking first. In the second variant, sh_ntfork() was used. The first variant is done in path_exec(), path.c, line 1049: errormsg(SH_DICT,ERROR_exit(ERROR_NOENT),e_found,arg0); The second one is in sh_ntfork(), xec.c, line 3654: errormsg(SH_DICT,ERROR_system(ERROR_NOENT),e_found+4); In both cases, the e_found message is only used if errno==ENOENT, so the extra '[No such file or directory]' message generated by ERROR_system() is pointless as that will never change for that message. src/cmd/ksh93/sh/xec.c: sh_ntfork(): - Use ERROR_exit() instead of ERROR_system() for the e_found message to avoid the superfluous addition.	2021-04-08 16:46:47 +01:00
Martijn Dekker	2e5b625915	Allow path-bound builtins on restricted shells If a system administrator prefixes /opt/ast/bin to the path and then invokes the shell in restricted mode, they clearly intend for the user to run those AST utilities. Similarly, if a system administrator sets a PATH for a restricted shell that includes libraries listed in the .paths file, they must have intended for the user to use those loadable built-ins, as they will be associated with the pathnames of their respective libraries. Since the user cannot change PATH or use the builtin command, they still cannot load just any built-in they choose. src/cmd/ksh93/sh/path.c: - Remove SH_RESTRICTED check when handling path-bound builtins or dynamic libaries containining builtins in $PATH. src/cmd/ksh93/tests/builtins.sh: - Add test verifying a restricted user can use /opt/ast/bin/cat via a PATH search. Progresses: https://github.com/ksh93/ksh/issues/138	2021-04-08 14:48:29 +01:00
Johnothan King	0cd8646361	Backport bugfix for BUG_CSUBSTDO from ksh93v- 2012-08-24 (#259 ) This commit fixes BUG_CSUBSTDO, which could break stdout inside of non-forking command substitutions. The breakage only occurred when stdout was closed outside of the command substitution and a file descriptor other than stdout was redirected in the command substitution (such as stderr). Thanks to the ast-open-history repo, I was able to identify and backport the bugfix from ksh93v- 2012-08-24. This backport may fix other bugs as well. On 93v- 2012-08-24 it fixed the regression below, though it was not triggered on 93u+(m). src/cmd/ksh93/tests/heredoc.sh 487 print foo > $tmp/foofile 488 x=$( $SHELL 2> /dev/null 'read <<< $(<'"$tmp"'/foofile) 2> /dev/null;print -r "$REPLY"') 489 [[ $x == foo ]] \|\| err_exit '<<< $(<file) not working' src/cmd/ksh93/sh/io.c: sh_open(): - If the just-opened file descriptor exists in sftable and is flagged with SF_STRING (as in non-forking command substitutions, among other situations), then move the file descriptor to a number >= 10. src/cmd/ksh93/tests/io.sh: - Add a regression test for BUG_CSUBSTDO, adapted from the one in modernish.	2021-04-08 13:24:17 +01:00
Johnothan King	b2a7ec032f	Add LC_TIME to the supported locale variables (#257 ) The current version of 93u+m does not have proper support for the LC_TIME variable. Setting LC_TIME has no effect on printf %T, and if the locale is invalid no error message is shown: $ LC_TIME=ja_JP.UTF-8 $ printf '%T\n' now Wed Apr 7 15:18:13 PDT 2021 $ LC_TIME=invalid.locale $ # No error message src/cmd/ksh93/data/variables.c, src/cmd/ksh93/include/variables.h, src/cmd/ksh93/sh/init.c: - Add support for the $LC_TIME variable. ksh93v- attempted to add support for LC_TIME, but the patch from that version was extended because the variable still didn't function correctly. src/cmd/ksh93/tests/variables.sh: - Add LC_TIME to the regression tests for LC_* variables.	2021-04-08 13:06:22 +01:00
Martijn Dekker	d0a5cab1ab	cleanup: remove another old and unused experiment This experiment, the initialisation of which was disabled with '#if 0', defines a bunch of integer type commands as special builtins. Most are boring as they define variables just like normal integers: pid_t, size_t, etc. One is interesting: mode_t is a type that automatically converts from a octal permission bits (e.g. 755) to a mode string like u+rwx,g+rw,o+rw. That's not a compelling enough use case to permanently define a special and immutable builtin though. stat_t is odd: it takes a file name as an argument and fills the variable with stat information, but it is base64 encoded binary data and there doesn't seem to be anything that can parse it. Anyway, none of this is going to be enabled, so we should get rid.	2021-04-08 05:28:20 +01:00
Martijn Dekker	997ad43bbf	Properly fix $LINENO crash on ARM (re: `23b7a163`) and other bugs The typecast fix was insufficient, avoiding the crash only when compiling with optimisation disabled. The real problem is that put_lineno() was passed a misaligned pointer, and that the value didn't actually contain a double but a string. The bug occurred when restoring the LINENO value upon exiting a virtual subshell. Thanks to Harald van Dijk for figuring out the fix. src/cmd/ksh93/sh/subshell.c: nv_restore(): - When restoring a special variable as defined by nv_cover(), do not pass either the np->nvflag bits or NV_NOFREE. Why? * The np->nvflag bits are not needed. They are also harmful because they may include the NV_INTEGER bit. This is set when the value is numeric. However, nv_getval() always returns the value in string form, converting it if it is numeric. So the NV_INTEGER flag should never be passed to nv_putval() when it uses the result of nv_getval(). * According to nval.3, the NV_NOFREE flag stops nv_putval() from creating a copy of the value. But this should be unnecessary because the earlier _nv_unset(mp,NV_RDONLY\|NV_CLONE) should ensure there is no previous value. In addition, the NV_NOFREE flag triggered another bug that caused the value of SECONDS to be corrupted upon restoring it when exiting a virtual subshell. - When restoring a regular variable, copy the entire nvalue union and not just the 'cp' member. In practice this worked because no current member of the nvalue union is larger than a pointer. However, there is no guarantee it will stay that way. src/cmd/ksh93/tests/leaks.sh: - Add disabled test for a memory leak that was discovered in the course of dealing with this bug. The fix doesn't introduce or influence it. It will have to be dealt with later. src/cmd/ksh93/tests/locale.sh: - Add test for restoring locale on leaving virtual subshell. https://github.com/ksh93/ksh/issues/253#issuecomment-815290154 src/cmd/ksh93/tests/variables.sh: - Test against corruption of SECONDS on leaving virtual subshell. https://github.com/ksh93/ksh/issues/253#issuecomment-815191052 Co-authored-by: Harald van Dijk <harald@gigawatt.nl> Progresses: https://github.com/ksh93/ksh/issues/253	2021-04-08 00:56:09 +01:00
Martijn Dekker	23b7a163f7	Fix implicit typecast mess in $LINENO discipline functions On Ubuntu arm7, two variables.sh regression tests crashed with a bus error (SIGBUS) in init.c on line 720 while testing $LINENO: 707 static void put_lineno(Namval_t* np,const char val,int flags,Namfun_t fp) 708 { 709 register long n; 710 Shell_t shp = sh_getinterp(); 711 if(!val) 712 { 713 fp = nv_stack(np, NIL(Namfun_t)); 714 if(fp && !fp->nofree) 715 free((void)fp); 716 _nv_unset(np,NV_RDONLY); 717 return; 718 } 719 if(flags&NV_INTEGER) 720 n = (double*)val; 721 else 722 n = sh_arith(shp,val); 723 shp->st.firstline += nget_lineno(np,fp)+1-n; 724 } Apparently, gcc on arm7 doesn't like the implicit typecast from double to long. Those three $LINENO discipline functions are generally a mess of implicit typecasts between Sfdouble_t, double, long and int. Line numbers are internally stored as int. The discipline functions need to use Sfdouble_t for API compatibility. src/cmd/ksh93/sh/init.c: nget_lineno(), put_lineno(), get_lineno(): - Get rid of unnecessary implicit typecasts by adjusting the types of local variables. - Make the typecasts that are done explicit. Progresses: https://github.com/ksh93/ksh/issues/253	2021-04-07 15:53:23 +01:00
Martijn Dekker	6b9703ffdd	Backport bugfixes for arrays of 'enum' types from ksh 93v- beta These fixes are applied rather blindly as no one has yet managed to understand the almost entirely uncommented arrays and variables handling code (arrays.c, name.c, nvdisc.c, nvtree.c, nvtype.c). Hopefully we'll figure all that out at some point. In the meantime these backported fixes appear to work fine, and these bugs impact the usability of 'enum', so I'm just going to have to violate my own policy and backport these fixes without understanding them. Thanks to @JohnoKing for putting in a lot of work tracing these. Further discussion at: https://github.com/ksh93/ksh/issues/87 src/cmd/ksh93/sh/array.c: - nv_arraysettype(): * Further simplify the function. After my initial simplification of it (re: `5491fe97`), I don't believe there's actually a need to save a duplicate copy of the value. Use the pointer returned by nv_getval() directly to restore the value. * Cope with a null value (nv_getval() returning a NULL pointer). This is needed for compatibility with the backported fix in nvtype.c (below). - array_putval(): If the array's value pointer (up->cp) is a pointer to the empty string, it is set to NULL before calling nv_putv() to prevent an empty string from being deleted. Backport a fix from 93v- that restores the pointer to the empty string if the NV_NOFREE attribute is set. Removing it somehow causes these regressions: enum.sh[86]: ${array[@]} doesn't yield all values for associative enum arrays (expected 'green blue blue red yellow green red orange'; got 'green blue blue yellow green orange') enum.sh[94]: unsetting associative enum array does not work (got 'Color_t -A Colors=([foo]=red [rood]=red)') enum.sh[116]: assigning first enum element to indexed array failed (expected 'red red'; got 'BUG BUG') - nv_associative(): Do not increase the 'nelem' (number of elements) value of the array's 'header' struct if the array is associative and of an enum type. The original 93v- fix only checked for the NV_INTEGER attribute, but backporting that caused several regressions. Using a debug output command I've determined that the exact value of 'type' is somehow consistently set to 0x26 if the array is associative and of an enum type, which is NV_INTEGER \| NV_LTOU \| NV_RJUST as defined in include/nval.h. I cannot find where/how that value is determined. In any case this fix, based on but more specific than the 93v- one, appears to work fine. Removing it somehow causes this regression: enum.sh[94]: unsetting associative enum array does not work (got 'Color_t -A Colors=()') src/cmd/ksh93/sh/nvtype.c: nv_settype(): - Another fix backported from 93v-. If the variable is an array, also set the type of element 0 of that array using a call to nv_arraysettype(). The value may be null. Removing this somehow causes this regression: enum.sh[94]: unsetting associative enum array does not work (got 'Color_t -A Colors=()') src/cmd/ksh93/tests/enum.sh: - Add tests for all the bugs fixed here, plus some hypothetical bugs (e.g., do the same tests for indexed enum type arrays as for associative enum type arrays, even though indexed enum type arrays didn't have all the same problems). Co-authored-by: Johnothan King <johnothanking@protonmail.com> Resolves: https://github.com/ksh93/ksh/issues/87	2021-04-06 06:33:32 +01:00
Martijn Dekker	db2b1affdf	Fix unsetting array element after expanding array subscript range Simple reproducer: set -A arr a b c d; : ${arr[1..2]}; unset arr[1]; echo ${arr[@]} Output: a Expected output: a c d The ${arr[1..2]} expansion broke the subsequent 'unset' command so that it unsets element 1 and on, instead of only 1. This regression was introduced in nv_endsubscript() on 2009-07-31: `c47896b4/src/cmd/ksh93/sh/array.c` That change checks for the ARRAY_SCAN attribute which enables processing ranges of array elements instead of single array elements, and restores it after. That restore is evidently not correct as it causes the subsequent unset command to malfunction. If we revert that change, the bug disappears and the regression tests show no failures. However, I don't know what this was meant to accomplish and what other bug we might introduce by reverting this. However, no corresponding regression test was added along with the 2009-07-31 change, nor is there any corresponding message in the changelog. So this looks to be one of those mystery changes that we'll never know the reason for. Since we currently have proof that this change causes breakage and no evidence that it fixes anything, I'll go ahead and revert it (and add a regression test, of course). If that causes another regression, hopefully someone will find it at some point. src/cmd/ksh93/sh/array.c: nv_endsubscript(): - Revert the 2009-07-31 change that saves/restores the ARRAY_SCAN attribute. - Keep the 'ap' pointer as it is now used by newer code. Move the declaration up to the beginning of the block, as is customary. src/cmd/ksh93/sh/init.c: - Cosmetic change: remove an unused array_scan() macro that I found when grepping the code for ARRAY_SCAN. The macro was introduced in version 2001-06-01 but the code that used it was replaced in version 2001-07-04, without removing the macro itself. Resolves: https://github.com/ksh93/ksh/issues/254	2021-04-05 22:16:57 +01:00
hyenias	264ba48bdd	Hardening of readonly variables (#239 ) Ksh currently restricts readonly scalar variables from having their values directly changed via a value assignment. However, since ksh allows variable attributes to be altered, the variable's value can be indirectly altered. For instance, if TMOUT=900 (for a 15 minute idle timeout) was set to readonly, all that is needed to alter the value of TMOUT from 900 to 0 is to issue 'typeset -R1 TMOUT', perhaps followed by a 'typeset -i TMOUT' to turn off the shell's timeout value. In addition, there are problems with arrays. The following is incorrectly allowed: typeset -a arr=((a b c) 1) readonly arr arr[0][1]=d arr=(alphas=(a b c);name=x) readonly arr.alphas arr.alphas[1]=([b]=5) arr=(alphas=(a b c);name=x) readonly arr.alphas arr.alphas[1]=(b) typeset -C arr=(typeset -r -a alphas=(a b c);name=x) arr.alphas[1]=() src/cmd/ksh93/bltins/typeset.c: setall(): - Relocate readonly attribute check higher up the code and widen its application to issue an error message if the pre-existing name-pair has the readonly bit flag set. - To avoid compatibility problems, don't check for readonly if NV_RDONLY is the only attribute set (ignoring NV_NOFREE). This allows 'readonly foo; readonly foo' to keep working. src/cmd/ksh93/sh/array.c: nv_endsubscript(): - Apply a readonly flag check when an array subscript or append assignment occurs, but allow type variables (typeset -T) as they utilize '-r' for 'required' sub-variables. src/cmd/ksh93/tests/readonly.sh: - New file. Create readonly tests that validate the warning message and validate that the readonly variable did not change. src/cmd/ksh93/sh/streval.c: - Bump MAXLEVEL from 9 to 1024 as a workaround for arithmetic expansion, avoiding a spurious error about too much recursion when the readonly.sh tests are run. This change is backported from ksh 93v-. TODO: debug a spurious increase in arithmetic recursion level variable when readonly.sh tests with 'typeset -i' are run. That is a different bug for a different commit. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-04-05 06:43:19 +01:00
Johnothan King	c4f980eb29	Introduce usage of __builtin_unreachable() and noreturn (#248 ) This commit adds an UNREACHABLE() macro that expands to either the __builtin_unreachable() compiler builtin (for release builds) or abort(3) (for development builds). This is used to mark code paths that are never to be reached. It also adds the 'noreturn' attribute to functions that never return: path_exec(), sh_done() and sh_syntax(). The UNREACHABLE() macro is not added after calling these. The purpose of these is: * to slightly improve GCC/Clang compiler optimizations; * to fix a few compiler warnings; * to add code clarity. Changes of note: src/cmd/ksh93/sh/io.c: outexcept(): - Avoid using __builtin_unreachable() here since errormsg can return despite using ERROR_system(1), as shp->jmplist->mode is temporarily set to 0. See: https://github.com/att/ast/issues/1336 src/cmd/ksh93/tests/io.sh: - Add a regression test for the ksh2020 bug referenced above. src/lib/libast/features/common: - Detect the existence of either the C11 stdnoreturn.h header or the GCC noreturn attribute, preferring the former when available. - Test for the existence of __builtin_unreachable(). Use it for release builds. On development builds, use abort() instead, which crahses reliably for debugging when unreachable code is reached. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-04-05 00:28:24 +01:00
Johnothan King	ca2443b58c	`cd -` shouldn't ignore `$OLDPWD` when in a new scope (#249 ) This bug was first reported at <https://github.com/att/ast/issues/8>. The 'cd' command currently takes the value of $OLDPWD from the wrong scope. In the following example 'cd -' will change the directory to /bin instead of /tmp: $ OLDPWD=/bin ksh93 -c 'OLDPWD=/tmp cd -' /bin src/cmd/ksh93/bltins/cd_pwd.c: - Use sh_scoped() to obtain the correct value of $OLDPWD. - Fix a use-after-free bug. Make the 'oldpwd' variable a static char that points to freeable memory. Each time cd is used, this variable is freed if it points to a freeable memory address and isn't also a pointer to shp->pwd. src/cmd/ksh93/sh/path.c: path_pwd(): - Simplify and add comments. - Scope $PWD properly. src/cmd/ksh93/tests/builtins.sh, src/cmd/ksh93/tests/leaks.sh: - Backport the ksh2020 regression tests for 'cd -' when $OLDPWD is set. - Add test for $OLDPWD and $PWD after subshare. - Add test for $PWD after 'cd'. - Add test for possible memory leak. - Add testing for 'unset' on OLDPWD and PWD. src/cmd/ksh93/COMPATIBILITY: - Add compatibility note about changes to $PWD and $OLDPWD. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-04-02 01:19:19 +01:00
Johnothan King	ed478ab7e3	Fix many GCC `-Wimplicit-fallthrough` warnings (#243 ) This commit adds '/* FALLTHROUGH */' comments to fix many GCC warnings when compiling with -Wimplicit-fallthrough. Additionally, the existing fallthrough comments have been changed for consistency.	2021-03-30 21:49:20 +01:00
Martijn Dekker	21d591dbd8	parse.c: rm overlooked SHOPT_BASH stuff (re: `921bbcae`) That bit of code supported bash's redundant 'function foo()' function declaration syntax (with both the 'function' keyword and the '()') which is a syntax error on ksh, as it should be.	2021-03-23 20:03:18 +00:00
Martijn Dekker	71934570bf	Add --globcasedetect shell option for globbing and completion One of the best-kept secrets of libast/ksh93 is that the code includes support for case-insensitive file name generation (a.k.a. pathname expansion, a.k.a. globbing) as well as case-insensitive file name completion on interactive shells, depending on whether the file system is case-insensitive or not. This is transparently determined for each directory, so a path pattern that spans multiple file systems can be part case-sensitive and part case- insensitive. In more precise terms, each slash-separated path name component pattern P is treated as ~(i:P) if its parent directory exists on a case-insensitive file system. I recently discovered this while dealing with <https://github.com/ksh93/ksh/issues/223>. However, that support is dead code on almost all current systems. It depends on pathconf(2) having a _PC_PATH_ATTRIBUTES selector. The 'c' attribute is supposedly returned if the given directory is on a case insensitive file system. There are other attributes as well (at least 'l', see src/lib/libcmd/rm.c). However, I have been unable to find any system, current or otherwise, that has _PC_PATH_ATTRIBUTES. Google and mailing list searches yield no relevant results at all. If anyone knows of such a system, please add a comment to this commit on GitHub, or email me. An exception is Cygwin/Windows, on which the "c" attribute was simply hardcoded, so globbing/completion is always case- insensitive. As of Windows 10, that is wrong, as it added the possibility to mount case-sensitive file systems. On the other hand, this was never activated on the Mac, even though macOS has always used a case-insensitive file like Windows. But, being UNIX, it can also mount case-sensitive file systems. Finally, Linux added the possibility to create individual case- insensitive ext4 directories fairly recently, in version 5.2. https://www.collabora.com/news-and-blog/blog/2020/08/27/using-the-linux-kernel-case-insensitive-feature-in-ext4/ So, since this functionality latently exists in the code base, and three popular OSs now have relevant file system support, we might as well make it usable on those systems. It's a nice idea, as it intuitively makes sense for globbing and completion behaviour to auto-adapt to file system case insensitivity on a per-directory basis. No other shell does this, so it's a nice selling point, too. However, the way it is coded, this is activated unconditionally on supported systems. That is not a good idea. It will surprise users. Since globbing is used with commands like 'rm', we do not want surprises. So this commit makes it conditional upon a new shell option called 'globcasedetect'. This option is only compiled into ksh on systems where we can actually detect FS case insensitivity. To implement this, libast needs some public API additions first. * libast changes * src/lib/libast/features/lib: - Add probes for the linux/fs.h and sys/ioctl.h headers. Linux needs these to use ioctl(2) in pathicase(3) (see below). src/lib/libast/path/pathicase.c, src/lib/libast/include/ast.h, src/lib/libast/man/path.3, src/lib/libast/Mamfile: - Add new pathicase(3) public API function. This uses whatever OS-specific method it can detect at compile time to determine if a particular path is on a case-insensitive file system. If no method is available, it only sets errno to ENOSYS and returns -1. Currently known to work on: macOS, Cygwin, Linux 5.2+, QNX 7.0+. - On systems (if any) that have the mysterious _PC_PATH_ATTRIBUTES selector for pathconf(2), call astconf(3) and check for the 'c' attribute to determine case insensitivity. This should preserve compatibility with any such system. src/lib/libast/port/astconf.c: - dynamic[]: As case-insensitive globbing is now optional on all systems, do not set the 'c' attribute by default on _WINIX (Cygwin/Windows) systems. - format(): On systems that do not have _PC_PATH_ATTRIBUTES, call pathicase(3) to determine the value for the "c" (case insensitive) attribute only. This is for compatibility as it is more efficient to call pathicase(3) directly. src/lib/libast/misc/glob.c, src/lib/libast/include/glob.h: - Add new GLOB_DCASE public API flag to glob(3). This is like GLOB_ICASE (case-insensitive matching) except it only makes the match case-insensitive if the file system for the current pathname component is determined to be case-insensitive. - gl_attr(): For efficiency, call pathicase(3) directly instead of via astconf(3). - glob_dir(): Only call gl_attr() to determine file system case insensitivity if the GLOB_DCASE flag was passed. This makes case insensitive globbing optional on all systems. - glob(): The options bitmask needs to be widened to fit the new GLOB_DCASE option. Define this centrally in a new GLOB_FLAGMASK macro so it is easy to change it along with GLOB_MAGIC (which uses the remaining bits for a sanity check bit pattern). src/lib/libast/path/pathexists.c: - For efficiency, call pathicase(3) directly instead of via astconf(3). * ksh changes * src/cmd/ksh93/features/options, src/cmd/ksh93/SHOPT.sh: - Add new SHOPT_GLOBCASEDET compile-time option. Set it to probe (empty) by default so that the shell option is compiled in on supported systems only, which is determined by new iffe feature test that checks if pathicase(3) returns an ENOSYS error. src/cmd/ksh93/data/options.c, src/cmd/ksh93/include/shell.h: - Add -o globcasedetect shell option if compiling with SHOPT_GLOBCASEDET. src/cmd/ksh93/sh/expand.c: path_expand(): - Pass the new GLOB_DCASE flag to glob(3) if the globcasedetect/SH_GLOBCASEDET shell option is set. src/cmd/ksh93/edit/completion.c: - While file listing/completion is based on globbing and automatically becomes case-insensitive when globbing does, it needs some additional handling to make a string comparison case-insensitive in corresponding cases. Otherwise, partial completions may be deleted from the command line upon pressing tab. This code was already in ksh 93u+ and just needs to be made conditional upon SHOPT_GLOBCASEDET and globcasedetect. - For efficiency, call pathicase(3) directly instead of via astconf(3). src/cmd/ksh93/sh.1: - Document the new globcasedetect shell option.	2021-03-22 18:45:19 +00:00
Johnothan King	814b5c6890	Fix various minor problems and update the documentation (#237 ) These are minor fixes I've accumulated over time. The following changes are somewhat notable: - Added a missing entry for 'typeset -s' to the man page. - Add strftime(3) to the 'see also' section. This and the date(1) addition are meant to add onto the documentation for 'printf %T'. - Removed the man page the entry for ksh reading $PWD/.profile on login. That feature was removed in commit `aa7713c2`. - Added date(1) to the 'see also' section of the man page. - Note that the 'hash' command can be used instead of 'alias -t' to workaround one of the caveats listed in the man page. - Use an 'out of memory' error message rather than 'out of space' when memory allocation fails. - Replaced backticks with quotes in some places for consistency. - Added missing documentation for the %P date format. - Added missing documentation for the printf %Q and %p formats (backported from ksh2020: https://github.com/att/ast/pull/1032). - The comments that show each builtin's options have been updated.	2021-03-21 14:39:03 +00:00
Martijn Dekker	7b0e0776e2	cleanup: remove legacy code for systems without fork(2) In 2021, it seems like it's about time to join the 21st century and officially require fork(2). In practice this was already the case as the legacy code was unmaintained and didn't compile.	2021-03-21 06:39:32 +00:00
Martijn Dekker	38f2b94f55	Some more #ifdef cleanups src/cmd/ksh93/edit/edit.c, src/cmd/ksh93/edit/history.c, src/cmd/ksh93/sh/deparse.c: - Remove experimental code protected by '#ifdef future'. No one is going to do anything with this, it's just clutter. src/lib/libast/sfio/sfcvt.c: - In 2021, it might be time to actually start using some C99 features were available. Change two checks for a _c99_in_the_wild macro to actual checks for C99, enabling the use of fpclassify(). Resolves: https://github.com/ksh93/ksh/issues/219	2021-03-21 06:39:32 +00:00
Martijn Dekker	0b814b53bd	Remove more legacy libast code (re: `f9c127e3`, `651bbd56`) This removes #ifdefs checking for the existence of SH_PLUGIN_VERSION (version check for dynamically loaded builtins) and the SFIO identifiers SF_BUFCONST, SF_CLOSING, SF_APPENDWR, SF_ATEXIT, all of which are defined by the bundled libast.	2021-03-21 06:39:32 +00:00
Martijn Dekker	936a1939a8	Allow proper tilde expansion overrides (#225 ) Until now, when performing any tilde expansion like ~/foo or ~user/foo, ksh added a placeholder built-in command called '.sh.tilde', ostensibly with the intention to allow users to override it with a shell function or custom builtin. The multishell ksh93 repo <https://github.com/multishell/ksh93/> shows this was added sometime between 2002-06-28 and 2004-02-29. However, it has never worked and crashed the shell. This commit replaces that with something that works. Specific tilde expansions can now be overridden using .set or .get discipline functions associated with the .sh.tilde variable (see manual, Discipline Functions). For example, you can use either of: .sh.tilde.set() { case ${.sh.value} in '~tmp') .sh.value=${XDG_RUNTIME_DIR:-${TMPDIR:-/tmp}} ;; '~doc') .sh.value=~/Documents ;; '~ksh') .sh.value=/usr/local/src/ksh93/ksh ;; esac } .sh.tilde.get() { case ${.sh.tilde} in '~tmp') .sh.value=${XDG_RUNTIME_DIR:-${TMPDIR:-/tmp}} ;; '~doc') .sh.value=~/Documents ;; '~ksh') .sh.value=/usr/local/src/ksh93/ksh ;; esac } src/cmd/ksh93/include/variables.h, src/cmd/ksh93/data/variables.c: - Add SH_TILDENOD for a new ${.sh.tilde} predefined variable. It is initially unset. src/cmd/ksh93/sh/macro.c: - sh_btilde(): Removed. - tilde_expand2(): Rewritten. I started out with the tiny version of this function from the 2002-06-28 version of ksh. It uses the stack instead of sfio, which is more efficient. A bugfix for $HOME == '/' was retrofitted so that ~/foo does not become //foo instead of /foo. The rest is entirely new code. To implement the override functionality, it now checks if ${.sh.tilde} has any discipline function associated with it. If it does, it assigns the tilde expression to ${.sh.tilde} using nv_putval(), triggering the .set discipline, and then reads it back using nv_getval(), triggering the .get discipline. The resulting value is used if it is nonempty and does not still start with a tilde. src/cmd/ksh93/bltins/typeset.c, src/cmd/ksh93/tests/builtins.sh: - Since ksh no longer adds a dummy '.sh.tilde' builtin, remove the ad-hoc hack that suppressed it from the output of 'builtin'. src/cmd/ksh93/tests/tilde.sh: - Add tests verifying everything I can think of, as well as tests for bugs found and fixed during this rewrite. src/cmd/ksh93/tests/pty.sh: - Add test verifying that the .sh.tilde.set() discipline does not modify the exit status value ($?) when performing tilde expansion as part of tab completion. src/cmd/ksh93/sh.1: - Instead of "tilde substitution", call the basic mechanism "tilde expansion", which is the term used everywhere else (including the 1995 Bolsky/Korn ksh book). - Document the new override feature. Resolves: https://github.com/ksh93/ksh/issues/217	2021-03-17 21:07:14 +00:00
Martijn Dekker	595a0a5684	Revert "Backport atomic job locking from ksh 93v- beta" (`52067c3d`) That patch broke the build on Cygwin, where gcc apparently doesn't have the required atomic addition/subtraction compiler builtins. The build fails at link time with those functions not found. As far as I know, ksh was actually working fine (after @JohnoKing's gcc workaround in `c258a04f`), so I'll just revert this for now. If a need for it is demonstrated later, we'll have to add a feature test or find some other way to get it working on Cygwin.	2021-03-17 14:35:15 +00:00
Martijn Dekker	44438725b1	sh_done(): fix portable exit status logic (re: `d024d4c8`) "savxit -= SH_EXITSIG + 128;" may have worked accidentally due to subsequent bitmasking, but is blatantly wrong . It subtracts 256 + 128 = 384 from the exit status. Use bitwise logic instead, with an octal literal 0200 instead of 128. This makes more sense in this context.	2021-03-17 09:33:23 +00:00
Johnothan King	14352ba0a7	Save $? when discipline triggered without command (#226 ) A discipline function could incorrectly influence the value of $? (exit status of last command) outside its context if it was triggered without another command being run, e.g. when a prompt variable is read, or COLUMNS or LINES is set. Reproducers include: PS1 prompt: $ PS1.get() { true; } $ false $ echo $? 0 PS2 prompt: $ PS2.get() { return 13; } $ \ > $ echo $? 13 The set discipline is affected too, e.g. COLUMNS and LINES: $ COLUMNS.set() { return 13; } $ true $ (press return) $ echo $? 13 There are probably other contexts where the shell reads or changes variables without running commands, allowing their get or set disciplines to influence $?. So this commit makes ksh save $? for all .get, .set, .append, and .unset discipline calls. src/cmd/ksh93/sh/nvdisc.c: - assign(): Save/restore $? when running a .set/.append/.unset discipline function. - lookup(): Save/restore $? when running a .get discipline. src/cmd/ksh93/tests/pty.sh: - Add a regression test for $? after displaying a prompt and when setting a LINES.set discipline function. src/cmd/ksh93/tests/return.sh: - The above test fails in script form on ksh93u+ and ksh2020, as it exposes another form of #117 that occurs after running a subshell. Add the above regression test here as well (re: `092b90da`). Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-03-16 16:13:13 +00:00
Martijn Dekker	1df6a82a8a	Make ~ expand to home directory after unsetting HOME There was an issue with tilde expansion if the HOME var is unset. $ unset HOME $ echo ~ martijn Only the username is returned. Users are more likely to expect the current user's home directory as configured in the OS. POSIXly, the expansion of ~ is based on the value of HOME. If HOME is unset, the results are unspecified. After unsetting HOME, in bash, ~ returns the user's home directory as specified by the OS, whereas in all other shells, ~ expands to the empty string. Only ksh93 returns the username. The behaviour of bash is more useful. Discussion: https://github.com/ksh93/ksh/pull/225#issuecomment-799074107 src/cmd/ksh93/sh/macro.c, src/cmd/ksh93/tests/tilde.sh: - sh_tilde(): Backport fix by Mike Gilbert from ksh2020. See: https://github.com/att/ast/issues/1391 https://github.com/att/ast/pull/1396 `070d365d` - Add test. src/cmd/ksh93/COMPATIBILITY: - Note this change.	2021-03-15 21:49:02 +00:00
Johnothan King	6d63b57dd3	Re-enable SHOPT_DEVFD, fixing process substitution fd leaks (#218 ) This commit fixes a long-standing bug (present since at least ksh93r) that caused a file descriptor leak when passing a process substitution to a function, or (if compiled with SHOPT_SPAWN) to a nonexistent command. The leaks only occurred when ksh was compiled with SHOPT_DEVFD; the FIFO method was unaffected. src/cmd/ksh93/sh/xec.c: sh_exec(): - When a process substitution is passed to a built-in, the remaining file descriptor is closed with sh_iorestore. Do the same thing when passing a process substitution to a function. This is done by delaying the sh_iorestore() call to 'setexit:' where both built-ins and functions terminate and set the exit status ($?). This means that call now will not be executed if a longjmp is done, e.g. due to an error in a special built-in. However, there is already another sh_iorestore() call in main.c, exfile(), line 418, that handles that scenario. - sh_ntfork() can fail, so rather than assume it will succeed, handle a failure by closing extra file descriptors with sh_iorestore(). This fixes the leak on command not found with SHOPT_SPAWN. src/cmd/ksh93/include/defs.h: - Since the file descriptor leaks are now fixed, remove the workaround that forced ksh to use the FIFO method. src/cmd/ksh93/SHOPT.sh: - Add SHOPT_DEVFD as a configurable option (default: probe). src/cmd/ksh93/tests/io.sh: - Add a regression test for the 'not found' file descriptor leak. - Add a test to ensure it keeps working with 'command'. Fixes: https://github.com/ksh93/ksh/issues/67	2021-03-13 13:46:42 +00:00
Johnothan King	c3eac977ea	Fix unused process substitutions hanging (#214 ) On systems where ksh needs to use the older and less secure FIFO method for process substitutions (which is currently all of them as the more modern and solid /dev/fd method is still broken, see #67), process substitutions could leave background processes hanging in these two scenarios: 1. If the parent process exits without opening a pipe to the child process forked by the process substitution. The fifo_check() function in xec.c, which is periodically called to check if the parent process still exists while waiting for it to open the FIFO, verified the parent process's existence by checking if the PPID had reverted to 1, the traditional PID of init. However, POSIX specifies that the PPID can revert to any implementation- defined system process in that case. So this breaks on certain systems, causing unused process substitutions to hang around forever as they never detect that the parent disappeared. The fix is to save the current PID before forking and having the child check if the PPID has changed from that saved PID. 2. If command invoked from the main shell is passed a process substitution, but terminates without opening the pipe to the process substitution. In that case, the parent process never disappears in the first place, because the parent process is the main shell. So the same infinite wait occurs in unused process substitutions, even after correcting problem 1. The fix is to remember all FIFOs created for any number of process substitutions passed to a single command, and unlink any remaining FIFOs as they represent unused command substitutions. Unlinking them FIFOs causes sh_open() in the child to fail with ENOENT on the next periodic check, which can easily be handled. Fixing these problems causes the FIFO method to act identically to the /dev/fd method, which is good for compatibility. Even when #67 is fixed this will still be important, as ksh also runs on systems that do not have /dev/fd (such as AIX, HP-UX, and QNX), so will fall back to using FIFOs. --- Fix problem 1 --- src/cmd/ksh93/sh/xec.c: - Add new static fifo_save_ppid variable. - sh_exec(): If a FIFO is defined, save the current PID in fifo_save_ppid for the forked child to use. - fifo_check(): Compare PPID against the saved value instead of 1. --- Fix problem 2 --- To keep things simple I'm abusing the name-value pair routines used for variables for this purpose. The overhead is negligible. A more elegant solution is possible but would involve adding more code. src/cmd/ksh93/include/defs.h: _SH_PRIVATE: - Define new sh.fifo_tree pointer to a new FIFO cleanup tree. src/cmd/ksh93/sh/args.c: sh_argprocsubs(): - After launching a process substitution in the background, add the FIFO to the cleanup list before freeing it. src/cmd/ksh93/sh/xec.c: - Add fifo_cleanup() that unlinks all FIFOs in the cleanup list and clears/closes the list. They should only still exist if the command never used them, however, just run 'unlink' and don't check for existence first as that would only add overhead. - sh_exec(): * Call fifo_cleanup() on finishing all simple commands (when setting $?) or when a special builtin fails. * When forking, clear/close the cleanup list; we do not want children doing duplicate cleanup, particularly as this can interfere when using multiple process substitutions in one command. * Process substitution handling: > Change FIFO check frequency from 500ms to 50ms. Note that each check sends a signal that interrupts open(2), causing sh_open() to reinvoke it. This causes sh_open() to fail with ENOENT on the next check when the FIFO no longer exists, so we do not need to add an additional check for existence to fifo_check(). Unused process substitutions now linger for a maximum of 50ms. > Do not issue an error message if errno == ENOENT. - sh_funct(): Process substitutions can be passed to functions as well, and we do not want commands within the function to clean up the FIFOs for the process substitutions passed to it from the outside. The problem is solved by simply saving fifo_tree in a local variable, setting it to null before running the function, and cleaning it up before restoring the parent one at the end. Since sh_funct() is called recursively for multiple-level function calls, this correctly gives each function a locally scoped fifo_tree. --- Tests --- src/cmd/ksh93/tests/io.sh: - Add tests covering the failing scenarios. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-03-12 11:43:23 +00:00
Martijn Dekker	d4adc8fcf9	Fix test -v for numeric types & set/unset state for short int This commit fixes two interrelated problems. 1. The -v unary test/[/[[ operator is documented to test if a variable is set. However, it always returns true for variable names with a numeric attribute, even if the variable has not been given a value. Reproducer: $ ksh -o nounset -c 'typeset -i n; [[ -v n ]] && echo $n' ksh: n: parameter not set That is clearly wrong; 'echo $n' should never be reached and the error should not occur, and does not occur on mksh or bash. 2. Fixing the previous problem revealed serious breakage in short integer type variables that was being masked. After applying that fix and then executing 'typeset -si var=0': - The conditional assignment expansions ${var=123} and ${var:=123} assigned 123 to var, even though it was set to 0. - The expansions ${var+s} and ${var:+n} incorrectly acted as if the variable was unset and empty, respectively. - '[[ -v var ]]' and 'test -v var' incorrectly returned false. The problems were caused by a different storage method for short ints. Their values were stored directly in the 'union Value' member of the Namval_t struct, instead of allocated on the stack and referred to by a pointer, as regular integers and all other types do. This inherently broke nv_isnull() as this leaves no way to distinguish between a zero value and no value at all. (I'm also pretty sure it's undefined behaviour in C to check for a null pointer at the address where a short int is stored.) The fix is to store short ints like other variables and refer to them by pointers. The NV_INT16P combined bit mask already existed for this, but nv_putval() did not yet support it. src/cmd/ksh93/bltins/test.c: test_unop(): - Fix problem 1. For -v, only check nv_isnull() and do not check for the NV_INTEGER attribute (which, by the way, is also used for float variables by combining it with other bits). See also `5aba0c72` where we recently fixed nv_isnull() to work properly for all variable types including short ints. src/cmd/ksh93/sh/name.c: nv_putval(): - Fix problem 2, part 1. Add support for NV_INT16P. The code is simply copied and adapted from the code for regular integers, a few lines further on. The regular NV_SHORT code is kept as this is still used for some special variables like ${.sh.level}. src/cmd/ksh93/bltins/typeset.c: b_typeset(): - Fix problem 2, part 2. Use NV_INT16P instead of NV_SHORT. src/cmd/ksh93/tests/attributes.sh: - Add set/unset/empty/nonempty tests for all numeric types. src/cmd/ksh93/tests/bracket.sh, src/cmd/ksh93/tests/comvar.sh: - Update a couple of existing tests. - Add test for [[ -v var ]] and [[ -n ${var+s} ]] on unset and empty variables with many attributes. src/cmd/ksh93/COMPATIBILITY: - Add a note detailing the change to test -v. src/cmd/ksh93/data/builtins.c, src/cmd/ksh93/sh.1: - Correct 'typeset -C' documentation. Variables declared as compound are not initially unset, but initially have the empty compound value. 'typeset' outputs them as: typeset -C foo=() and not: typeset -C foo and nv_isnull() is never true for them. This may or may not technically be a bug. I don't think it's worth changing, but it should at least be documented correctly.	2021-03-10 00:38:41 +00:00
Martijn Dekker	4a8072e826	Fix ${!foo@} and ${!foo*} to include 'foo' itself in search These expansions are supposed to yield all variable names beginning with the indicated prefix. This should include the variable name that is identical to the prefix (as 'prefix' begins with 'prefix'). This bugfix is backported from the abandoned ksh 93v- beta, so AT&T intended this change. It also makes ksh work like bash in this. src/cmd/ksh93/sh/macro.c: varsub(): M_NAMESCAN: - Check if the prefix itself exists. If so, start with that. src/cmd/ksh93/tests/variables.sh: - Add tests for these expansions. src/cmd/ksh93/sh.1: - Fix the incomplete documentation of these expansions. src/cmd/ksh93/COMPATIBILITY: - Note the change as it's potentially incompatible in corner cases. Resolves: https://github.com/ksh93/ksh/issues/183	2021-03-09 05:00:04 +00:00
Martijn Dekker	e58637752a	sh_debug(): restore NV_NOFREE attributes (re: `c928046a`) Removing the nv_putval() calls also stopped making sure the NV_NOFREE attribute was set for those variables, causing an invalid free later on. This caused the funcname.ksh script: https://gist.github.com/ormaaj/12874b68acd06ee98b59 to crash even more readily than it did before. Even after this commit there are various crashing bugs left for that script, all intermittent and with different backtraces and dependent on the operating system and malloc variant used. Investigation ongoing at: https://github.com/ksh93/ksh/issues/212	2021-03-08 21:21:37 +00:00
hyenias	5aba0c7251	Fix set/unset state for short integer (typeset -si) (#211 ) This commit fixes at least three bugs: 1. When issuing 'typeset -p' for unset variables typeset as short integer, a value of 0 was incorrectly diplayed. 2. ${x=y} and ${x:=y} were still broken for short integer types (re: `9f2389ed`). ${x+set} and ${x:+nonempty} were also broken. 3. A memory fault could occur if typeset -l followed a -s option with integers. Additonally, now the last -s/-l wins out as the option to utilize instead of it always being short. src/cmd/ksh93/include/name.h: - Fix the nv_isnull() macro by removing the direct exclusion of short integers from this set/unset test. This breaks few things (only ${.sh.subshell} and ${.sh.level}, as far as we can tell) while potentially correcting many aspects of short integer use (at least bugs 1 and 2 above), as this macro is widely used. - union Value: add new pid_t pidp pointer member for PID values (see further below). src/cmd/ksh93/bltins/typeset.c: b_typeset(): - To fix bug 3 above, unset the 'shortint' flag and NV_SHORT attribute bit upon encountering the -l optiobn. To fix ${.sh.subshell} to work with the new nv_isnull(): src/cmd/ksh93/sh/defs.h: - Add new 'realsubshell' member to the shgd (aka shp->gd) struct which will be the integer value for ${.sh.subshell}. src/cmd/ksh93/sh/init.c, src/cmd/ksh93/data/variables.c: - Initialize SH_SUBSHELLNOD as a pointer to shgd->realsubshell instead of using a short value (.s) directly. Using a pointer allows nv_isnull() to return a positive for ${.sh.subshell} as a non-null pointer is what it checks for. - While we're at it, initialize PPIDNOD ($PPID) and SH_PIDNOD (${.sh.pid}) using the new pdip union member, which is more correct as they are values of type pid_t. src/cmd/ksh93/sh/subshell.c, src/cmd/ksh93/sh/xec.c: - Update the ${.sh.subshell} increases/decreases to refer to shgd->realsubshell (a.k.a. shp->gd->realsubshell). * To fix ${.sh.level} after changing nv_isnull(): src/cmd/ksh93/sh/macro.c: varsub(): - Add a specific exception for SH_LEVLNOD to the nv_isnull() test, so that ${.sh.level} is always considered to be set. Its handling throughout the code is too complex/special for a simple fix, so we have to special-case it, at least for now. *** Regression test additions: src/cmd/ksh93/tests/attributes.sh: - Add in missing short integer tests and correct the one that existed. The -si test now yields 'typeset -x -r -s -i foo' instead of 'typeset -x -r -s -i foo=0' which brings it in line with all the others. - Add in some other -l attribute tests for floats. Note, -lX test was not added as the size of long double is platform dependent. src/cmd/ksh93/tests/variables.sh: - Add tests for ${x=y} and ${x:=y} used on short int variables. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-03-08 04:19:36 +00:00
Martijn Dekker	40860dac20	job_init(): fix init on setpgid() permission denied (re: `41ebb55a`) Symptoms of this bug below. These only seem to occur on Linux and only if you replace your initial login shell by ksh using 'exec'. 1. An erroneous 'Interrupt' message is printed after stopping the read builtin in a script. Reproducer: $ exec arch//bin/ksh $ cat ./reproducer.sh #!/bin/sh read foo $ ./reproducer.sh ^C$ <Enter> [1] + Interrupt ../reproducer.sh 2. Ctrl+C fails to stop /bin/package make. Reproducer: $ exec arch//bin/ksh $ mv arch arch.old $ bin/package make # Press Ctrl+C multiple times Analysis: In `41ebb55a`, I made an error in changing job_init() to work correctly on non-interactive shells. This line from before: 552\| if(possible = (setpgid(0,job.mypgid)>=0) \|\| errno==EPERM) was changed to: 555\| possible = (setpgid(0,job.mypgid) >= 0); 556\| if(sh_isoption(SH_INTERACTIVE) && (possible \|\| errno==EPERM)) That is wrong. Before, 'possible' was set to 1 (true) if setpgid() either succeeded or failed with EPERM. After, it is only set to 1 if setpgid() succeeds. As a result, job control initialisation is aborted later on upon a test for non-zero 'possible'. src/cmd/ksh93/sh/jobs.c: job_init(): - Once again set possible to 1 even if setpgid() fails with EPERM. Thanks to @JohnoKing for the bug report and reproducers. Resolves: https://github.com/ksh93/ksh/issues/210	2021-03-07 17:01:17 +00:00
Martijn Dekker	89c69b076d	Fix command history corruption on syntax error (re: `e999f6b1`) Analysis: When a syntax error occurs, the shell performs a longjmp(3) back to exfile() in main.c on line 417: 415\| if(jmpval) 416\| { 417\| Sfio_t top; 418\| sh_iorestore((void)shp,0,jmpval); 419\| hist_flush(shp->gd->hist_ptr); 420\| sfsync(shp->outpool); The first thing it does is restore the file descriptor state (sh_iorestore), then it flushes the history file (hist_flush), then it synchronises sfio's logical stream state with the physical stream state using (sfsync). However, the fix applied in `e999f6b1` caused sh_iorestore() to sync all sfio streams unconditionally. So this was done before hist_flush(), which caused unpredictable behaviour, including temporary and/or permanent history corruption, as this also synched shp->outpool before hist_flush() had a chance to do its thing. The fix is to only call sfsync() in sh_iorestore() if we're actually about to call ftruncate(2), and not otherwise. Moral of the story: bug fixes should be as specific as possible to minimise the risk of side effects. src/cmd/ksh93/sh/io.c: sh_iorestore(): - Only call sfsync() if we're about to truncate a file. src/cmd/ksh93/tests/pty.sh: - Add test. Thanks to Marc Wilson for reporting the bug and to Johnothan King for finding the commit that introduced it. Resolves: https://github.com/ksh93/ksh/issues/209 Relevant: https://github.com/att/ast/issues/61	2021-03-07 00:27:33 +00:00
Martijn Dekker	9f2389ed93	Fix ${x=y} and ${x:=y} for numeric types of x These POSIX expansions first assign y to x if x is unset or empty, respectively, and then they yield the value of x. This was not working on any ksh93 version if x was typeset as numeric (integer or float) but still unset, as in not assigned a value. $ unset a; typeset -i a; printf '%q\n' "${a:=42}" "$a" 0 '' Expected output: 42 42 src/cmd/ksh93/sh/macro.c: - Fix the test for set/unset variable. It was broken because it only checked for the existence of the node, which exists after 'typeset', but did not check if a value had been assigned. This additional check needs to be done with the nv_isnull() macro, but only for expansions of the regular M_BRACE type. Special expansions cannot have an unset state. - As of commit `95294419`, we know that an nv_optimize() call may be needed before using nv_isnull() if the shell is compiled with SHOPT_OPTIMIZE. Move the nv_optimize() call from that commit forward to before the new check that calls nv_isnull(), and only bother with it if the type is M_BRACE. src/cmd/ksh93/tests/variables.sh: - Add tests for this bug. Test float and integer, and also check that ${a=b} and ${a:=b} correctly treat the value of 'b' as an arithmetic expression of which the result is assigned to 'a' if 'a' was typeset as numeric. src/cmd/ksh93/tests/attributes.sh, src/cmd/ksh93/tests/comvar.sh, src/cmd/ksh93/tests/nameref.sh, src/cmd/ksh93/tests/types.sh: - Fix a number of tests to report failures correctly. Resolves: https://github.com/ksh93/ksh/issues/157	2021-03-06 03:56:52 +00:00
Martijn Dekker	f8f2c4b608	Remove obsolete quote balancing hack The old Bourne shell failed to check for closing quotes and command substitution backticks when encountering end-of-file in a parser context (such as a script). ksh93 implemented a hack for partial compatibility with this bug, tolerating unbalanced quotes and backticks in backtick command subsitutions, 'eval', and command line invocation '-c' scripts only. This hack became broken for backtick command substitutions in fe20311f/350b52ea as a memory leak was fixed by adding a newline to the stack at the end of the command substitution. That extra newline becomes part of any string whose quotes are not properly terminated, causing problems such as the one detailed here: https://www.mail-archive.com/ast-developers@lists.research.att.com/msg01889.html $ touch abc $ echo `ls "abc` ls: abc : not found No other fix for the memory leak is known that doesn't cause other problems. (The alternative fix detailed in the referenced mailing list post causes a different corner-case regression.) Besides, the hack has always caused other corner case bugs as well: $ ksh -c '((i++' Actual: ksh: i++(: not found (If an external command 'i++(' existed, it would be run) Expect: ksh: syntax error at line 1: `(' unmatched $ ksh -c 'i=0; echo $((++i' Actual: (empty line; the arithmetic expansion is ignored) Expect: ksh: syntax error at line 1: `(' unmatched $ ksh -c 'echo $(echo "hi)' Actual: ksh: syntax error at line 1: `(' unmatched Expect: ksh: syntax error at line 1: `"' unmatched So, it's time to get rid of this hack. The old Bourne shell is dead and buried. No other shell tries to support this breakage. Tolerating syntax errors is just asking for strange side effects, inconsistent states, and corner case bugs. We should not want to do that. Old scripts that rely on this will just need to be fixed. src/cmd/ksh93/sh/lex.c: - struct lexdata: Remove 'char balance' member for remembering an unbalanced quote or backtick. - sh_lex(): Remove the back to remember and compensate for unbalanced quotes/backticks that was executed only if we were executing a script from a string, as opposed to a file. src/cmd/ksh93/COMPATIBILITY: - Note the change. Resolves: https://github.com/ksh93/ksh/issues/199	2021-03-05 22:17:14 +00:00
Martijn Dekker	b48e5b3365	Fix arbitrary command execution vuln in array subscripts in arith This commit fixes an arbitrary command execution vulnerability in array subscripts used within the arithmetic subsystem. One of the possible reproducers is: var='1$(echo INJECTION >&2)' ksh -c \ 'typeset -A a; ((a[$var]++)); typeset -p a' Output before this commit: INJECTION typeset -A a=([1]=1) The 'echo' command has been surreptitiously executed from an external environment variable. Output after this commit: typeset -A a=(['1$(echo INJECTION >&2)']=1) The value is correctly used as an array subscript and nothing in it is parsed or executed. This is as it should be, as ksh93 supports arbitrary subscripts for associative arrays. If we think about it logically, the C-style arithmetic subsystem simply has no business messing around with shell expansions or quoting at all, because those don't belong to it. Shell expansions and quotes are properly resolved by the main shell language before the arithmetic subsystem is even invoked. It is particularly important to maintain that separation because the shell expansion mechanism also executes command substitutions. Yet, the arithmetic subsystem subjected array subscripts that contain `$` (and only array subscripts -- how oddly specific) to an additional level of expansion and quote resolution. For some unfathomable reason, there are two lines of code doing specifically this. The vulnerability is fixed by simply removing those. Incredibly, variants of this vulnerability are shared by bash, mksh and zsh. Instead of fixing it, it got listed in Bash Pitfalls! http://mywiki.wooledge.org/BashPitfalls#y.3D.24.28.28_array.5B.24x.5D_.29.29 src/cmd/ksh93/sh/arith.c: - scope(): Remove these two lines that implement the vulnerability. if(strchr(sub,'$')) sub = sh_mactrim(shp,sub,0); - scope(), arith(): Remove the NV_SUBQUOTE flag from two nv_endsubscript() calls. That flag causes the array subscript to retain the current level of shell quoting. The shell quotes everything as in "double quotes" before invoking the arithmetic subsystem, and the bad sh_mactrim() call removed one level of quoting. Since we're no longer doing that, this flag should no longer be passed, or subscripts may get extra backslash escapes. src/cmd/ksh93/include/name.h, src/cmd/ksh93/sh/array.c: - nv_endsubscript(): The NV_SUBQUOTE flag was only passed from arith.c. Since it is now unused, remove it. src/cmd/ksh93/tests/arith.sh: - Tweak some tests: fix typos, report wrong values. - Add 21 tests. Most are based on reproducers contributed by @stephane-chazelas and @hyenias. They verify that this vulnerability is gone and that no quoting bugs were introduced. Resolves: https://github.com/ksh93/ksh/issues/152	2021-03-04 13:37:13 +00:00
Martijn Dekker	6146848693	Fix compiling with SHOPT_REGRESS and SHOPT_P_SUID src/cmd/ksh93/Mamfile: - regress.c: add missing SH_DICT define for getopt self-doc string, needed after USAGE_LICENSE macros were removed. (re: `ede47996`) src/cmd/ksh93/init.c: sh_init(): - Do not set error_info.exit early in init. This is the function that is called when an error exits the shell. It defaults to exit(3). Setting it to sh_exit() early on can cause a crash if an error is thrown before shell initialisation is fully finished. So set it at the end of sh_init() instead. - __regress__: Remove error_info.exit workaround. (re: `506bd2b2`) - Fix SHOPT_P_SUID directive. This is not actually a 0/1 value, so we should use #ifdef and not #if. If SHOPT_REGRESS is on, it it set to a function call. (re: `2182ecfa`) src/cmd/ksh93/SHOPT.sh: - Document that SHOPT_P_SUID cannot be set to 0 to be turned off.	2021-02-28 23:24:58 +00:00
Johnothan King	7ad274f8b6	Add more out of memory checks (re: `18529b88`) (#192 ) The referenced commit neglected to add checks for strdup() calls. That calls malloc() as well, and is used a lot. This commit switches to another strategy: it adds wrapper functions for all the allocation macros that check if the allocation succeeded, so those checks don't need to be done manually. src/cmd/ksh93/include/defs.h, src/cmd/ksh93/sh/init.c: - Add sh_malloc(), sh_realloc(), sh_calloc(), sh_strdup(), sh_memdup() wrapper functions with success checks. Call nospace() to error out if allocation fails. - Update new_of() macro to use sh_malloc(). - Define new sh_newof() macro to replace newof(); it uses sh_realloc(). All other changed files: - Replace the relevant calls with the wrappers. - Remove now-redundant success checks from `18529b88`. - The ERROR_PANIC error message calls are updated to inclusive-or ERROR_SYSTEM into the exit code argument, so libast's error() appends the human-readable version of errno in square brackets. See src/lib/libast/man/error.3 src/cmd/ksh93/edit/history.c: - Include "defs.h" to get access to the wrappers even if KSHELL is not defined. - Since we're here, fix a compile error that occurred with KSHELL undefined by updating the type definition of hist_fname[] to match that of history.h. src/cmd/ksh93/bltins/enum.c: - To get access to sh_newof(), include "defs.h" instead of <shell.h> (note that "defs.h" includes <shell.h> itself). src/cmd/ksh93/Mamfile: - enum.c: depend on defs.h instead of shell.h. - enum.o: add an -I. flag in the compiler invocation so that defs.h can find its subsequent includes. src/cmd/builtin/pty.c: - Define one outofmemory() function and call that instead of repeating the error message call. - outofmemory() never returns, so remove superfluous exit handling. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-02-27 21:21:58 +00:00
Martijn Dekker	c928046aa9	Fix ${.sh.fun} leaking out of DEBUG trap The value of the ${.sh.fun} variable, which is supposed to contain the name of the function currently being executed, leaks out of the DEBUG trap if it executes a function. Reproducer: $ fn() { echo "executing the function"; } $ trap fn DEBUG $ trap - DEBUG executing the function $ echo ${.sh.fun} fn ${.sh.fun} should be empty outside the function. Annalysis: The sh_debug() function in xec.c, which executes the DEBUG trap action, contains these lines, which are part of restoring the state after running the trap action with sh_trap(): nv_putval(SH_PATHNAMENOD,shp->st.filename,NV_NOFREE); nv_putval(SH_FUNNAMENOD,shp->st.funname,NV_NOFREE); shp->st = savst; First the SH_PATHNAMENOD (${.sh.file}) and SH_FUNNAMENOD (${.sh.fun}) variables get restored from the values in the shell's scoped information struct (shp->st), but that is done before restoring the parent scope with 'shp->st = savst;'. It should be done after. Fixing the order is sufficient to fix the bug. However, I am not convinced that these nv_putval() calls are good for anything at all. Setting, unsetting, restoring, etc. the ${.sh.fun} and ${.sh.file} variables is already being handled perfectly well elsewhere in the code for executing functions and sourcing dot scripts. The DEBUG trap is neither here nor there. There's no reason for it to get involved with these variables. I was unable to break anything after simply removing those two lines. So I strongly suspect this is another case, out of many now, where a bug in ksh93 is properly fixed by removing some code. I couldn't get ${.sh.file} to leak similarly -- I think this is because SH_PATHNAMENOD (and not SH_FUNNOD) is set explicitly in exfile() in main.c, masking this incorrect restore. It is the only place where SH_PATHNAMENOD and SH_FUNNOD are not both set. src/cmd/ksh93/sh/xec.c: - Remove these two spurious nv_putval() calls. src/cmd/ksh93/tests/variables.sh: - Add regression test for leaking ${.sh.fun}.	2021-02-27 01:25:59 +00:00
Martijn Dekker	caf7ab6c71	Make PATH properly survive a shared-state ${ comsub; } Reproducer: $ ksh -c 'v=${ PATH=/dev/null; }; echo $PATH; whence ls' /dev/null /bin/ls The PATH=/dev/null assignment should survive the shared-state command substitution, and does, yet 'ls' is still found. The variable became inconsistent with the internal pathlist. This bugfix is from the 93v- beta. src/cmd/ksh93/sh/subshell.c: sh_subshell(): - Do not save and restore pathlist for a subshare. - A few other subshell tweaks from 93v- that made sense: . reset shp->subdup (bitmask for dups of 1) after saving it . use e_dot instead of "." for consistency . retry close(1) if it was interrupted src/cmd/ksh93/tests/path.sh: - Add test for this bug.	2021-02-23 22:16:06 +00:00
Johnothan King	733f70e94b	Fix many compiler warnings and remove unused variables (#191 ) Most of these changes remove unused variables, functions and labels to fix -Wunused compiler warnings. Somewhat notable changes: src/cmd/ksh93/bltins/print.c: - Removed the unused 'neg' variable. Patch from ksh2020: https://github.com/att/ast/pull/725 src/cmd/ksh93/bltins/sleep.c: - Initialized ns to fix three -Wsometimes-uninitialized warnings. src/cmd/ksh93/edit/{emacs,vi}.c: - Adjust strncpy size to fix two -Wstringop-truncation warnings. src/cmd/ksh93/include/shell.h: - The NOT_USED macro caused many -Wunused-value warnings, so it has been replaced with ksh2020's macro: `19d0620a` src/cmd/ksh93/sh/expand.c: - Removed an unnecessary 'ap = ' since 'ap' is never read between stakseek and stakfreeze. src/cmd/ksh93/edit/vi.c: refresh(): - Undef this function's 'w' macro at the end of it to stop it potentially interfering with future code changes. src/cmd/ksh93/sh/nvdisc.c, src/lib/libast/misc/magic.c, src/lib/libast/regex/regsubexec.c, src/lib/libast/sfio/sfpool.c, src/lib/libast/vmalloc/vmbest.c: - Fixed some indentation to silence -Wmisleading-indentation warnings. src/lib/libast/include/ast.h: - For clang, now only suppress hundreds of -Wparentheses warnings as well as a few -Wstring-plus-int warnings. Clang's -Wparentheses warns about things like if(foo = bar()) which assigns to foo and checks the assigned value. Clang wants us to change this into if((foo = bar())) Clang's -Wstring-plus-int warns about things like "string"+x where x is an integer, e.g. "string"+3 represents the string "ing". Clang wants us to change that to "string"[3] The original versions represent a perfectly valid coding style that was common in the 1980s and 1990s and is not going to change in this historic code base. (gcc does not complain about these.) Co-authored-by: Martijn Dekker <martijn@inlv.org>	2021-02-22 22:16:32 +00:00
Martijn Dekker	18529b88c6	Add lots of checks for out of memory (re: `0ce0b671`) Huge typeset -L/-R adjustment length values were still causing crashses on sytems with not enough memory. They should error out gracefully instead of crashing. This commit adds out of memory checks to all malloc/calloc/realloc calls that didn't have them (which is all but two or three). The stkalloc/stakalloc calls don't need the checks; it has automatic checking, which is done by passing a pointer to the outofspace() function to the stakinstall() call in init.c. src/lib/libast/include/error.h: - Change the ERROR_PANIC exit status value from ERROR_LEVEL (255) to 77, which is what it is supposed to be according to the libast error.3 manual page. Exit statuses > 128 for anything else than signals are not POSIX compliant and may cause misbehaviour. src/cmd/ksh93/include/defs.h, src/cmd/ksh93/sh/init.c: - To facilitate consistency, add a simple extern sh_outofmemory() function that throws an ERROR_PANIC "out of memory". src/cmd/ksh93/include/shell.h, src/cmd/ksh93/data/builtins.c: - Remove now-redundant e_nospace[] extern message; it is now only used in one place so it might as well be a string literal in sh_outofmemory(). All other changed files: - Verify the result of all malloc/calloc/realloc calls and call sh_outofmemory() if they fail.	2021-02-21 22:27:28 +00:00
hyenias	0ce0b67149	Fix segmentation fault for justified strings (re: `bdb99741`) (#190 ) Additional adjustments to previous commit `bdb9974` to correct crashes when the max size of a justified string is requested. This commit corrects the following: Before (Ubuntu 64bit): $ typeset -L $(((1<<31)-1)) s=h; typeset +p s Segmentation fault (core dumped) After: $ typeset -L $(((1<<31)-1)) s=h; typeset +p s typeset -L 2147483647 s src/cmd/ksh93/sh/name.c: nv_putval(): - Alter the variables size, dot, and append from int to unsigned int to prevent unwanted negative values from being expressed. - By creating size, dot, and append as unsigned ints; (unsigned) type casting is avoided.	2021-02-21 09:34:18 +00:00
Martijn Dekker	51b2e360fa	job_reap(): fix use of unitialised pointer This solves another intermittent crash that happened upon processing SIGWINCH in the emacs editor. See also: `7ff6b73b` I found this bug while testing ksh 93u+m on OpenBSD. Due to its pervasive security hardening, this system crashes a program reliably where others crash it intermittently, which is invaluable. src/cmd/ksh93/sh/jobs.c: job_reap(): - The pw pointer is not ever given a value if the loop breaks on line 318-319, but it is used unconditionally on lines 464-470, Initialise the pointer to null on function entry and do not call job_list() and job_unpost() if the pointer is still null.	2021-02-20 23:40:00 +00:00
Martijn Dekker	500757d78b	Error out on 'redirect >foo' inside ${ shared-state comsub; } The following caused an infinite loop: v=${ exec >/dev/tty; } v=${ redirect >/dev/tty; } Even the original authors didn't figure out how to 'exec >foo' or 'redirect >foo' inside a non-forking command substitution, so they fork it by calling sh_subfork(). If we delete that call, even normal command substitutions enter into that infinite loop. But of course a shared-state comsub can never fork as it would no longer share its state. Without a solution to make this work without forking, an error message is the only sensible thing left to do. src/cmd/ksh93/sh/io.c: sh_redirect(): - If we're redirecting standard output (1), the redirection is permanent as in 'exec'/'redirect' (flag==2), and we're in a subshare, then error out. Resolves: https://github.com/ksh93/ksh/issues/128	2021-02-20 19:52:08 +00:00
Martijn Dekker	bdb997415d	Fix multiple buffer overflows with justified strings (-L/-R/-Z) ksh crashed in various different and operating system-dependent ways when attempting to create or apply justification strings using typeset -L/-R/-Z, especially if large sizes are used. The crashes had two immediate causes: - In nv_newattr(), when applying justification attributes, a buffer was allocated for the justified string that was exactly 8 bytes longer than the original string. Any larger justification string caused a buffer overflow (!!!). - In nv_putval(), when applying existing attributes to a new value, the corresponding memmove() either did not zero-terminate the justified string (if the original string was longer than the justified string) or could read memory past the original string (if the original string was shorter than the justified string). Both scenarios can cause a crash. This commit fixes other minor issues as well, such as a mysterious 8 extra bytes allocated by several malloc/realloc calls. This may have been some naive attempt to paper over the above bugs. It seems no one can make any other kind of sense of it. A readjustment bug with zero-filling was also fixed. src/cmd/ksh93/sh/name.c: - nv_putval(): . Get rid of the magical +8 bytes for malloc and realloc. Just allocate one extra byte for the terminating zero. . Fix the memmove operation to use strncpy instead, so that buffer overflows are avoided in both scenarios described above. Also make it conditional upon a size adjustment actually happening (i.e. if 'dot' is nonzero). . Mild refactoring: combine two 'if(sp)' blocks into one; declare variables only used there locally for legibility. - nv_newattr(): * Replace the fatally broken "let's allocate string length + 8 bytes no matter the size of the adjustment" routine with a new one based on work by @hyenias (see comments in #142). It is efficient with memory, taking into account numeric types, growing strings, and shrinking strings. * Fix zero-filling in readjustment after changing the initial size of a -Z attribute. If the number was zero, all zeros were still skipped, leaving an empty string. Thanks to @hyenias for originally identifying this breakage and laying the groundwork for fixing nv_newattr(), and to @lijog for the crash analysis that revealed the key to the nv_putval() fix. Resolves: https://github.com/ksh93/ksh/issues/142 Resolves: https://github.com/ksh93/ksh/issues/181	2021-02-20 13:05:38 +00:00
Martijn Dekker	a959a35291	DEBUG trap: restore status 2 trigger to skip command (re: `d00b4b39`) So now we know what that faulty check for shp->indebug in sh_trap() was meant to do: it was meant to pass down the trap handler's exit status, via sh_debug(), down to sh_exec() (xec.c) so that it could then skip the execution of the next command if the trap's exit status is 2, as documented in the manual page. As of `d00b4b39`, exit status 2 was not passed down, so this stopped working. This commit reinstates that functionality, but without the exit status bug in command substitutions caused by the old way. src/cmd/ksh93/sh/fault.c: sh_trap(): - Save the trap's exit status before restoring the parent envionment's exit status. Make this saved exit status the return value of the function. (This does not break anything, AFAICT; the majority of sh_trap() calls ignore the return value, and the few that don't ignore it seem to expect it to return exactly this.) src/cmd/ksh93/sh/xec.c: sh_exec(): - The sh_trap() fix has one side effect: whereas the exit status of a skipped command was always 2 (as per the trap handler), now it is always 0, because it gets reset in sh_exec() but no command is executed. That is probably not a desirable change in behaviour, so let's fix that here instead: set sh.exitval to 2 when skipping commands. src/cmd/ksh93/sh.1: - Document that ${.sh.command} shell-quotes its arguments for use by 'eval' and such. This fact was not documented anywhere, AFAIK. src/cmd/ksh93/shell.3: - Document that $? (exit status) is made local to trap handlers. - Document that sh_trap() returns the trap handler's exit status. src/cmd/ksh93/tests/basic.sh: - Add test for this bug. - Add a missing test for the exit status 255 functionality (if a DEBUG trap handler yields this exit status and we're executing a function or dot script, a return is triggered). Fixes: https://github.com/ksh93/ksh/issues/187	2021-02-20 05:13:51 +00:00
Johnothan King	2b805f7f1c	Fix many spelling errors and word repetitions (#188 ) Many of the errors fixed in this commit are word repetitions such as 'the the' and minor spelling errors. One formatting error in the ksh man page has also been fixed.	2021-02-20 03:22:24 +00:00

1 2 3 4 5 ...

297 commits