|Version 3 (modified by rme, 7 years ago) (diff)|
Clozure CL provides a way for lisp objects to be watched so that a condition will be signaled when a thread attempts to write to the watched object. For a certain class of bugs (someone is changing this value, but I don't know who), this can be extremely helpful.
WATCH &optional object [function]
The WATCH function arranges for the specified object to be monitored for writes. This is accomplished by copying the object to its own set of virtual memory pages, which are then write-protected. This protection is enforced by the computer's memory-management hardware; the write-protection does not slow down reads at all.
When any write to the object is attempted, a WRITE-TO-WATCHED-OBJECT condition will be signaled.
When called with no arguments, WATCH returns a freshly-consed list of the objects currently being watched.
WATCH can monitor any memory-allocated lisp object.
In Clozure CL a memory-allocated object is either a cons cell or a uvector.
WATCH operates on cons cells, not lists. In order to watch a chain of cons cells, each cons cell must be watched individually. Because each watched cons cell takes up its own own virtual memory page (4 Kbytes), it's only feasible to watch relatively short lists.
If a memory-allocated object isn't a cons cell, then it is a vector-like object called a uvector.
A uvector is a memory-allocated lisp object whose first word is a header that describes the object's type and the number of elements that it contains.
So, a hash table is a uvector, as is a string, a standard instance, a double-float, a CL array or vector, and so forth.
Some CL objects, like strings and other simple vectors, map in a straightforward way onto the uvector representation. Simply watching the object directly is sufficient in these simple cases.
? (defvar *s* "xxxxx") *S* ? (watch *s*) "xxxxx" ? (setf (char *s* 3) #\o) > Error: Write to watched uvector "xxxxx" at index 3 > Faulting instruction: (movl (% eax) (@ -5 (% r15) (% rcx))) > While executing: SET-CHAR, in process listener(1). > Type :POP to abort, :R for a list of available restarts. > Type :? for other options.
In the case of more complicated objects (e.g., a hash-table, a standard-instance, a package, etc.), the elements of the uvector are like slots in a structure. It's necessary to know which one of those "slots" contains the data that will be changed when the object is written to.
An example might make this clearer. Suppose we have a standard-instance. It turns out that it is represented by uvector of three elements. One of those elements is an object called a slot-vector. To monitor writes to the slots of a standard-instance, it is necessary to watch the slot-vector, not the standard-instance itself.
? (defclass foo () (slot-a slot-b slot-c)) #<STANDARD-CLASS FOO> ? (defvar *a-foo* (make-instance 'foo)) *A-FOO* ? (watch *a-foo*) NIL ;;; Note that this doesn't catch the write.. ? (setf (slot-value *a-foo* 'slot-a) 'foo) FOO ;;; note use of internal accessor to get at the slot-vector ? (watch (ccl::instance-slots *a-foo*)) NIL ? (setf (slot-value *a-foo* 'slot-a) 'foo) > Error: Write to watched uvector #<SLOT-VECTOR #xD600D> at index 1 > Faulting instruction: (movq (% rsi) (@ -5 (% r8) (% rdi))) > While executing: %MAYBE-STD-SETF-SLOT-VALUE-USING-CLASS, in process listener(1). > Type :POP to abort, :R for a list of available restarts. > Type :? for other options.
Note that even though the write was to slot-a, the uvector index was 1 (not 0). This is because the first element of a slot-vector is a pointer to the instance that owns the slots.
UNWATCH object [function]
The UNWATCH function ensures that the specified object is in normal, non-monitored memory. If the object is not currently being watched, UNWATCH does nothing and returns NIL. Otherwise, the newly unwatched object is returned.
This condition is signaled when a watched object is written to. There are three slots of interest: object, offset, and instruction.
The object slot is the actual object that was written.
The offset slot is the byte offset from the tagged object pointer to the starting address of the write.
The instruction slot contains the disassembled machine instruction that attempted the write.
The :report function for this condition will attempt to use the offset to determine what uvector index (or which half, either the car or the cdr, for a cons cell) was attempted to be written. The disassembled instruction will also be displayed, if possible.
Two restarts are provided: one will skip over the faulting write instruction, and proceed; the other offers to unwatch the object and continue.
Although some care has been taken to minimize potential problems arising from watching and unwatching objects from multiple threads, there may well be subtle race conditions present that could cause bad behavior.
For example, suppose that a thread attempts to write to a watched object. This causes the operating system to generate an exception. The lisp kernel figures out what the exception is, and calls back into lisp to signal the write-to-watched-object condition and perhaps handle the error.
Now, as soon lisp code starts running again (for the callback), it's possible that some other thread could unwatch the very watched object that caused the exception, perhaps before we even have a chance to signal the condition, much less respond to it.
Having the object unwatched out from underneath a handler may at least confuse it, if not cause deeper trouble. Use caution with unwatch.
Here are a couple more examples in addition to the above examples of watching a string and a standard-instance.
? (defvar *f* (make-array '(2 3) :element-type 'double-float)) *F* ;;; Find the underlying data vector ? (ccl::array-data-and-offset *f*) #(0.0D0 0.0D0 0.0D0 0.0D0 0.0D0 0.0D0) 0 ? (watch *) #(0.0D0 0.0D0 0.0D0 0.0D0 0.0D0 0.0D0) ? (setf (aref *f* 1 2) pi) > Error: Write to watched uvector #<VECTOR 6 type DOUBLE-FLOAT, simple> at index 5 > Faulting instruction: (movq (% rax) (@ -5 (% r8) (% rdi))) > While executing: ASET, in process listener(1). > Type :POP to abort, :R for a list of available restarts. > Type :? for other options.
Note the the uvector index in the report is the row-major index of the element that was written to.
Hash tables are surprisingly complicated. The representation of a hash table includes an element called a hash-table-vector. The keys and values of the elements are stored pairwise in this vector.
? (defvar *h* (make-hash-table)) #<HASH-TABLE :TEST EQL size 0/60 #x3000412BF96D> ? (setf (gethash 'noise *h*) 'feep) FEEP ? (watch (ccl::nhash.vector *h*)) NIL ? (setf (gethash 'noise *h*) 'ding) > Error: Write to watched uvector #<HASH-TABLE-VECTOR #xD500D> at index 35 > Faulting instruction: (lock) > (cmpxchgq (% rsi) (@ (% r8) (% rdx))) > While executing: %STORE-NODE-CONDITIONAL, in process listener(1). > Type :POP to abort, :R for a list of available restarts. > Type :? for other options. ;;; see what value is being replaced... 1 > (uvref (write-to-watched-object-object *break-condition*) 35) FEEP ;;; backtrace shows useful context 1 > :b *(1A109F8) : 0 (%STORE-NODE-CONDITIONAL ???) NIL (1A10A50) : 1 (LOCK-FREE-PUTHASH NOISE #<HASH-TABLE :TEST EQL size 1/60 #x30004117F4ED> DING) 653 (1A10AC8) : 2 (CALL-CHECK-REGS PUTHASH NOISE #<HASH-TABLE :TEST EQL size 1/60 #x30004117F4ED> DING) 229 (1A10B00) : 3 (TOPLEVEL-EVAL (SETF (GETHASH # *H*) 'DING) NIL) 709 ...