1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
|
Smart pointers in aptitude
IMPORTANT NOTE:
All (or most of) the smart pointers documented here are deprecated
in favor of boost::shared_ptr, which is simpler, more flexible, and
probably more efficient (although I haven't checked that). New code
should use shared_ptr and/or scoped_ptr.
NEW STUFF:
The preferred convention for new code is to use boost's shared_ptr
and scoped_ptr classes. If you just have a private pointer that
should never be "owned" by other code, you might consider scoped_ptr.
Otherwise, shared_ptr is a better bet. You should think twice before
writing bare pointers.
shared_ptr is NOT MAGIC. You can still get memory leaks and crashes
if you misuse it; it is merely a tool to help you manage your object
lifetimes. The following notes will help.
* Do not write this code:
boost::shared_ptr<T> t = new T(...);
It is (potentially) exception-unsafe. Instead of thinking hard
about whether it's safe or not, do this, which is both safe and
more efficient:
boost::shared_ptr<T> t = boost::make_shared<T>(...);
* Be very careful about creating circular chains of shared_ptrs.
These cannot be reclaimed by the shared_ptr mechanism and will
result in memory leaks. Many of the notes below are just
ways of avoiding this.
* Although shared_ptr technically allows "sharing" of pointers, most
shared_ptr objects in aptitude should have a single owner at any
given time, just like bare pointers. For instance, look at how
search patterns are normally handled. pattern::create() might be
invoked from some subroutine. It creates a pattern in a
shared_ptr and returns it, passing ownership to its caller. Its
caller in turn invokes the constructor of another object, passing
the pattern's shared_ptr in. The constructor takes ownership and
stores the pattern in a member variable for later use.
The difference between shared_ptr and bare pointers is that
violating the above pattern will at worst produce a memory leak
rather than a crash.
* For "leaf" objects that don't store smart pointers to anything
else, using shared_ptr to freely share references is reasonable.
* Objects that form an acyclic graph can be safely
reference-counted. You can force the objects in a module to form
an acyclic graph by making their constructor(s) take pointers to
other objects, and by not providing a way to modify links after
the fact. For instance, the match pattern objects and the (not
shared) fragment objects use this approach. Be sure that you
don't have strong references to objects outside the module,
though, or you can still be bitten by this problem.
* Be very careful about embedding shared_ptrs into callbacks. If
the type you're embedding can't contain strong references, this is
OK. If it can, you can end up creating a hidden reference cycle
that will be hard to debug. Consider using boost's weak_ptr
object, which automatically becomes invalid when its parent
shared_ptr is destroyed.
Callbacks passed to the main loop are a special case, since those
are normally not stored permanently anywhere (i.e., they have no
strong incoming references), and there may be lifetime issues that
make weak references unacceptable. If you rely on this, be sure
to document it in comments and ensure that you really don't take a
strong reference to the callback. It's better not to do this, but
I know there were a few places that I ran into where it was handy.
* Because there is a cost to using shared_ptr, don't use it
willy-nilly, particularly for low-level code.
OLD DEPRECATED STUFF:
aptitude uses smart pointers in a number of ways internally. These
techniques generally simplify the code and make it easy to program
safely, but C++ being what it is, you should be aware of how they work
and what the caveats are.
* threading
As of this writing, none of the smart pointers lock their reference
counts (implementing locks in the imm::wtree class caused the
problem resolver's running time to increase by 50%!). Some of the
less-commonly-used ones may get locking in the future, but for the
time being you should handle objects that are given to another
thread carefully -- do deep copies of anything that's
reference-counted. (this is only the imm::* stuff at the moment, so
the problem is managable) For more information on threading, see
README.THREADS.
* reference-counted immutable values
aptitude employs reference-counting of immutable objects in several
places. For the most part this is invisible to clients; the typical
implementation looks like this:
class foo
{
class foo_impl
{
...
public:
void m1();
void m2(int);
...
};
public:
void m1();
void m2(int);
...
};
When you create a new "foo" object, a corresponding foo_impl is
created with reference-count 1. Reference counts are managed
in all the ways you expect, and the object is deleted when you're
done with it.
This use of reference-counting can be viewed as a performance hack:
since the objects are immutable, pass-by-value is indistinguishable
from pass-by-reference; if we can prove that no strong cycles will
be created, it's always safe to reference-count objects like this.
* auto_ptr for temporary return values
When building up a temporary return value, several routines use
auto_ptrs to store intermediates:
auto_ptr<T> v1 = parse_T_from_string(str1);
auto_ptr<T> v2 = parse_T_from_string(str2);
The advantage here is that you can freely throw exceptions and/or
break out of the function with an early "return" without worrying
about which of {v1,v2} have to be deleted. To actually return them,
you'd do something like this:
return pair<T*,T*>(v1.release(), v2.release());
The "release" calls tell v1 and v2 that they no longer "own" their
respective pointers.
* reference-counting of display widgets
Display widgets are also reference-counted. This is a
generalization of the idea that "the enclosing widget owns this
one"; it provides more support for extending the lifetime of a
widget if, for instance, you want to extract values from it after
its parent is done with it. Unfortunately, this reference counting
is NOT fully transparent, and you should be aware of some basic
principles:
- As with any reference counting scheme, it is up to you to
avoid creating strong cycles. By default, the only
non-transient strong references are references from a parent to
a child; your best bet is to ensure that all additional strong
references either have stack lifetime or do not close cycles.
If you must create a cycle, make sure it gets explicitly broken
before all external references to it are lost.
- Reference-counting is done via the generic ref_ptr class
(ref_ptr.h). This is a templated class, like auto_ptr, and
generally works as you expect. To enforce the use of ref_ptr,
all widget constructors are protected; static ::create methods
are provided to actually allocate a new widget. Because the
cwidget::widget class initializes its reference count to 1, the
::create method should explicitly decref() its return value; see
the existing ::create routines for examples.
To make code a bit more readable, adopt the convention of
creating typedefs for ref_ptr wrappers around new classes. For
instance, if you have just created the class vs_moo, add the
following line to its header file:
typedef ref_ptr<vs_moo> vs_moo_ref;
- Watch out for deletion of "this". If you aren't sure whether
"this" will be deleted in a method, I recommend creating a
method-scoped strong reference to it:
ref_ptr<this_type> thisref = this;
Doing so will prevent "this" from being deleted until the
current method terminates and is probably good practice in
general.
- Beware sigc::bind. sigc::bind is an easy way to create bad
circularities; moreover, it's actually unsafe to bind a ref_ptr
as a slot argument. The solution adopted in cwidget is to
exploit sigc++ weak references. If w is a ref_ptr, then rather
than closing w, you should close over w.weak_ref().
Unfortunately, w.weak_ref() will appear to the callee as a C++
reference, not a ref_ptr: if w has type T, you need a slot that
accepts a T&, not a ref_ptr<T>. To solve this problem, it is
conventional for widgets defining public interfaces that accept
a ref_ptr<T> to define a corresponding _bare method that accepts
a T&; the _bare method should simply instantiate a ref_ptr and
call the main interface. For instance,
void add_widget(const ref_ptr<cwidget::widget> &w);
void add_widget_bare(cwidget::widget &w)
{
add_widget(ref_ptr<cwidget::widget>(&w));
}
Obviously this is less than ideal, but it will work. Be aware,
though, that the bound argument is a *weak* reference: if there
are no strong references to a bound widget, the signal
connection will simply disappear.
|