-
Notifications
You must be signed in to change notification settings - Fork 1
Expand file tree
/
Copy pathtorque
More file actions
147 lines (99 loc) · 3.67 KB
/
torque
File metadata and controls
147 lines (99 loc) · 3.67 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
===============================================================
Tutorial
sudo su
apt-get install torque-server torque-client torque-mom torque-pam
# This installs with bad default setup.
# Redo setup.
/etc/init.d/torque-mom stop
/etc/init.d/torque-scheduler stop
/etc/init.d/torque-server stop
pbs_server -t create
killall pbs_server
echo localhost > /etc/torque/server_name
echo localhost > /var/spool/torque/server_priv/acl_svr/acl_hosts
echo root@localhost > /var/spool/torque/server_priv/acl_svr/operators
echo root@localhost > /var/spool/torque/server_priv/acl_svr/managers
echo "localhost np=8" > /var/spool/torque/server_priv/nodes
echo localhost > /var/spool/torque/mom_priv/config
/etc/init.d/torque-server start
/etc/init.d/torque-scheduler start
/etc/init.d/torque-mom start
# set scheduling properties
qmgr -c 'set server scheduling = true'
qmgr -c 'set server keep_completed = 10'
qmgr -c 'set server mom_job_sync = true'
# create default queue
qmgr -c 'create queue batch'
qmgr -c 'set queue batch queue_type = execution'
qmgr -c 'set queue batch started = true'
qmgr -c 'set queue batch enabled = true'
qmgr -c 'set queue batch resources_default.walltime = 300:00:00'
qmgr -c 'set queue batch resources_default.nodes = 1'
qmgr -c 'set server default_queue = batch'
# configure submission pool
qmgr -c 'set server submit_hosts = SERVER'
qmgr -c 'set server allow_node_submit = true'
===============================================================
Another tutorial
https://wiki.archlinux.org/index.php/TORQUE
1. Install torque-server and torque-mom
2. Check /etc/hosts/ for hostnames, or use 'host'
3. Put hostname in /var/spool/torque/server_name
4. Run pbs_server -t create
5. Use the following to set things:
qmgr -c "set server acl_hosts = <hostname>"
qmgr -c "set server scheduling=true"
qmgr -c "create queue batch queue_type=execution"
qmgr -c "set queue batch started=true"
qmgr -c "set queue batch enabled=true"
qmgr -c "set queue batch resources_default.nodes=1"
qmgr -c "set queue batch resources_default.walltime=3600"
qmgr -c "set server default_queue=batch"
6. Add nodes to /var/spool/torque/server_priv/nodes
mars np=4
phobos np=2
deimos np=2
7. Restart everything.
8. To start running jobs, use 'pbs_sched', 'pbs_mom'.
9. 'pbsnodes -a' to check on things.
This doesn't work for me anymore.
Trying downloading the tarball...
===============================================================
Another tutorial.
Reference:
http://ubuntuforums.org/showthread.php?t=289767
sudo apt-get install libxml2-dev libssl-dev libboost-dev
configure
make
sudo su
make install
# This may be causing server to be called "localhost" by default.
torque.setup brian
cd /var/spool/torque
# Make sure /etc/hosts has hostname in it. Workaround: use alias for localhost.
echo "veritas" > /var/spool/torque/server_name
pbsserver -t create
trqauthd
qmgr -c "set server acl_hosts = veritas"
qmgr -c "set server scheduling=true"
qmgr -c "create queue batch queue_type=execution"
qmgr -c "set queue batch started=true"
qmgr -c "set queue batch enabled=true"
qmgr -c "set queue batch resources_default.nodes=1"
qmgr -c "set queue batch resources_default.walltime=3600"
qmgr -c "set server default_queue=batch"
qmgr -c "set server keep_completed = 86400"
echo "veritas np=8" > /var/spool/torque/server_priv/nodes
echo "pbsserver veritas" > /var/spool/torque/mom_priv/config
echo "logevent 255" > /var/spool/torque/mom_priv/config
killall -s 9 pbs_server
pbs_server
pbs_mom
# To check:
pbsnodes -a
# If node is down, there's a problem with /etc/hosts, try using aliases or
# renaming.
pbs_sched
# Now everything should be working!
# To configure the scheduling policy:
sudo vi /var/spool/torque/sched_priv/sched_config