Re: Relating to Domains..
Thank you Murray. I've followed your directions exactly. You can see
where I do indeed do the strace on the child process ID, however the
strace.out doesn't show much it seems. And also, I started it BEFORE
sending the message, the email was sent successfully, and then I
stopped strace. HOWEVER, again the postfix log shows NOTHING. But,
again, the email was received. Latest in maillog followed by the
strace stuff.
Maillog:
Sep 28 19:42:56 private opendkim[26710]: OpenDKIM Filter v2.4.2
starting (args: -x /etc/opendkim.conf -P
/var/run/opendkim/opendkim.pid)
Strace:
[root_at_private ~]# ps laxww | fgrep opendkim
1 101 26709 1 22 0 5480 736 wait Ss ? 0:00
/usr/local/sbin/opendkim -x /etc/opendkim.conf -P
/var/run/opendkim/opendkim.pid
1 101 26710 26709 15 0 25968 1500 stext Sl ? 0:00
/usr/local/sbin/opendkim -x /etc/opendkim.conf -P
/var/run/opendkim/opendkim.pid
0 0 26880 26847 18 0 3988 564 pipe_w S+ pts/1 0:00
fgrep opendkim
[root_at_private ~]# strace -o strace.out -f -p 26710
Process 26710 attached with 3 threads - interrupt to quit
Process 26710 detached
Process 26711 detached
Process 26712 detached
anaconda-ks.cfg install.log install.log.syslog strace.out
[root_at_private ~]# cat strace.out
26712 rt_sigtimedwait([HUP INT TERM], <unfinished ...>
26711 rt_sigtimedwait([USR1], <unfinished ...>
26710 select(4, [3], NULL, [3], {1, 441000}) = 0 (Timeout)
26710 select(4, [3], NULL, [3], {5, 0}) = 0 (Timeout)
26710 select(4, [3], NULL, [3], {5, 0}) = 0 (Timeout)
26710 select(4, [3], NULL, [3], {5, 0}) = 0 (Timeout)
26710 select(4, [3], NULL, [3], {5, 0}) = 0 (Timeout)
26710 select(4, [3], NULL, [3], {5, 0}) = 0 (Timeout)
On Wed, Sep 28, 2011 at 2:50 PM, Murray S. Kucherawy <msk_at_blackops.org> wrote:
> On Wed, 28 Sep 2011, N. wrote:
>>
>> I'm having a very weird problem trying to get that port 8891 "free". I
>> keep trying to kill processes, and more spawn. And its from the opendkim
>> user.
>>
>> Actually, I didn't have this error before and opendkim was starting fine.
>> But it just wasn't logging or signing email.
>>
>> Also, with this instruction Murray:
>>
>> "2) Start strace watching the child process. Use "ps laxww" to figure
>> out which one was the child."
>>
>> Could you link me to the text that discusses the "child"? I assume
>> this is opendkim "child" yes? I used ps laxww but I saw a lot of
>> things I wasn't familiar with. How do I identify the "child" in that
>> output? (what text will I see)? Also, do you happen to have a command
>> for strace handy that I should use to have it running in the
>> background? I really appreciate it.
>
> medusa[1127]% ps laxww | fgrep opendkim
> 1000 879 1 386 8 0 17608 0 wait IWs ?? 0:00.00
> /home/msk/auth/opendkim -x /home/msk/auth/opendkim.conf (opendkim-2.4.2)
> 1000 880 879 0 20 0 33944 16776 kserel S ?? 4:31.29
> /home/msk/auth/opendkim -x /home/msk/auth/opendkim.conf (opendkim-2.4.2)
>
> The first column is the UID of the process. The second is the pid of the
> process, and the third is the pid of its parent. You can see here that the
> second one is the child, as its parent is 879 which is also an opendkim
> process. The first one's parent is the init process, so it is not the
> opendkim child.
>
> For strace, try:
>
> strace -o strace.out -f -p 880
>
> (replace "880" with the process ID of your opendkim child process)
>
> ...and then send a message through postfix.
>
> You are probably seeing 8891 not free because opendkim is configured to
> restart the child when it dies. Instead, you need to kill the parent. Using
> "kill -9" is too severe to get it to shutdown cleanly unless you use it on
> both the parent and the child at the same time. Instead, using "kill -15"
> on the parent should complete the process.
>
> On the other hand, you can simplify your work by using "strace" on the
> process that "lsof" tells you is holding port 8891, and then run your
> postfix test.
>
Received on Thu Sep 29 2011 - 00:40:14 PST
This archive was generated by hypermail 2.3.0
: Mon Oct 29 2012 - 23:20:20 PST