首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >子进程中的多个管道

子进程中的多个管道
EN

Stack Overflow用户
提问于 2015-03-03 19:42:44
回答 2查看 4.8K关注 0票数 9

我试图在ruffus管道中使用将多个fastq文件作为参数的Sailfish。我使用python中的子流程模块执行Sailfish,但是子进程调用中的<()即使在设置shell=True时也不能工作。

下面是我想使用python执行的命令:

代码语言:javascript
复制
sailfish quant [options] -1 <(cat sample1a.fastq sample1b.fastq) -2 <(cat sample2a.fastq sample2b.fastq) -o [output_file]

或(最好):

代码语言:javascript
复制
sailfish quant [options] -1 <(gunzip sample1a.fastq.gz sample1b.fastq.gz) -2 <(gunzip sample2a.fastq.gz sample2b.fastq.gz) -o [output_file]

概括:

代码语言:javascript
复制
someprogram <(someprocess) <(someprocess)

我该如何在python中这样做呢?子流程是正确的方法吗?

EN

回答 2

Stack Overflow用户

回答已采纳

发布于 2015-03-03 20:03:54

模仿bash过程替代

代码语言:javascript
复制
#!/usr/bin/env python
from subprocess import check_call

check_call('someprogram <(someprocess) <(anotherprocess)',
           shell=True, executable='/bin/bash')

在Python中,您可以使用命名管道:

代码语言:javascript
复制
#!/usr/bin/env python
from subprocess import Popen

with named_pipes(n=2) as paths:
    someprogram = Popen(['someprogram'] + paths)
    processes = []
    for path, command in zip(paths, ['someprocess', 'anotherprocess']):
        with open(path, 'wb', 0) as pipe:
            processes.append(Popen(command, stdout=pipe, close_fds=True))
    for p in [someprogram] + processes:
        p.wait()

其中named_pipes(n)是:

代码语言:javascript
复制
import os
import shutil
import tempfile
from contextlib import contextmanager

@contextmanager
def named_pipes(n=1):
    dirname = tempfile.mkdtemp()
    try:
        paths = [os.path.join(dirname, 'named_pipe' + str(i)) for i in range(n)]
        for path in paths:
            os.mkfifo(path)
        yield paths
    finally:
        shutil.rmtree(dirname)

实现bash进程替换的另一种更可取的方法(不需要在磁盘上创建命名条目)是将/dev/fd/N文件名(如果可用的话)作为@Dunes建议。在FreeBSD,)为进程打开的所有文件描述符创建条目。。要测试可用性,请运行:

代码语言:javascript
复制
$ test -r /dev/fd/3 3</dev/null && echo /dev/fd is available

如果失败,请尝试将/dev/fdproc(5)进行符号链接,就像在某些Linuxes上所做的那样:

代码语言:javascript
复制
$ ln -s /proc/self/fd /dev/fd

下面是/dev/fd-based实现someprogram <(someprocess) <(anotherprocess) bash命令:

代码语言:javascript
复制
#!/usr/bin/env python3
from contextlib import ExitStack
from subprocess import CalledProcessError, Popen, PIPE

def kill(process):
    if process.poll() is None: # still running
        process.kill()

with ExitStack() as stack: # for proper cleanup
    processes = []
    for command in [['someprocess'], ['anotherprocess']]:  # start child processes
        processes.append(stack.enter_context(Popen(command, stdout=PIPE)))
        stack.callback(kill, processes[-1]) # kill on someprogram exit

    fds = [p.stdout.fileno() for p in processes]
    someprogram = stack.enter_context(
        Popen(['someprogram'] + ['/dev/fd/%d' % fd for fd in fds], pass_fds=fds))
    for p in processes: # close pipes in the parent
        p.stdout.close()
# exit stack: wait for processes
if someprogram.returncode != 0: # errors shouldn't go unnoticed
   raise CalledProcessError(someprogram.returncode, someprogram.args)

注意:在我的Ubuntu机器上,subprocess代码只在Python3.2中工作,尽管从Python3.2开始pass_fds就可用了。

票数 11
EN

Stack Overflow用户

发布于 2015-03-03 23:27:20

虽然J.F. Sebastian提供了一个使用命名管道的答案,但是使用匿名管道可以做到这一点。

代码语言:javascript
复制
import shlex
from subprocess import Popen, PIPE

inputcmd0 = "zcat hello.gz" # gzipped file containing "hello"
inputcmd1 = "zcat world.gz" # gzipped file containing "world"

def get_filename(file_):
    return "/dev/fd/{}".format(file_.fileno())

def get_stdout_fds(*processes):
    return tuple(p.stdout.fileno() for p in processes)

# setup producer processes
inputproc0 = Popen(shlex.split(inputcmd0), stdout=PIPE)
inputproc1 = Popen(shlex.split(inputcmd1), stdout=PIPE)

# setup consumer process
# pass input processes pipes by "filename" eg. /dev/fd/5
cmd = "cat {file0} {file1}".format(file0=get_filename(inputproc0.stdout), 
    file1=get_filename(inputproc1.stdout))
print("command is:", cmd)
# pass_fds argument tells Popen to let the child process inherit the pipe's fds
someprogram = Popen(shlex.split(cmd), stdout=PIPE, 
    pass_fds=get_stdout_fds(inputproc0, inputproc1))

output, error = someprogram.communicate()

for p in [inputproc0, inputproc1, someprogram]:
    p.wait()

assert output == b"hello\nworld\n"
票数 3
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/28840575

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档