From Build to Launch

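What happens after we press Command+B? What happens before an app launches? How is a process created? Where does the main thread come from? How is the ASLR offset generated?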

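Background

This article does not focus on the details of compilation (preprocessing, lexical analysis, and so on). It focuses on everything else: what Xcode does besides compiling, and what happens before an app launches, that is, before dyld takes over.

Let's start with part one: the build.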

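Xcode build

An Xcode build is mainly a collection of the following tasks:

Compiling and linking source code.
Copying and processing resources (assets, storyboards, headers).
Code signing, validation, and custom scripts.

These tasks run in a specific order, which is determined by the dependencies between them.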

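We can work backwards through these dependencies, as shown below (image source: WWDC):

The very first step of a build parses the project configuration and resolves the dependencies, producing a directed graph.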
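Ordering tasks by a directed dependency graph can be sketched as a simple topological ordering. The task names and edges below are invented for illustration, and this is not a description of how Xcode's build system is implemented internally.

```c
/* Hypothetical build tasks, used only for illustration. */
enum build_task { COMPILE, LINK, COPY_RESOURCES, SIGN, TASK_COUNT };

/* deps[a][b] != 0 means task a must wait for task b. */
static const int deps[TASK_COUNT][TASK_COUNT] = {
    [LINK] = { [COMPILE] = 1 },
    [SIGN] = { [LINK] = 1, [COPY_RESOURCES] = 1 },
};

/*
 * Fill `order` with a valid execution order by repeatedly picking tasks
 * whose dependencies have all run. Returns the number of tasks scheduled;
 * a result smaller than TASK_COUNT means the graph contains a cycle.
 */
int schedule_tasks(int order[TASK_COUNT])
{
    int done[TASK_COUNT] = {0};
    int count = 0;
    int progressed = 1;

    while (progressed) {
        progressed = 0;
        for (int t = 0; t < TASK_COUNT; t++) {
            if (done[t]) {
                continue;
            }
            int ready = 1;
            for (int d = 0; d < TASK_COUNT; d++) {
                if (deps[t][d] && !done[d]) {
                    ready = 0;
                }
            }
            if (ready) {
                done[t] = 1;
                order[count++] = t;
                progressed = 1;
            }
        }
    }
    return count;
}
```

With these edges, signing is always scheduled last because it waits for both linking and resource copying.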

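Where dependencies come from

Dependencies mainly come from the following sources:

Xcode's built-in build rules, for example for the compiler, the linker, and asset catalogs. These rules determine which files are inputs and which are outputs.
Explicitly declared dependencies: Target Dependencies.
Implicitly declared dependencies: Linked Frameworks and Libraries.
Build Phases.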

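Build optimization

Incremental builds

From the directed graph above, we know that a single edit would otherwise re-run every task in the graph. To speed up builds and avoid unnecessary work, Xcode builds incrementally.

Every build task has a signature, made up of the task's input paths, modification times, compiler version information, and other task metadata.
Before running a task, the build system checks whether the stored signature still matches the current one, and skips the task if it does.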
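The signature check can be sketched roughly as follows. The struct fields follow the inputs listed above, but the FNV-1a hash and the function names are assumptions made for this example; the article does not say how Xcode actually computes or stores signatures.

```c
#include <stdint.h>
#include <string.h>

/* Inputs the text lists as part of a task signature. */
struct task_inputs {
    const char *input_path;
    int64_t     mtime;             /* modification time of the input */
    const char *compiler_version;
};

/* FNV-1a, used here only as a stand-in hash function. */
static uint64_t fnv1a(uint64_t h, const void *data, size_t len)
{
    const unsigned char *p = data;
    for (size_t i = 0; i < len; i++) {
        h ^= p[i];
        h *= 1099511628211ULL;     /* FNV prime */
    }
    return h;
}

/* Combine all inputs into one 64-bit signature. */
uint64_t task_signature(const struct task_inputs *in)
{
    uint64_t h = 14695981039346656037ULL;   /* FNV offset basis */
    h = fnv1a(h, in->input_path, strlen(in->input_path));
    h = fnv1a(h, &in->mtime, sizeof(in->mtime));
    h = fnv1a(h, in->compiler_version, strlen(in->compiler_version));
    return h;
}

/* A task can be skipped when its stored signature still matches. */
int can_skip(uint64_t stored, const struct task_inputs *current)
{
    return stored == task_signature(current);
}
```

Touching the input file changes its modification time, so the signature no longer matches and the task runs again.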

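Parallel builds

In the scheme settings you can choose whether to enable parallel builds (enabled by default).

[Screenshots: measured build times with a warm cache, parallel builds enabled vs. disabled]

[Screenshots: measured build times after a clean, parallel builds enabled vs. disabled]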

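Case study

This is one of the author's real projects. Let's look at its build process first.

Xcode build log

Here is a rough outline of the build:

Create the app bundle.
Create the library directory.
Write files, including Entitlements.plist and xxx.hmap.
Run the CocoaPods pre-compile script, which checks the Manifest.lock file.
Write the corresponding files, then compile the .m files into .o files.
Compile assets, compile storyboards, link storyboards.
Write debug information.
Run the CocoaPods post-compile script, which copies the frameworks built by the CocoaPods targets.
Copy the Swift standard libraries.
Sign and validate the app.

Next, we look more closely at some of these steps, which were also covered at WWDC.
The first step of the build creates the app bundle. This only happens on the first build; later incremental builds do not create it again.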

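headermap

Let's look at the contents of a headermap first. There is a tool for viewing hmap files.

As we can see, a headermap is simply a helper file that lets the compiler find headers: it stores the mapping from each header file to its physical path.
Now let's see how the compiler uses it. Copy a complete Compile command from the build log and append -v to it.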
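Conceptually, the lookup a headermap enables looks like the sketch below. This is a deliberate simplification: a real .hmap file is a binary hash table rather than a C array, and the header names and paths here are hypothetical.

```c
#include <stddef.h>
#include <string.h>

/* One entry: the name written in #import -> where the file really lives. */
struct hmap_entry {
    const char *key;    /* header name as written in source */
    const char *path;   /* hypothetical on-disk location */
};

/* Hypothetical contents, for illustration only. */
static const struct hmap_entry demo_hmap[] = {
    { "Foo.h", "/project/Sources/Foo/Foo.h" },
    { "Bar.h", "/project/Sources/Bar/Bar.h" },
};

/*
 * Return the mapped path, or NULL so that the caller can fall back
 * to its ordinary header search paths.
 */
const char *hmap_lookup(const char *name)
{
    for (size_t i = 0; i < sizeof(demo_hmap) / sizeof(demo_hmap[0]); i++) {
        if (strcmp(demo_hmap[i].key, name) == 0) {
            return demo_hmap[i].path;
        }
    }
    return NULL;
}
```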

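linkfilelist

This file lists every object file (.o) that needs to be linked. It is different from the Link Map File: the former is a helper file needed at link time, while the latter is a product of linking. Here is a concrete example: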

/Users/xxx/Library/Developer/Xcode/DerivedData/sqt-ios-garxjzclbjremabwulpvhqmspmyr/Build/Intermediates.noindex/sqt-ios.build/Debug-iphoneos/sqt-ios.build/Objects-normal/arm64/NIMSessionMessageContentView.o
/Users/xxx/Library/Developer/Xcode/DerivedData/sqt-ios-garxjzclbjremabwulpvhqmspmyr/Build/Intermediates.noindex/sqt-ios.build/Debug-iphoneos/sqt-ios.build/Objects-normal/arm64/SQTUserCommerceInfoModel.o
/Users/xxx/Library/Developer/Xcode/DerivedData/sqt-ios-garxjzclbjremabwulpvhqmspmyr/Build/Intermediates.noindex/sqt-ios.build/Debug-iphoneos/sqt-ios.build/Objects-normal/arm64/SQTAlipayAuthInfoAPI.o
/Users/xxx/Library/Developer/Xcode/DerivedData/sqt-ios-garxjzclbjremabwulpvhqmspmyr/Build/Intermediates.noindex/sqt-ios.build/Debug-iphoneos/sqt-ios.build/Objects-normal/arm64/SQTCommercaListVC.o
/Users/xxx/Library/Developer/Xcode/DerivedData/sqt-ios-garxjzclbjremabwulpvhqmspmyr/Build/Intermediates.noindex/sqt-ios.build/Debug-iphoneos/sqt-ios.build/Objects-normal/arm64/CTMediator+MemberList.o
/Users/xxx/Library/Developer/Xcode/DerivedData/sqt-ios-garxjzclbjremabwulpvhqmspmyr/Build/Intermediates.noindex/sqt-ios.build/Debug-iphoneos/sqt-ios.build/Objects-normal/arm64/SQTTopicSupportAPI.o
/Users/xxx/Library/Developer/Xcode/DerivedData/sqt-ios-garxjzclbjremabwulpvhqmspmyr/Build/Intermediates.noindex/sqt-ios.build/Debug-iphoneos/sqt-ios.build/Objects-normal/arm64/NIMVideoContentConfig.o
/Users/xxx/Library/Developer/Xcode/DerivedData/sqt-ios-garxjzclbjremabwulpvhqmspmyr/Build/Intermediates.noindex/sqt-ios.build/Debug-iphoneos/sqt-ios.build/Objects-normal/arm64/CALayer+YYAdd.o
/Users/xxx/Library/Developer/Xcode/DerivedData/sqt-ios-garxjzclbjremabwulpvhqmspmyr/Build/Intermediates.noindex/sqt-ios.build/Debug-iphoneos/sqt-ios.build/Objects-normal/arm64/SQTMemoryCache.o

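Now let's see when it is used:

The command is too long to show in full here (the demo is a real project).

Now for part two: app launch.

Launch

Process creation

In the UNIX model (which OS X also follows), there is no notion of an "empty" or "brand-new" process. A process cannot be created from scratch; it can only be duplicated from an existing one through the fork system call, as shown in the figure below:

All creation paths eventually converge on fork1. Note that the vfork path does not create a real Mach-level process. The real creation is done later by execve, which is covered in the section on binary loading.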
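From user space, the fork-then-exec pattern described above looks roughly like this. It is a minimal sketch using the standard POSIX calls; the helper name `run_child` is made up for this example.

```c
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/*
 * Run argv[0] in a child process and return its exit status,
 * or -1 if the child could not be created or did not exit normally.
 */
int run_child(char *const argv[])
{
    pid_t pid = fork();           /* duplicate the calling process */
    if (pid < 0) {
        return -1;
    }
    if (pid == 0) {
        execv(argv[0], argv);     /* replace the copy with a new image */
        _exit(127);               /* only reached if execv failed */
    }

    int status = 0;
    if (waitpid(pid, &status, 0) < 0 || !WIFEXITED(status)) {
        return -1;
    }
    return WEXITSTATUS(status);
}
```

The child is an exact copy of the parent until execv succeeds, which is why the interesting work happens in the exec path examined next.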

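Loading the binary

A process "created" by the flow above does nothing useful until it executes another program. The core of process creation is therefore loading and executing a binary.

__mac_execve

With that in mind, let's walk through the loading process: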


int
__mac_execve(proc_t p, struct __mac_execve_args *uap, int32_t *retval)
{
    char *bufp = NULL;
    struct image_params *imgp;
    struct vnode_attr *vap;
    struct vnode_attr *origvap;
    int error;
    int is_64 = IS_64BIT_PROCESS(p);
    struct vfs_context context;
    struct uthread  *uthread;
    task_t old_task = current_task();
    task_t new_task = NULL;
    boolean_t should_release_proc_ref = FALSE;
    boolean_t exec_done = FALSE;
    boolean_t in_vfexec = FALSE;
    void *inherit = NULL;

    context.vc_thread = current_thread();
    context.vc_ucred = kauth_cred_proc_ref(p);      /* XXX must NOT be kauth_cred_get() */

    /* Allocate a big chunk for locals instead of using stack since these
     * structures a pretty big.
     */
    // Allocate the memory
    MALLOC(bufp, char *, (sizeof(*imgp) + sizeof(*vap) + sizeof(*origvap)), M_TEMP, M_WAITOK | M_ZERO);
    imgp = (struct image_params *) bufp;
    if (bufp == NULL) {
        error = ENOMEM;
        goto exit_with_error;
    }
    vap = (struct vnode_attr *) (bufp + sizeof(*imgp));
    origvap = (struct vnode_attr *) (bufp + sizeof(*imgp) + sizeof(*vap));
    // Initialize the image_params (ip) structure
    /* Initialize the common data in the image_params structure */
    imgp->ip_user_fname = uap->fname;
    imgp->ip_user_argv = uap->argp;
    imgp->ip_user_envv = uap->envp;
    imgp->ip_vattr = vap;
    imgp->ip_origvattr = origvap;
    imgp->ip_vfs_context = &context;
    imgp->ip_flags = (is_64 ? IMGPF_WAS_64BIT_ADDR : IMGPF_NONE) | ((p->p_flag & P_DISABLE_ASLR) ? IMGPF_DISABLE_ASLR : IMGPF_NONE);
    imgp->ip_seg = (is_64 ? UIO_USERSPACE64 : UIO_USERSPACE32);
    imgp->ip_mac_return = 0;
    imgp->ip_cs_error = OS_REASON_NULL;
    imgp->ip_simulator_binary = IMGPF_SB_DEFAULT;

#if CONFIG_MACF
    if (uap->mac_p != USER_ADDR_NULL) {
        error = mac_execve_enter(uap->mac_p, imgp);
        if (error) {
            kauth_cred_unref(&context.vc_ucred);
            goto exit_with_error;
        }
    }
#endif
    // Get the BSD thread that corresponds to the Mach thread: thread_t -> uthread_t
    uthread = get_bsdthread_info(current_thread());
    if (uthread->uu_flag & UT_VFORK) {
        imgp->ip_flags |= IMGPF_VFORK_EXEC;
        in_vfexec = TRUE;
    } else {
        imgp->ip_flags |= IMGPF_EXEC;

        /*
         * For execve case, create a new task and thread
         * which points to current_proc. The current_proc will point
         * to the new task after image activation and proc ref drain.
         *
         * proc (current_proc) <-----  old_task (current_task)
         *  ^ |                                ^
         *  | |                                |
         *  | ----------------------------------
         *  |
         *  --------- new_task (task marked as TF_EXEC_COPY)
         *
         * After image activation, the proc will point to the new task
         * and would look like following.
         *
         * proc (current_proc)  <-----  old_task (current_task, marked as TPF_DID_EXEC)
         *  ^ |
         *  | |
         *  | ----------> new_task
         *  |               |
         *  -----------------
         *
         * During exec any transition from new_task -> proc is fine, but don't allow
         * transition from proc->task, since it will modify old_task.
         */
        imgp->ip_new_thread = fork_create_child(old_task,
            NULL,
            p,
            FALSE,
            p->p_flag & P_LP64,
            task_get_64bit_data(old_task),
            TRUE);
        /* task and thread ref returned by fork_create_child */
        if (imgp->ip_new_thread == NULL) {
            error = ENOMEM;
            goto exit_with_error;
        }

        new_task = get_threadtask(imgp->ip_new_thread);
        context.vc_thread = imgp->ip_new_thread;
    }
    // Main logic for processing the binary
    error = exec_activate_image(imgp);
    /* thread and task ref returned for vfexec case */

    if (imgp->ip_new_thread != NULL) {
        /*
         * task reference might be returned by exec_activate_image
         * for vfexec.
         */
        new_task = get_threadtask(imgp->ip_new_thread);
#if defined(HAS_APPLE_PAC)
        ml_task_set_disable_user_jop(new_task, imgp->ip_flags & IMGPF_NOJOP ? TRUE : FALSE);
        ml_thread_set_disable_user_jop(imgp->ip_new_thread, imgp->ip_flags & IMGPF_NOJOP ? TRUE : FALSE);
#endif
    }

    if (!error && !in_vfexec) {
        p = proc_exec_switch_task(p, old_task, new_task, imgp->ip_new_thread);
        /* proc ref returned */
        should_release_proc_ref = TRUE;

        /*
         * Need to transfer pending watch port boosts to the new task while still making
         * sure that the old task remains in the importance linkage. Create an importance
         * linkage from old task to new task, then switch the task importance base
         * of old task and new task. After the switch the port watch boost will be
         * boosting the new task and new task will be donating importance to old task.
         */
        inherit = ipc_importance_exec_switch_task(old_task, new_task);
    }

    kauth_cred_unref(&context.vc_ucred);

    /* Image not claimed by any activator? */
    if (error == -1) {
        error = ENOEXEC;
    }

    if (!error) {
        exec_done = TRUE;
        assert(imgp->ip_new_thread != NULL);

        exec_resettextvp(p, imgp);
        error = check_for_signature(p, imgp);
    }

    /* flag exec has occurred, notify only if it has not failed due to FP Key error */
    if (exec_done && ((p->p_lflag & P_LTERM_DECRYPTFAIL) == 0)) {
        proc_knote(p, NOTE_EXEC);
    }

    if (imgp->ip_vp != NULLVP) {
        vnode_put(imgp->ip_vp);
    }
    if (imgp->ip_scriptvp != NULLVP) {
        vnode_put(imgp->ip_scriptvp);
    }
    if (imgp->ip_strings) {
        execargs_free(imgp);
    }
#if CONFIG_MACF
    if (imgp->ip_execlabelp) {
        mac_cred_label_free(imgp->ip_execlabelp);
    }
    if (imgp->ip_scriptlabelp) {
        mac_vnode_label_free(imgp->ip_scriptlabelp);
    }
#endif
    if (imgp->ip_cs_error != OS_REASON_NULL) {
        os_reason_free(imgp->ip_cs_error);
        imgp->ip_cs_error = OS_REASON_NULL;
    }

    if (!error) {
        /*
         * We need to initialize the bank context behind the protection of
         * the proc_trans lock to prevent a race with exit. We can't do this during
         * exec_activate_image because task_bank_init checks entitlements that
         * aren't loaded until subsequent calls (including exec_resettextvp).
         */
        error = proc_transstart(p, 0, 0);
    }

    if (!error) {
        task_bank_init(new_task);
        proc_transend(p, 0);

#if __arm64__
        proc_legacy_footprint_entitled(p, new_task, __FUNCTION__);
#endif /* __arm64__ */

        /* Sever any extant thread affinity */
        thread_affinity_exec(current_thread());

        /* Inherit task role from old task to new task for exec */
        if (!in_vfexec) {
            proc_inherit_task_role(new_task, old_task);
        }

        thread_t main_thread = imgp->ip_new_thread;

        task_set_main_thread_qos(new_task, main_thread);

#if CONFIG_ARCADE
        /*
         * Check to see if we need to trigger an arcade upcall AST now
         * that the vnode has been reset on the task.
         */
        arcade_prepare(new_task, imgp->ip_new_thread);
#endif /* CONFIG_ARCADE */

#if CONFIG_MACF
        /*
         * Processes with the MAP_JIT entitlement are permitted to have
         * a jumbo-size map.
         */
        if (mac_proc_check_map_anon(p, 0, 0, 0, MAP_JIT, NULL) == 0) {
            vm_map_set_jumbo(get_task_map(new_task));
            vm_map_set_jit_entitled(get_task_map(new_task));
        }
#endif /* CONFIG_MACF */

        if (vm_darkwake_mode == TRUE) {
            /*
             * This process is being launched when the system
             * is in darkwake. So mark it specially. This will
             * cause all its pages to be entered in the background Q.
             */
            task_set_darkwake_mode(new_task, vm_darkwake_mode);
        }

#if CONFIG_DTRACE
        dtrace_thread_didexec(imgp->ip_new_thread);

        if ((dtrace_proc_waitfor_hook = dtrace_proc_waitfor_exec_ptr) != NULL) {
            (*dtrace_proc_waitfor_hook)(p);
        }
#endif

#if CONFIG_AUDIT
        if (!error && AUDIT_ENABLED() && p) {
            /* Add the CDHash of the new process to the audit record */
            uint8_t *cdhash = cs_get_cdhash(p);
            if (cdhash) {
                AUDIT_ARG(data, cdhash, sizeof(uint8_t), CS_CDHASH_LEN);
            }
        }
#endif

        if (in_vfexec) {
            vfork_return(p, retval, p->p_pid);
        }
    } else {
        DTRACE_PROC1(exec__failure, int, error);
    }

exit_with_error:

    /*
     * clear bsd_info from old task if it did exec.
     */
    if (task_did_exec(old_task)) {
        set_bsdtask_info(old_task, NULL);
    }

    /* clear bsd_info from new task and terminate it if exec failed  */
    if (new_task != NULL && task_is_exec_copy(new_task)) {
        set_bsdtask_info(new_task, NULL);
        task_terminate_internal(new_task);
    }

    if (imgp != NULL) {
        /* Clear the initial wait on the thread transferring watchports */
        if (imgp->ip_new_thread) {
            task_clear_return_wait(get_threadtask(imgp->ip_new_thread), TCRW_CLEAR_INITIAL_WAIT);
        }

        /* Transfer the watchport boost to new task */
        if (!error && !in_vfexec) {
            task_transfer_turnstile_watchports(old_task,
                new_task, imgp->ip_new_thread);
        }
        /*
         * Do not terminate the current task, if proc_exec_switch_task did not
         * switch the tasks, terminating the current task without the switch would
         * result in loosing the SIGKILL status.
         */
        if (task_did_exec(old_task)) {
            /* Terminate the current task, since exec will start in new task */
            task_terminate_internal(old_task);
        }

        /* Release the thread ref returned by fork_create_child */
        if (imgp->ip_new_thread) {
            /* wake up the new exec thread */
            task_clear_return_wait(get_threadtask(imgp->ip_new_thread), TCRW_CLEAR_FINAL_WAIT);
            thread_deallocate(imgp->ip_new_thread);
            imgp->ip_new_thread = NULL;
        }
    }

    /* Release the ref returned by fork_create_child */
    if (new_task) {
        task_deallocate(new_task);
        new_task = NULL;
    }

    if (should_release_proc_ref) {
        proc_rele(p);
    }

    if (bufp != NULL) {
        FREE(bufp, M_TEMP);
    }

    if (inherit != NULL) {
        ipc_importance_release(inherit);
    }

    return error;
}

The main work here is building the image_params structure, which is defined as follows:

struct image_params {
    user_addr_t     ip_user_fname;          /* argument: file name */
    user_addr_t     ip_user_argv;           /* argument: argument vector */
    user_addr_t     ip_user_envv;           /* argument: environment vector */
    int             ip_seg;                 /* segment for arguments */
    struct vnode    *ip_vp;                 /* file */
    struct vnode_attr       *ip_vattr;      /* run file attributes */
    struct vnode_attr       *ip_origvattr;  /* invocation file attributes */
    cpu_type_t      ip_origcputype;         /* cputype of invocation file */
    cpu_subtype_t   ip_origcpusubtype;      /* subtype of invocation file */
    char            *ip_vdata;              /* file data (up to one page) */
    int             ip_flags;               /* image flags */
    int             ip_argc;                /* argument count */
    int             ip_envc;                /* environment count */
    int             ip_applec;              /* apple vector count */

    char            *ip_startargv;          /* argument vector beginning */
    char            *ip_endargv;            /* end of argv/start of envv */
    char            *ip_endenvv;            /* end of envv/start of applev */

    char            *ip_strings;            /* base address for strings */
    char            *ip_strendp;            /* current end pointer */

    int             ip_argspace;            /* remaining space of NCARGS limit (argv+envv) */
    int             ip_strspace;            /* remaining total string space */

    user_size_t     ip_arch_offset;         /* subfile offset in ip_vp */
    user_size_t     ip_arch_size;           /* subfile length in ip_vp */
    char            ip_interp_buffer[IMG_SHSIZE];   /* interpreter buffer space */
    int             ip_interp_sugid_fd;             /* fd for sugid script */

    /* Next two fields are for support of architecture translation... */
    struct vfs_context      *ip_vfs_context;        /* VFS context */
    struct nameidata *ip_ndp;               /* current nameidata */
    thread_t        ip_new_thread;          /* thread for spawn/vfork */

    struct label    *ip_execlabelp;         /* label of the executable */
    struct label    *ip_scriptlabelp;       /* label of the script */
    struct vnode    *ip_scriptvp;           /* script */
    unsigned int    ip_csflags;             /* code signing flags */
    int             ip_mac_return;          /* return code from mac policy checks */
    void            *ip_px_sa;
    void            *ip_px_sfa;
    void            *ip_px_spa;
    void            *ip_px_smpx;            /* MAC-specific spawn attrs. */
    void            *ip_px_persona;         /* persona args */
    void            *ip_px_pcred_info;      /* posix cred args */
    void            *ip_cs_error;           /* codesigning error reason */

    uint64_t ip_dyld_fsid;
    uint64_t ip_dyld_fsobjid;
    unsigned int    ip_simulator_binary;    /* simulator binary flags */
};

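The main flow of __mac_execve is:

Declare the relevant structure pointers.
Allocate one large block of memory for imgp and its two vnode_attr structures.
Set up the thread information: for vfork, only set a flag; otherwise create a new task and thread.
Activate the image described by imgp.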
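The allocation step uses a single buffer that is carved into three structures, which can be sketched in user-space C as below. The `_demo` structures and the `exec_buffers` wrapper are placeholders invented for this example, and `calloc` stands in for the kernel's `MALLOC` with `M_ZERO`.

```c
#include <stdlib.h>

/* Stand-ins for the kernel structures; the fields are placeholders. */
struct image_params_demo { int flags; void *vattr; void *origvattr; };
struct vnode_attr_demo   { long size; };

struct exec_buffers {
    char                     *base;      /* the pointer to free later */
    struct image_params_demo *imgp;
    struct vnode_attr_demo   *vap;
    struct vnode_attr_demo   *origvap;
};

/*
 * Allocate one zeroed block and carve it into three structures,
 * mirroring the layout used in __mac_execve. Returns 0 on success.
 */
int exec_buffers_alloc(struct exec_buffers *out)
{
    size_t total = sizeof(*out->imgp) + 2 * sizeof(*out->vap);
    char *bufp = calloc(1, total);
    if (bufp == NULL) {
        return -1;
    }

    out->base    = bufp;
    out->imgp    = (struct image_params_demo *)bufp;
    out->vap     = (struct vnode_attr_demo *)(bufp + sizeof(*out->imgp));
    out->origvap = (struct vnode_attr_demo *)(bufp + sizeof(*out->imgp) + sizeof(*out->vap));

    /* Link the attribute blocks into the image parameters. */
    out->imgp->vattr     = out->vap;
    out->imgp->origvattr = out->origvap;
    return 0;
}
```

One allocation means one failure path and one free at the end, which matches the single `FREE(bufp, M_TEMP)` in the kernel function.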

Next comes activating the binary.

exec_activate_image

static int
exec_activate_image(struct image_params *imgp)
{
    struct nameidata *ndp = NULL;//https://linux.die.net/man/1/namei
    const char *excpath;
    int error;
    int resid;
    int once = 1;   /* save SGUID-ness for interpreted files */
    int i;
    int itercount = 0;
    // Get the proc_t from the VFS context
    proc_t p = vfs_context_proc(imgp->ip_vfs_context);

    // Allocate kernel memory to hold the user-space arguments and the first page of the image
    error = execargs_alloc(imgp);
    if (error) {
        goto bad_notrans;
    }
    // Save the program path and fix up the arguments
    error = exec_save_path(imgp, imgp->ip_user_fname, imgp->ip_seg, &excpath);
    if (error) {
        goto bad_notrans;
    }

    /* Use excpath, which contains the copyin-ed exec path */
    DTRACE_PROC1(exec, uintptr_t, excpath);

    MALLOC(ndp, struct nameidata *, sizeof(*ndp), M_TEMP, M_WAITOK | M_ZERO);
    if (ndp == NULL) {
        error = ENOMEM;
        goto bad_notrans;
    }

    NDINIT(ndp, LOOKUP, OP_LOOKUP, FOLLOW | LOCKLEAF | AUDITVNPATH1,
        UIO_SYSSPACE, CAST_USER_ADDR_T(excpath), imgp->ip_vfs_context);

again:
    // Look up the binary with namei()
    error = namei(ndp);
    if (error) {
        goto bad_notrans;
    }
    imgp->ip_ndp = ndp;     /* successful namei(); call nameidone() later */
    imgp->ip_vp = ndp->ni_vp;       /* if set, need to vnode_put() at some point */

    /*
     * Before we start the transition from binary A to binary B, make
     * sure another thread hasn't started exiting the process.  We grab
     * the proc lock to check p_lflag initially, and the transition
     * mechanism ensures that the value doesn't change after we release
     * the lock.
     */
    // Make sure no other thread in the process is exiting
    proc_lock(p);
    if (p->p_lflag & P_LEXIT) {
        error = EDEADLK;
        proc_unlock(p);
        goto bad_notrans;
    }
    // Mark the start of the transition
    error = proc_transstart(p, 1, 0);
    proc_unlock(p);
    if (error) {
        goto bad_notrans;
    }
    // Permission check
    error = exec_check_permissions(imgp);
    if (error) {
        goto bad;
    }

    /* Copy; avoid invocation of an interpreter overwriting the original */
    if (once) {
        once = 0;
        *imgp->ip_origvattr = *imgp->ip_vattr;
    }
    // Read the first page into memory
    error = vn_rdwr(UIO_READ, imgp->ip_vp, imgp->ip_vdata, PAGE_SIZE, 0,
        UIO_SYSSPACE, IO_NODELOCKED,
        vfs_context_ucred(imgp->ip_vfs_context),
        &resid, vfs_context_proc(imgp->ip_vfs_context));
    if (error) {
        goto bad;
    }

    if (resid) {
        memset(imgp->ip_vdata + (PAGE_SIZE - resid), 0x0, resid);
    }

encapsulated_binary:
    /* Limit the number of iterations we will attempt on each binary */
    if (++itercount > EAI_ITERLIMIT) {
        error = EBADEXEC;
        goto bad;
    }
    error = -1;
    // Walk the execsw array and call each ex_imgact function to determine the binary type:
    // "Mach-o Binary", "Fat Binary", or "Interpreter Script"
    for (i = 0; error == -1 && execsw[i].ex_imgact != NULL; i++) {
        error = (*execsw[i].ex_imgact)(imgp);

        switch (error) {
        /* case -1: not claimed: continue */
        case -2:                /* Encapsulated binary, imgp->ip_XXX set for next iteration */
            goto encapsulated_binary;

        case -3:                /* Interpreter */
#if CONFIG_MACF
            /*
             * Copy the script label for later use. Note that
             * the label can be different when the script is
             * actually read by the interpreter.
             */
            if (imgp->ip_scriptlabelp) {
                mac_vnode_label_free(imgp->ip_scriptlabelp);
            }
            imgp->ip_scriptlabelp = mac_vnode_label_alloc();
            if (imgp->ip_scriptlabelp == NULL) {
                error = ENOMEM;
                break;
            }
            mac_vnode_label_copy(imgp->ip_vp->v_label,
                imgp->ip_scriptlabelp);

            /*
             * Take a ref of the script vnode for later use.
             */
            if (imgp->ip_scriptvp) {
                vnode_put(imgp->ip_scriptvp);
            }
            if (vnode_getwithref(imgp->ip_vp) == 0) {
                imgp->ip_scriptvp = imgp->ip_vp;
            }
#endif

            nameidone(ndp);

            vnode_put(imgp->ip_vp);
            imgp->ip_vp = NULL;     /* already put */
            imgp->ip_ndp = NULL; /* already nameidone */

            /* Use excpath, which exec_shell_imgact reset to the interpreter */
            NDINIT(ndp, LOOKUP, OP_LOOKUP, FOLLOW | LOCKLEAF,
                UIO_SYSSPACE, CAST_USER_ADDR_T(excpath), imgp->ip_vfs_context);

            proc_transend(p, 0);
            goto again;

        default:
            break;
        }
    }

    if (error == 0) {
        if (imgp->ip_flags & IMGPF_INTERPRET && ndp->ni_vp) {
            AUDIT_ARG(vnpath, ndp->ni_vp, ARG_VNODE2);
        }

        /*
         * Call out to allow 3rd party notification of exec.
         * Ignore result of kauth_authorize_fileop call.
         */
        if (kauth_authorize_fileop_has_listeners()) {
            kauth_authorize_fileop(vfs_context_ucred(imgp->ip_vfs_context),
                KAUTH_FILEOP_EXEC,
                (uintptr_t)ndp->ni_vp, 0);
        }
    }
bad:
    proc_transend(p, 0);

bad_notrans:
    if (imgp->ip_strings) {
        execargs_free(imgp);
    }
    if (imgp->ip_ndp) {
        nameidone(imgp->ip_ndp);
    }
    if (ndp) {
        FREE(ndp, M_TEMP);
    }

    return error;
}

For more on namei, see the link in the comment next to the ndp declaration in the code above.

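The main job of exec_activate_image is to locate the binary and read its first page into memory. The actual loading is then done through the function pointers in the execsw array, which is structured as follows: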

struct execsw {
    int(*const ex_imgact)(struct image_params *);
    const char *ex_name;
}const execsw[] = {
    { exec_mach_imgact, "Mach-o Binary" },
    { exec_fat_imgact, "Fat Binary" },
    { exec_shell_imgact, "Interpreter Script" },
    { NULL, NULL}
};

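exec_activate_image proceeds as follows:

Get the proc_t from the VFS context.
Allocate kernel memory to hold the user-space arguments and the first page of the image.
Save the program path and fix up the arguments.
Use the NDINIT macro and namei() to look up the image file.
Make sure no other thread in the process is exiting, and mark the start of the binary transition.
Check permissions.
Read the first page into memory.
Walk the execsw array and call the matching function.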
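The dispatch in the last step can be sketched in user-space C as below. The `demo_` activators only look at the leading bytes of the first page and return 0 (claimed) or -1 (not claimed); the kernel's activators do far more and also use the -2 and -3 return codes seen in the source above. The names here are assumptions for illustration, while the magic values are the ones defined in the Mach-O headers.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define DEMO_MH_MAGIC    0xfeedfaceu  /* 32-bit Mach-O, host byte order */
#define DEMO_MH_MAGIC_64 0xfeedfacfu  /* 64-bit Mach-O, host byte order */
#define DEMO_FAT_MAGIC   0xcafebabeu  /* fat header, big-endian on disk */

typedef int (*imgact_fn)(const unsigned char *page, size_t len);

static int demo_mach_imgact(const unsigned char *page, size_t len)
{
    uint32_t magic;
    if (len < sizeof(magic)) {
        return -1;
    }
    memcpy(&magic, page, sizeof(magic));   /* read in host byte order */
    return (magic == DEMO_MH_MAGIC || magic == DEMO_MH_MAGIC_64) ? 0 : -1;
}

static int demo_fat_imgact(const unsigned char *page, size_t len)
{
    if (len < 4) {
        return -1;
    }
    /* Fat headers are big-endian regardless of the host. */
    uint32_t magic = ((uint32_t)page[0] << 24) | ((uint32_t)page[1] << 16) |
                     ((uint32_t)page[2] << 8) | (uint32_t)page[3];
    return magic == DEMO_FAT_MAGIC ? 0 : -1;
}

static int demo_shell_imgact(const unsigned char *page, size_t len)
{
    return (len >= 2 && page[0] == '#' && page[1] == '!') ? 0 : -1;
}

static const struct {
    imgact_fn   ex_imgact;
    const char *ex_name;
} demo_execsw[] = {
    { demo_mach_imgact,  "Mach-o Binary" },
    { demo_fat_imgact,   "Fat Binary" },
    { demo_shell_imgact, "Interpreter Script" },
    { NULL, NULL },
};

/* Return the name of the first activator that claims the image, or NULL. */
const char *classify_image(const unsigned char *page, size_t len)
{
    for (int i = 0; demo_execsw[i].ex_imgact != NULL; i++) {
        if (demo_execsw[i].ex_imgact(page, len) == 0) {
            return demo_execsw[i].ex_name;
        }
    }
    return NULL;
}
```

A NULL result corresponds to the "image not claimed by any activator" case, which __mac_execve turns into ENOEXEC.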

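The image here is a Mach-O, so exec_mach_imgact is called.

exec_mach_imgact

This function is long, so its source is omitted here to save space. Its main flow is:

Inspect the Mach-O header to determine the architecture (64-bit or 32-bit).
Evaluate whether the binary meets the loading requirements, including the CPU type.
Based on the vfork flag set in __mac_execve, create the task and thread where needed (vfork itself does not create them).
Call load_machfile to load the binary.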
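The first two steps can be approximated with the header check below. It is a simplified sketch: the struct and constants are local copies of values from the Mach-O and machine headers so that the example is self-contained, and the single CPU-type comparison stands in for the kernel's more nuanced grading of binaries.

```c
#include <stdint.h>

/* Local copies of values from <mach-o/loader.h> and <mach/machine.h>. */
#define DEMO_MH_MAGIC        0xfeedfaceu
#define DEMO_MH_MAGIC_64     0xfeedfacfu
#define DEMO_MH_EXECUTE      0x2u
#define DEMO_CPU_ARCH_ABI64  0x01000000
#define DEMO_CPU_TYPE_ARM64  (12 | DEMO_CPU_ARCH_ABI64)

/* Leading fields shared by mach_header and mach_header_64. */
struct demo_mach_header {
    uint32_t magic;
    int32_t  cputype;
    int32_t  cpusubtype;
    uint32_t filetype;
    uint32_t ncmds;
    uint32_t sizeofcmds;
    uint32_t flags;
};

enum check_result {
    CHECK_OK_32,
    CHECK_OK_64,
    CHECK_NOT_MACHO,
    CHECK_NOT_EXECUTABLE,
    CHECK_WRONG_CPU
};

/* Decide whether a header describes an executable this host could run. */
enum check_result check_header(const struct demo_mach_header *h, int32_t host_cputype)
{
    int is_64;

    if (h->magic == DEMO_MH_MAGIC_64) {
        is_64 = 1;
    } else if (h->magic == DEMO_MH_MAGIC) {
        is_64 = 0;
    } else {
        return CHECK_NOT_MACHO;
    }
    if (h->filetype != DEMO_MH_EXECUTE) {
        return CHECK_NOT_EXECUTABLE;    /* e.g. a dylib or a bundle */
    }
    if (h->cputype != host_cputype) {
        return CHECK_WRONG_CPU;
    }
    return is_64 ? CHECK_OK_64 : CHECK_OK_32;
}
```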

load_machfile

load_machfile sets up the memory map and ultimately loads the contents described by the various LC_SEGMENT commands.

load_return_t
load_machfile(
    struct image_params     *imgp, // image parameters
    struct mach_header      *header,
    thread_t                thread, // current_thread
    vm_map_t                *mapp,
    load_result_t           *result // load result
    )
{
    struct vnode            *vp = imgp->ip_vp;
    off_t                   file_offset = imgp->ip_arch_offset;
    off_t                   macho_size = imgp->ip_arch_size;
    off_t                   file_size = imgp->ip_vattr->va_data_size;
    pmap_t                  pmap = 0;       /* protected by create_map */
    vm_map_t                map;
    load_result_t           myresult;
    load_return_t           lret;
    boolean_t enforce_hard_pagezero = TRUE;
    int in_exec = (imgp->ip_flags & IMGPF_EXEC);
    task_t task = current_task();
    int64_t                 aslr_page_offset = 0;
    int64_t                 dyld_aslr_page_offset = 0;
    int64_t                 aslr_section_size = 0;
    int64_t                 aslr_section_offset = 0;
    kern_return_t           kret;
    unsigned int            pmap_flags = 0;

    if (macho_size > file_size) {
        return LOAD_BADMACHO;
    }

    result->is_64bit_addr = ((imgp->ip_flags & IMGPF_IS_64BIT_ADDR) == IMGPF_IS_64BIT_ADDR);
    result->is_64bit_data = ((imgp->ip_flags & IMGPF_IS_64BIT_DATA) == IMGPF_IS_64BIT_DATA);
#if defined(HAS_APPLE_PAC)
    pmap_flags |= (imgp->ip_flags & IMGPF_NOJOP) ? PMAP_CREATE_DISABLE_JOP : 0;
#endif /* defined(HAS_APPLE_PAC) */
    pmap_flags |= result->is_64bit_addr ? PMAP_CREATE_64BIT : 0;

    task_t ledger_task;
    if (imgp->ip_new_thread) {
        ledger_task = get_threadtask(imgp->ip_new_thread);
    } else {
        ledger_task = task;
    }
    // Create a new pmap
    pmap = pmap_create_options(get_task_ledger(ledger_task),
        (vm_map_size_t) 0,
        pmap_flags);
    if (pmap == NULL) {
        return LOAD_RESOURCE;
    }
    // Create a new vm_map
    map = vm_map_create(pmap,
        0,
        vm_compute_max_offset(result->is_64bit_addr),
        TRUE);

#if defined(__arm64__)
    if (result->is_64bit_addr) {
        /* enforce 16KB alignment of VM map entries */
        vm_map_set_page_shift(map, SIXTEENK_PAGE_SHIFT);
    } else {
        vm_map_set_page_shift(map, page_shift_user32);
    }
#elif (__ARM_ARCH_7K__ >= 2) && defined(PLATFORM_WatchOS)
    /* enforce 16KB alignment for watch targets with new ABI */
    vm_map_set_page_shift(map, SIXTEENK_PAGE_SHIFT);
#endif /* __arm64__ */

#ifndef CONFIG_ENFORCE_SIGNED_CODE
    /* This turns off faulting for executable pages, which allows
     * to circumvent Code Signing Enforcement. The per process
     * flag (CS_ENFORCEMENT) is not set yet, but we can use the
     * global flag.
     */
    if (!cs_process_global_enforcement() && (header->flags & MH_ALLOW_STACK_EXECUTION)) {
        vm_map_disable_NX(map);
        // TODO: Message Trace or log that this is happening
    }
#endif

    /* Forcibly disallow execution from data pages on even if the arch
     * normally permits it. */
    if ((header->flags & MH_NO_HEAP_EXECUTION) && !(imgp->ip_flags & IMGPF_ALLOW_DATA_EXEC)) {
        vm_map_disallow_data_exec(map);
    }

    /*
     * Compute a random offset for ASLR, and an independent random offset for dyld.
     */
    // Compute the ASLR offsets
    if (!(imgp->ip_flags & IMGPF_DISABLE_ASLR)) {
        vm_map_get_max_aslr_slide_section(map, &aslr_section_offset, &aslr_section_size);
        aslr_section_offset = (random() % aslr_section_offset) * aslr_section_size;

        aslr_page_offset = random();
        aslr_page_offset %= vm_map_get_max_aslr_slide_pages(map);
        aslr_page_offset <<= vm_map_page_shift(map);

        dyld_aslr_page_offset = random();
        dyld_aslr_page_offset %= vm_map_get_max_loader_aslr_slide_pages(map);
        dyld_aslr_page_offset <<= vm_map_page_shift(map);

        aslr_page_offset += aslr_section_offset;
    }

    if (!result) {
        result = &myresult;
    }

    *result = load_result_null;

    /*
     * re-set the bitness on the load result since we cleared the load result above.
     */
    result->is_64bit_addr = ((imgp->ip_flags & IMGPF_IS_64BIT_ADDR) == IMGPF_IS_64BIT_ADDR);
    result->is_64bit_data = ((imgp->ip_flags & IMGPF_IS_64BIT_DATA) == IMGPF_IS_64BIT_DATA);
    // Parse and load the Mach-O
    lret = parse_machfile(vp, map, thread, header, file_offset, macho_size,
        0, aslr_page_offset, dyld_aslr_page_offset, result,
        NULL, imgp);

    // Loading failed: release the map
    if (lret != LOAD_SUCCESS) {
        vm_map_deallocate(map); /* will lose pmap reference too */
        return lret;
    }

#if __x86_64__
    /*
     * On x86, for compatibility, don't enforce the hard page-zero restriction for 32-bit binaries.
     */
    if (!result->is_64bit_addr) {
        enforce_hard_pagezero = FALSE;
    }

    /*
     * For processes with IMGPF_HIGH_BITS_ASLR, add a few random high bits
     * to the start address for "anywhere" memory allocations.
     */
#define VM_MAP_HIGH_START_BITS_COUNT 8
#define VM_MAP_HIGH_START_BITS_SHIFT 27
    if (result->is_64bit_addr &&
        (imgp->ip_flags & IMGPF_HIGH_BITS_ASLR)) {
        int random_bits;
        vm_map_offset_t high_start;

        random_bits = random();
        random_bits &= (1 << VM_MAP_HIGH_START_BITS_COUNT) - 1;
        high_start = (((vm_map_offset_t)random_bits)
                << VM_MAP_HIGH_START_BITS_SHIFT);
        vm_map_set_high_start(map, high_start);
    }
#endif /* __x86_64__ */

    /*
     * Check to see if the page zero is enforced by the map->min_offset.
     */
    if (enforce_hard_pagezero &&
        (vm_map_has_hard_pagezero(map, 0x1000) == FALSE)) {
#if __arm64__
        if (!result->is_64bit_addr && /* not 64-bit address space */
            !(header->flags & MH_PIE) &&          /* not PIE */
            (vm_map_page_shift(map) != FOURK_PAGE_SHIFT ||
            PAGE_SHIFT != FOURK_PAGE_SHIFT) &&  /* page size != 4KB */
            result->has_pagezero &&     /* has a "soft" page zero */
            fourk_binary_compatibility_unsafe) {
            /*
             * For backwards compatibility of "4K" apps on
             * a 16K system, do not enforce a hard page zero...
             */
        } else
#endif /* __arm64__ */
        {
            vm_map_deallocate(map); /* will lose pmap reference too */
            return LOAD_BADMACHO;
        }
    }

    vm_commit_pagezero_status(map);

    /*
     * If this is an exec, then we are going to destroy the old
     * task, and it's correct to halt it; if it's spawn, the
     * task is not yet running, and it makes no sense.
     */
    if (in_exec) {
        proc_t p = vfs_context_proc(imgp->ip_vfs_context);
        /*
         * Mark the task as halting and start the other
         * threads towards terminating themselves.  Then
         * make sure any threads waiting for a process
         * transition get informed that we are committed to
         * this transition, and then finally complete the
         * task halting (wait for threads and then cleanup
         * task resources).
         *
         * NOTE: task_start_halt() makes sure that no new
         * threads are created in the task during the transition.
         * We need to mark the workqueue as exiting before we
         * wait for threads to terminate (at the end of which
         * we no longer have a prohibition on thread creation).
         *
         * Finally, clean up any lingering workqueue data structures
         * that may have been left behind by the workqueue threads
         * as they exited (and then clean up the work queue itself).
         */
        kret = task_start_halt(task);
        if (kret != KERN_SUCCESS) {
            vm_map_deallocate(map); /* will lose pmap reference too */
            return LOAD_FAILURE;
        }
        proc_transcommit(p, 0);
        workq_mark_exiting(p);
        task_complete_halt(task);
        workq_exit(p);

        /*
         * Roll up accounting info to new task. The roll up is done after
         * task_complete_halt to make sure the thread accounting info is
         * rolled up to current_task.
         */
        task_rollup_accounting_info(get_threadtask(thread), task);
    }
    *mapp = map;

#ifdef CONFIG_32BIT_TELEMETRY
    if (!result->is_64bit_data) {
        /*
         * This may not need to be an AST; we merely need to ensure that
         * we gather telemetry at the point where all of the information
         * that we want has been added to the process.
         */
        task_set_32bit_log_flag(get_threadtask(thread));
        act_set_astbsd(thread);
    }
#endif /* CONFIG_32BIT_TELEMETRY */

    return LOAD_SUCCESS;
}

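The function breaks down into the following parts:

Create the memory map for the new image.
Apply the security settings, including ASLR and disabling execution of the data segment.
Call parse_machfile, which does the actual loading.
If loading fails, release the map created earlier.
Otherwise, hand the new map back to the caller through mapp.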
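The control flow summarized above (create a map, parse, release on failure, publish on success) can be sketched in plain C. This is a minimal user-space sketch, not kernel code: `fake_map_t`, `parse_image_sketch`, and `load_image_sketch` are hypothetical names standing in for `vm_map_t`, `parse_machfile`, and `load_machfile`, and `malloc`/`free` stand in for the kernel's map creation and `vm_map_deallocate`.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for vm_map_t; not a kernel type. */
typedef struct fake_map {
    long aslr_offset;   /* slide chosen for the main binary */
} fake_map_t;

enum { SKETCH_LOAD_SUCCESS = 0, SKETCH_LOAD_FAILURE = 1 };

/* Hypothetical stand-in for parse_machfile(): fails when told to. */
static int parse_image_sketch(fake_map_t *map, bool image_is_valid)
{
    (void)map;
    return image_is_valid ? SKETCH_LOAD_SUCCESS : SKETCH_LOAD_FAILURE;
}

/*
 * Mirrors the shape of load_machfile(): the new map is handed to the
 * caller through *mapp only when parsing succeeded. On failure the map
 * is released and *mapp is left untouched.
 */
static int load_image_sketch(bool image_is_valid, long aslr_offset,
                             fake_map_t **mapp)
{
    fake_map_t *map = malloc(sizeof(*map));
    if (map == NULL) {
        return SKETCH_LOAD_FAILURE;
    }
    map->aslr_offset = aslr_offset;

    int ret = parse_image_sketch(map, image_is_valid);
    if (ret != SKETCH_LOAD_SUCCESS) {
        free(map);      /* plays the role of vm_map_deallocate(map) */
        return ret;
    }
    *mapp = map;        /* publish the map only on success */
    return SKETCH_LOAD_SUCCESS;
}
```

The point of the out-parameter is that the caller never sees a half-built map: a failed load leaves whatever `*mapp` held before.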
parse_machfile

static
load_return_t
parse_machfile(
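    struct vnode            *vp,            // vnode taken from imgp
    vm_map_t                map,            // map set up by load_machfile
    thread_t                thread,
    struct mach_header      *header,
    off_t                   file_offset,    // offset of the image within the file
    off_t                   macho_size,     // size of the Mach-O image
    int                     depth,          // recursion depth
    int64_t                 aslr_offset,    // ASLR slide for the main binary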
    int64_t                 dyld_aslr_offset,
    load_result_t           *result,
    load_result_t           *binresult,
    struct image_params     *imgp
    )
{
    uint32_t                ncmds;
    struct load_command     *lcp;
    struct dylinker_command *dlp = 0;
    integer_t               dlarchbits = 0;
    void *                  control;
    load_return_t           ret = LOAD_SUCCESS;
    void *                  addr;
    vm_size_t               alloc_size, cmds_size;
    size_t                  offset;
    size_t                  oldoffset;      /* for overflow check */
    int                     pass;
    proc_t                  p = vfs_context_proc(imgp->ip_vfs_context);
    int                     error;
    int                     resid = 0;
    int                     spawn = (imgp->ip_flags & IMGPF_SPAWN);
    int                     vfexec = (imgp->ip_flags & IMGPF_VFORK_EXEC);
    size_t                  mach_header_sz = sizeof(struct mach_header);
    boolean_t               abi64;
    boolean_t               got_code_signatures = FALSE;
    boolean_t               found_header_segment = FALSE;
    boolean_t               found_xhdr = FALSE;
    boolean_t               found_version_cmd = FALSE;
    int64_t                 slide = 0;
    boolean_t               dyld_no_load_addr = FALSE;
    boolean_t               is_dyld = FALSE;
    vm_map_offset_t         effective_page_mask = MAX(PAGE_MASK, vm_map_page_mask(map));
#if __arm64__
    uint32_t                pagezero_end = 0;
    uint32_t                executable_end = 0;
    uint32_t                writable_start = 0;
    vm_map_size_t           effective_page_size;

    effective_page_size = MAX(PAGE_SIZE, vm_map_page_size(map));
#endif /* __arm64__ */
    // Check the magic: a 64-bit image uses the larger mach_header_64
    if (header->magic == MH_MAGIC_64 ||
        header->magic == MH_CIGAM_64) {
        mach_header_sz = sizeof(struct mach_header_64);
    }

    /*
     *    Break infinite recursion
     */
    // Fail when depth > 1; older versions may have allowed up to 6
    if (depth > 1) {
        return LOAD_FAILURE;
    }

    depth++;

    /*
     *    Check to see if right machine type.
     */
    // Verify the CPU architecture
    if (((cpu_type_t)(header->cputype & ~CPU_ARCH_MASK) != (cpu_type() & ~CPU_ARCH_MASK)) ||
        !grade_binary(header->cputype,
        header->cpusubtype & ~CPU_SUBTYPE_MASK, TRUE)) {
        return LOAD_BADARCH;
    }

    abi64 = ((header->cputype & CPU_ARCH_ABI64) == CPU_ARCH_ABI64);

    switch (header->filetype) {
    // MH_EXECUTE is only valid at depth 1
    case MH_EXECUTE:
    
        if (depth != 1) {
            return LOAD_FAILURE;
        }
#if CONFIG_EMBEDDED
        if (header->flags & MH_DYLDLINK) {
            /* Check properties of dynamic executables */
            if (!(header->flags & MH_PIE) && pie_required(header->cputype, header->cpusubtype & ~CPU_SUBTYPE_MASK)) {
                return LOAD_FAILURE;
            }
            result->needs_dynlinker = TRUE;
        } else {
            /* Check properties of static executables (disallowed except for development) */
#if !(DEVELOPMENT || DEBUG)
            return LOAD_FAILURE;
#endif
        }
#endif /* CONFIG_EMBEDDED */

        break;
    // MH_DYLINKER is only valid at depth 2
    case MH_DYLINKER:
    
        if (depth != 2) {
            return LOAD_FAILURE;
        }
        is_dyld = TRUE;
        break;

    default:
        return LOAD_FAILURE;
    }

    /*
     *    Get the pager for the file.
     */
    control = ubc_getobject(vp, UBC_FLAGS_NONE);

    /* ensure header + sizeofcmds falls within the file */
    if (os_add_overflow(mach_header_sz, header->sizeofcmds, &cmds_size) ||
        (off_t)cmds_size > macho_size ||
        round_page_overflow(cmds_size, &alloc_size)) {
        return LOAD_BADMACHO;
    }

    /*
     * Map the load commands into kernel memory.
     */
    addr = kalloc(alloc_size);
    if (addr == NULL) {
        return LOAD_NOSPACE;
    }
    // Read all load commands into kernel memory
    error = vn_rdwr(UIO_READ, vp, addr, alloc_size, file_offset,
        UIO_SYSSPACE, 0, vfs_context_ucred(imgp->ip_vfs_context), &resid, p);
    if (error) {
        kfree(addr, alloc_size);
        return LOAD_IOERROR;
    }

    if (resid) {
        /* We must be able to read in as much as the mach_header indicated */
        kfree(addr, alloc_size);
        return LOAD_BADMACHO;
    }

    /*
     *    For PIE and dyld, slide everything by the ASLR offset.
     */
    // Apply the ASLR slide for PIE binaries and for dyld
    if ((header->flags & MH_PIE) || is_dyld) {
        slide = aslr_offset;
    }

    // Scan the load commands in passes 0 to 3 (pass 0 is skipped when it is not needed)
    /*
     *  Scan through the commands, processing each one as necessary.
     *  We parse in three passes through the headers:
     *  0: determine if TEXT and DATA boundary can be page-aligned
     *  1: thread state, uuid, code signature
     *  2: segments
     *  3: dyld, encryption, check entry point
     */

    boolean_t slide_realign = FALSE;
#if __arm64__
    if (!abi64) {
        slide_realign = TRUE;
    }
#endif

    for (pass = 0; pass <= 3; pass++) {
        if (pass == 0 && !slide_realign && !is_dyld) {
            /* if we dont need to realign the slide or determine dyld's load
             * address, pass 0 can be skipped */
            continue;
        } else if (pass == 1) {
#if __arm64__
//pass1:
            boolean_t       is_pie;
            int64_t         adjust;

            is_pie = ((header->flags & MH_PIE) != 0);
            if (pagezero_end != 0 &&
                pagezero_end < effective_page_size) {
                /* need at least 1 page for PAGEZERO */
                adjust = effective_page_size;
                MACHO_PRINTF(("pagezero boundary at "
                    "0x%llx; adjust slide from "
                    "0x%llx to 0x%llx%s\n",
                    (uint64_t) pagezero_end,
                    slide,
                    slide + adjust,
                    (is_pie
                    ? ""
                    : " BUT NO PIE ****** :-(")));
                if (is_pie) {
                    slide += adjust;
                    pagezero_end += adjust;
                    executable_end += adjust;
                    writable_start += adjust;
                }
            }
            if (pagezero_end != 0) {
                result->has_pagezero = TRUE;
            }
            if (executable_end == writable_start &&
                (executable_end & effective_page_mask) != 0 &&
                (executable_end & FOURK_PAGE_MASK) == 0) {
                /*
                 * The TEXT/DATA boundary is 4K-aligned but
                 * not page-aligned.  Adjust the slide to make
                 * it page-aligned and avoid having a page
                 * with both write and execute permissions.
                 */
                adjust =
                    (effective_page_size -
                    (executable_end & effective_page_mask));
                MACHO_PRINTF(("page-unaligned X-W boundary at "
                    "0x%llx; adjust slide from "
                    "0x%llx to 0x%llx%s\n",
                    (uint64_t) executable_end,
                    slide,
                    slide + adjust,
                    (is_pie
                    ? ""
                    : " BUT NO PIE ****** :-(")));
                if (is_pie) {
                    slide += adjust;
                }
            }
#endif /* __arm64__ */

            if (dyld_no_load_addr && binresult) {
                /*
                 * The dyld Mach-O does not specify a load address. Try to locate
                 * it right after the main binary. If binresult == NULL, load
                 * directly to the given slide.
                 */
                slide = vm_map_round_page(slide + binresult->max_vm_addr, effective_page_mask);
            }
        }

        /*
         * Check that the entry point is contained in an executable segments
         */
        if (pass == 3) {
            if (depth == 1 && imgp && (imgp->ip_flags & IMGPF_DRIVER)) {
                /* Driver binaries must have driverkit platform */
                if (result->ip_platform == PLATFORM_DRIVERKIT) {
                    /* Driver binaries have no entry point */
                    ret = setup_driver_main(thread, slide, result);
                } else {
                    ret = LOAD_FAILURE;
                }
            } else if (!result->using_lcmain && result->validentry == 0) {
                ret = LOAD_FAILURE;
            }
            if (ret != KERN_SUCCESS) {
                thread_state_initialize(thread);
                break;
            }
        }

        /*
         * Check that some segment maps the start of the mach-o file, which is
         * needed by the dynamic loader to read the mach headers, etc.
         */
        if ((pass == 3) && (found_header_segment == FALSE)) {
            ret = LOAD_BADMACHO;
            break;
        }

        /*
         * Loop through each of the load_commands indicated by the
         * Mach-O header; if an absurd value is provided, we just
         * run off the end of the reserved section by incrementing
         * the offset too far, so we are implicitly fail-safe.
         */
        offset = mach_header_sz;
        ncmds = header->ncmds;
        // Walk the load commands
        while (ncmds--) {
            /* ensure enough space for a minimal load command */
            if (offset + sizeof(struct load_command) > cmds_size) {
                ret = LOAD_BADMACHO;
                break;
            }

            /*
             *    Get a pointer to the command.
             */
            lcp = (struct load_command *)(addr + offset);
            oldoffset = offset;

            /*
             * Perform prevalidation of the struct load_command
             * before we attempt to use its contents.  Invalid
             * values are ones which result in an overflow, or
             * which can not possibly be valid commands, or which
             * straddle or exist past the reserved section at the
             * start of the image.
             */
            if (os_add_overflow(offset, lcp->cmdsize, &offset) ||
                lcp->cmdsize < sizeof(struct load_command) ||
                offset > cmds_size) {
                ret = LOAD_BADMACHO;
                break;
            }

            /*
             * Act on struct load_command's for which kernel
             * intervention is required.
             * Note that each load command implementation is expected to validate
             * that lcp->cmdsize is large enough to fit its specific struct type
             * before dereferencing fields not covered by struct load_command.
             */
            switch (lcp->cmd) {
            case LC_SEGMENT: {
                struct segment_command *scp = (struct segment_command *) lcp;
                if (scp->cmdsize < sizeof(*scp)) {
                    ret = LOAD_BADMACHO;
                    break;
                }
                if (pass == 0) {
                    if (is_dyld && scp->vmaddr == 0 && scp->fileoff == 0) {
                        dyld_no_load_addr = TRUE;
                        if (!slide_realign) {
                            /* got what we need, bail early on pass 0 */
                            continue;
                        }
                    }

#if __arm64__
                    assert(!abi64);

                    if (scp->initprot == 0 && scp->maxprot == 0 && scp->vmaddr == 0) {
                        /* PAGEZERO */
                        if (os_add3_overflow(scp->vmaddr, scp->vmsize, slide, &pagezero_end)) {
                            ret = LOAD_BADMACHO;
                            break;
                        }
                    }
                    if (scp->initprot & VM_PROT_EXECUTE) {
                        /* TEXT */
                        if (os_add3_overflow(scp->vmaddr, scp->vmsize, slide, &executable_end)) {
                            ret = LOAD_BADMACHO;
                            break;
                        }
                    }
                    if (scp->initprot & VM_PROT_WRITE) {
                        /* DATA */
                        if (os_add_overflow(scp->vmaddr, slide, &writable_start)) {
                            ret = LOAD_BADMACHO;
                            break;
                        }
                    }
#endif /* __arm64__ */
                    break;
                }

                if (pass == 1 && !strncmp(scp->segname, "__XHDR", sizeof(scp->segname))) {
                    found_xhdr = TRUE;
                }

                if (pass != 2) {
                    break;
                }

                if (abi64) {
                    /*
                     * Having an LC_SEGMENT command for the
                     * wrong ABI is invalid <rdar://problem/11021230>
                     */
                    ret = LOAD_BADMACHO;
                    break;
                }

                ret = load_segment(lcp,
                    header->filetype,
                    control,
                    file_offset,
                    macho_size,
                    vp,
                    map,
                    slide,
                    result);
                if (ret == LOAD_SUCCESS && scp->fileoff == 0 && scp->filesize > 0) {
                    /* Enforce a single segment mapping offset zero, with R+X
                     * protection. */
                    if (found_header_segment ||
                        ((scp->initprot & (VM_PROT_READ | VM_PROT_EXECUTE)) != (VM_PROT_READ | VM_PROT_EXECUTE))) {
                        ret = LOAD_BADMACHO;
                        break;
                    }
                    found_header_segment = TRUE;
                }

                break;
            }
            case LC_SEGMENT_64: {
                struct segment_command_64 *scp64 = (struct segment_command_64 *) lcp;
                if (scp64->cmdsize < sizeof(*scp64)) {
                    ret = LOAD_BADMACHO;
                    break;
                }
                if (pass == 0) {
                    if (is_dyld && scp64->vmaddr == 0 && scp64->fileoff == 0) {
                        dyld_no_load_addr = TRUE;
                        if (!slide_realign) {
                            /* got what we need, bail early on pass 0 */
                            continue;
                        }
                    }
                }

                if (pass == 1 && !strncmp(scp64->segname, "__XHDR", sizeof(scp64->segname))) {
                    found_xhdr = TRUE;
                }

                if (pass != 2) {
                    break;
                }

                if (!abi64) {
                    /*
                     * Having an LC_SEGMENT_64 command for the
                     * wrong ABI is invalid <rdar://problem/11021230>
                     */
                    ret = LOAD_BADMACHO;
                    break;
                }

                ret = load_segment(lcp,
                    header->filetype,
                    control,
                    file_offset,
                    macho_size,
                    vp,
                    map,
                    slide,
                    result);

                if (ret == LOAD_SUCCESS && scp64->fileoff == 0 && scp64->filesize > 0) {
                    /* Enforce a single segment mapping offset zero, with R+X
                     * protection. */
                    if (found_header_segment ||
                        ((scp64->initprot & (VM_PROT_READ | VM_PROT_EXECUTE)) != (VM_PROT_READ | VM_PROT_EXECUTE))) {
                        ret = LOAD_BADMACHO;
                        break;
                    }
                    found_header_segment = TRUE;
                }

                break;
            }
            case LC_UNIXTHREAD:
                if (pass != 1) {
                    break;
                }
                ret = load_unixthread(
                    (struct thread_command *) lcp,
                    thread,
                    slide,
                    result);
                break;
            case LC_MAIN:
                if (pass != 1) {
                    break;
                }
                if (depth != 1) {
                    break;
                }
                ret = load_main(
                    (struct entry_point_command *) lcp,
                    thread,
                    slide,
                    result);
                break;
            case LC_LOAD_DYLINKER:
                if (pass != 3) {
                    break;
                }
                if ((depth == 1) && (dlp == 0)) {
                    dlp = (struct dylinker_command *)lcp;
                    dlarchbits = (header->cputype & CPU_ARCH_MASK);
                } else {
                    ret = LOAD_FAILURE;
                }
                break;
            case LC_UUID:
                if (pass == 1 && depth == 1) {
                    ret = load_uuid((struct uuid_command *) lcp,
                        (char *)addr + cmds_size,
                        result);
                }
                break;
            case LC_CODE_SIGNATURE:
                /* CODE SIGNING */
                if (pass != 1) {
                    break;
                }
                /* pager -> uip ->
                 *  load signatures & store in uip
                 *  set VM object "signed_pages"
                 */
                ret = load_code_signature(
                    (struct linkedit_data_command *) lcp,
                    vp,
                    file_offset,
                    macho_size,
                    header->cputype,
                    result,
                    imgp);
                if (ret != LOAD_SUCCESS) {
                    printf("proc %d: load code signature error %d "
                        "for file \"%s\"\n",
                        p->p_pid, ret, vp->v_name);
                    /*
                     * Allow injections to be ignored on devices w/o enforcement enabled
                     */
                    if (!cs_process_global_enforcement()) {
                        ret = LOAD_SUCCESS; /* ignore error */
                    }
                } else {
                    got_code_signatures = TRUE;
                }

                if (got_code_signatures) {
                    unsigned tainted = CS_VALIDATE_TAINTED;
                    boolean_t valid = FALSE;
                    vm_size_t off = 0;


                    if (cs_debug > 10) {
                        printf("validating initial pages of %s\n", vp->v_name);
                    }

                    while (off < alloc_size && ret == LOAD_SUCCESS) {
                        tainted = CS_VALIDATE_TAINTED;

                        valid = cs_validate_range(vp,
                            NULL,
                            file_offset + off,
                            addr + off,
                            PAGE_SIZE,
                            &tainted);
                        if (!valid || (tainted & CS_VALIDATE_TAINTED)) {
                            if (cs_debug) {
                                printf("CODE SIGNING: %s[%d]: invalid initial page at offset %lld validated:%d tainted:%d csflags:0x%x\n",
                                    vp->v_name, p->p_pid, (long long)(file_offset + off), valid, tainted, result->csflags);
                            }
                            if (cs_process_global_enforcement() ||
                                (result->csflags & (CS_HARD | CS_KILL | CS_ENFORCEMENT))) {
                                ret = LOAD_FAILURE;
                            }
                            result->csflags &= ~CS_VALID;
                        }
                        off += PAGE_SIZE;
                    }
                }

                break;
#if CONFIG_CODE_DECRYPTION
            case LC_ENCRYPTION_INFO:
            case LC_ENCRYPTION_INFO_64:
                if (pass != 3) {
                    break;
                }
                ret = set_code_unprotect(
                    (struct encryption_info_command *) lcp,
                    addr, map, slide, vp, file_offset,
                    header->cputype, header->cpusubtype);
                if (ret != LOAD_SUCCESS) {
                    os_reason_t load_failure_reason = OS_REASON_NULL;
                    printf("proc %d: set_code_unprotect() error %d "
                        "for file \"%s\"\n",
                        p->p_pid, ret, vp->v_name);
                    /*
                     * Don't let the app run if it's
                     * encrypted but we failed to set up the
                     * decrypter. If the keys are missing it will
                     * return LOAD_DECRYPTFAIL.
                     */
                    if (ret == LOAD_DECRYPTFAIL) {
                        /* failed to load due to missing FP keys */
                        proc_lock(p);
                        p->p_lflag |= P_LTERM_DECRYPTFAIL;
                        proc_unlock(p);

                        KERNEL_DEBUG_CONSTANT(BSDDBG_CODE(DBG_BSD_PROC, BSD_PROC_EXITREASON_CREATE) | DBG_FUNC_NONE,
                            p->p_pid, OS_REASON_EXEC, EXEC_EXIT_REASON_FAIRPLAY_DECRYPT, 0, 0);
                        load_failure_reason = os_reason_create(OS_REASON_EXEC, EXEC_EXIT_REASON_FAIRPLAY_DECRYPT);
                    } else {
                        KERNEL_DEBUG_CONSTANT(BSDDBG_CODE(DBG_BSD_PROC, BSD_PROC_EXITREASON_CREATE) | DBG_FUNC_NONE,
                            p->p_pid, OS_REASON_EXEC, EXEC_EXIT_REASON_DECRYPT, 0, 0);
                        load_failure_reason = os_reason_create(OS_REASON_EXEC, EXEC_EXIT_REASON_DECRYPT);
                    }

                    /*
                     * Don't signal the process if it was forked and in a partially constructed
                     * state as part of a spawn -- it will just be torn down when the exec fails.
                     */
                    if (!spawn) {
                        assert(load_failure_reason != OS_REASON_NULL);
                        if (vfexec) {
                            psignal_vfork_with_reason(p, get_threadtask(imgp->ip_new_thread), imgp->ip_new_thread, SIGKILL, load_failure_reason);
                            load_failure_reason = OS_REASON_NULL;
                        } else {
                            psignal_with_reason(p, SIGKILL, load_failure_reason);
                            load_failure_reason = OS_REASON_NULL;
                        }
                    } else {
                        os_reason_free(load_failure_reason);
                        load_failure_reason = OS_REASON_NULL;
                    }
                }
                break;
#endif
            case LC_VERSION_MIN_IPHONEOS:
            case LC_VERSION_MIN_MACOSX:
            case LC_VERSION_MIN_WATCHOS:
            case LC_VERSION_MIN_TVOS: {
                struct version_min_command *vmc;

                if (depth != 1 || pass != 1) {
                    break;
                }
                vmc = (struct version_min_command *) lcp;
                ret = load_version(vmc, &found_version_cmd, result);
                break;
            }
            case LC_BUILD_VERSION: {
                if (depth != 1 || pass != 1) {
                    break;
                }
                struct build_version_command* bvc = (struct build_version_command*)lcp;
                if (bvc->cmdsize < sizeof(*bvc)) {
                    ret = LOAD_BADMACHO;
                    break;
                }
                if (found_version_cmd == TRUE) {
                    ret = LOAD_BADMACHO;
                    break;
                }
                result->ip_platform = bvc->platform;
                found_version_cmd = TRUE;
                break;
            }
            default:
                /* Other commands are ignored by the kernel */
                ret = LOAD_SUCCESS;
                break;
            }
            if (ret != LOAD_SUCCESS) {
                break;
            }
        }
        if (ret != LOAD_SUCCESS) {
            break;
        }
    }

    if (ret == LOAD_SUCCESS) {
        if (!got_code_signatures && cs_process_global_enforcement()) {
            ret = LOAD_FAILURE;
        }

        /* Make sure if we need dyld, we got it */
        if (result->needs_dynlinker && !dlp) {
            ret = LOAD_FAILURE;
        }

        if ((ret == LOAD_SUCCESS) && (dlp != 0)) {
            /*
             * load the dylinker, and slide it by the independent DYLD ASLR
             * offset regardless of the PIE-ness of the main binary.
             */
            ret = load_dylinker(dlp, dlarchbits, map, thread, depth,
                dyld_aslr_offset, result, imgp);
        }

        if ((ret == LOAD_SUCCESS) && (depth == 1)) {
            if (result->thread_count == 0) {
                ret = LOAD_FAILURE;
            }
#if CONFIG_ENFORCE_SIGNED_CODE
            if (result->needs_dynlinker && !(result->csflags & CS_DYLD_PLATFORM)) {
                ret = LOAD_FAILURE;
            }
#endif
        }
    }

    if (ret == LOAD_BADMACHO && found_xhdr) {
        ret = LOAD_BADMACHO_UPX;
    }

    kfree(addr, alloc_size);

    return ret;
}
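The pass-1 slide adjustment in the listing above is easy to lose among the logging calls, so here is a minimal sketch of just the arithmetic. It is not the kernel's code: `xw_boundary_adjust` and `SKETCH_FOURK_PAGE_MASK` are hypothetical names, and the function returns the extra slide to apply instead of mutating `slide` in place. In the listing this path only matters when `slide_realign` is set, which happens for non-64-bit images on arm64.

```c
#include <stdbool.h>
#include <stdint.h>

/* Low 12 bits: the offset within a 4 KiB page. */
#define SKETCH_FOURK_PAGE_MASK 0xFFFULL

/*
 * Returns the extra slide to apply so that a TEXT/DATA boundary that is
 * 4 KiB-aligned, but not aligned to the real page size, lands on a page
 * boundary. `executable_end` and `writable_start` are already-slid
 * addresses; `page_size` must be a power of two.
 */
static uint64_t xw_boundary_adjust(uint64_t executable_end,
                                   uint64_t writable_start,
                                   uint64_t page_size,
                                   bool is_pie)
{
    uint64_t page_mask = page_size - 1;

    if (!is_pie) {
        return 0;   /* a non-PIE image cannot be slid */
    }
    if (executable_end == writable_start &&
        (executable_end & page_mask) != 0 &&
        (executable_end & SKETCH_FOURK_PAGE_MASK) == 0) {
        /* Distance from the boundary up to the next page boundary. */
        return page_size - (executable_end & page_mask);
    }
    return 0;
}
```

With 16 KiB pages and a TEXT/DATA boundary at 0x5000, the adjustment is 0x3000, which moves the boundary to 0x8000 so that no single page needs both write and execute permission.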

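The function is long. Its flow can be summarized as follows:

Validate the file, parameters, and architecture before loading.
Enter the switch on header->filetype:
1. MH_EXECUTE is accepted only at depth 1.
2. MH_DYLINKER is accepted only at depth 2.
3. Any other file type returns LOAD_FAILURE.
Read all load commands into kernel memory.
Set the slide to the ASLR offset for PIE binaries and for dyld.
Scan the load commands in passes 0 to 3. The kernel comment calls this three passes; pass 0 runs only when the slide may need realigning or dyld's load address has to be determined.
Act on each load command according to its switch case:
1. LC_SEGMENT/LC_SEGMENT_64: call load_segment() in pass 2, which maps the segment into memory as the command describes.
2. LC_UNIXTHREAD: call load_unixthread() in pass 1.
3. LC_MAIN: call load_main() in pass 1, at depth 1 only.
4. LC_LOAD_DYLINKER: in pass 3 at depth 1, save the command in dlp.
5. LC_UUID: call load_uuid() in pass 1 at depth 1, copying the UUID into the result.
6. LC_CODE_SIGNATURE: call load_code_signature() in pass 1 only. At this point only the initial pages holding the header and load commands are validated, via cs_validate_range().
7. Other commands are ignored by the kernel and left to dyld.
After the passes finish, dlp holds the saved dynamic linker command. load_dylinker() then loads the dynamic linker into the new map, slid by its own ASLR offset (dyld_aslr_offset), and in turn calls parse_machfile recursively.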
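The multi-pass walk over the load commands can be reduced to a small user-space sketch. It assumes a buffer that holds only the load commands (the kernel starts at `mach_header_sz` instead of 0), handles just three commands, and skips pass 0. `lc_sketch`, `scan_result`, `scan_pass`, and `scan_image` are hypothetical names; the `SKETCH_LC_*` values match the constants in `<mach-o/loader.h>`, redefined here so the sketch compiles on any platform.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Same layout as struct load_command in <mach-o/loader.h>. */
struct lc_sketch {
    uint32_t cmd;
    uint32_t cmdsize;
};

#define SKETCH_LC_LOAD_DYLINKER 0xeu
#define SKETCH_LC_SEGMENT_64    0x19u
#define SKETCH_LC_MAIN          0x80000028u  /* 0x28 | LC_REQ_DYLD */

struct scan_result {
    int has_entry_point;   /* set in pass 1 */
    int segments_loaded;   /* counted in pass 2 */
    int wants_dyld;        /* set in pass 3 */
};

/* One pass over the commands; returns 0, or -1 for a malformed buffer. */
static int scan_pass(const uint8_t *cmds, size_t cmds_size, uint32_t ncmds,
                     int pass, struct scan_result *out)
{
    size_t offset = 0;

    while (ncmds--) {
        struct lc_sketch lc;

        /* Ensure there is room for a minimal load command. */
        if (offset > cmds_size || cmds_size - offset < sizeof(lc)) {
            return -1;
        }
        memcpy(&lc, cmds + offset, sizeof(lc));

        /* Reject sizes that are too small or run past the buffer. */
        if (lc.cmdsize < sizeof(lc) || lc.cmdsize > cmds_size - offset) {
            return -1;
        }
        offset += lc.cmdsize;

        switch (lc.cmd) {
        case SKETCH_LC_MAIN:
            if (pass == 1) out->has_entry_point = 1;
            break;
        case SKETCH_LC_SEGMENT_64:
            if (pass == 2) out->segments_loaded++;
            break;
        case SKETCH_LC_LOAD_DYLINKER:
            if (pass == 3) out->wants_dyld = 1;
            break;
        default:
            break;  /* other commands are ignored */
        }
    }
    return 0;
}

/* Runs passes 1 to 3 over the same buffer, stopping at the first error. */
static int scan_image(const uint8_t *cmds, size_t cmds_size, uint32_t ncmds,
                      struct scan_result *out)
{
    memset(out, 0, sizeof(*out));
    for (int pass = 1; pass <= 3; pass++) {
        if (scan_pass(cmds, cmds_size, ncmds, pass, out) != 0) {
            return -1;
        }
    }
    return 0;
}
```

Each pass re-reads the same buffer and every case filters on the pass number, which is why the order of commands in the file does not dictate the order in which the kernel acts on them. The bounds checks are written as subtractions so that a hostile `cmdsize` cannot overflow `offset`.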

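At this point the Mach-O image has been mapped into memory. dyld then takes over: it inserts, loads, and links the dynamic libraries, performs rebasing, runs the relevant initializers, and finally hands control to main.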
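Rebasing amounts to adding the slide to every absolute pointer stored in the image, because the linker computed those pointers for the preferred (unslid) load address. The following is a deliberately simplified sketch under that assumption. Real dyld finds the pointer locations from rebase information embedded in the binary, whereas here they are passed in as a plain array of byte offsets, and `rebase_sketch` is a hypothetical name.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/*
 * Adds `slide` to each 64-bit pointer stored at `data + offsets[i]`.
 * Returns 0 on success, or -1 if an offset would run past the buffer
 * (entries before the bad offset have already been rebased).
 */
static int rebase_sketch(uint8_t *data, size_t data_size,
                         const size_t *offsets, size_t count,
                         int64_t slide)
{
    for (size_t i = 0; i < count; i++) {
        uint64_t value;

        if (offsets[i] > data_size ||
            data_size - offsets[i] < sizeof(value)) {
            return -1;
        }
        /* memcpy avoids unaligned loads and stores. */
        memcpy(&value, data + offsets[i], sizeof(value));
        value += (uint64_t)slide;
        memcpy(data + offsets[i], &value, sizeof(value));
    }
    return 0;
}
```

Only the listed locations change; plain data in between is left alone, which is why the loader needs an explicit list of pointer locations instead of scanning memory for values that look like addresses.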

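Summary

What happens before the app starts:

User space triggers the creation of a process.
A system call enters the kernel, which creates the process.
The image file is read and its header is loaded into memory.
The kernel iterates over execsw and runs the loader for the matching binary format.
The Mach-O load commands are loaded and parsed.
The dyld load command is saved.
The kernel's part is done, and dyld takes over to process the dynamic libraries.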
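The execsw step in the summary is a table-driven dispatch: the kernel tries each image activator in order until one recognizes the file. A minimal sketch of that pattern follows. All names ending in `_sketch`, plus `execsw_table` and `pick_loader`, are hypothetical, and the activators here only inspect the leading magic bytes. The real activators do far more, and for a fat binary or a script the kernel goes on to process the selected slice or the interpreter.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Simplified activators: 0 means "this is my format", -1 means "not mine". */
static int macho_imgact_sketch(const uint8_t *hdr, size_t len)
{
    uint32_t magic;

    if (len < sizeof(magic)) {
        return -1;
    }
    memcpy(&magic, hdr, sizeof(magic));
    /* MH_MAGIC / MH_MAGIC_64 in the host's byte order. */
    return (magic == 0xfeedfaceu || magic == 0xfeedfacfu) ? 0 : -1;
}

static int fat_imgact_sketch(const uint8_t *hdr, size_t len)
{
    /* The fat header is stored big-endian on disk. */
    static const uint8_t fat_magic[4] = { 0xca, 0xfe, 0xba, 0xbe };

    return (len >= 4 && memcmp(hdr, fat_magic, 4) == 0) ? 0 : -1;
}

static int shell_imgact_sketch(const uint8_t *hdr, size_t len)
{
    return (len >= 2 && hdr[0] == '#' && hdr[1] == '!') ? 0 : -1;
}

struct execsw_sketch {
    int (*imgact)(const uint8_t *hdr, size_t len);
    const char *name;
};

/* NULL-terminated table, tried from top to bottom. */
static const struct execsw_sketch execsw_table[] = {
    { macho_imgact_sketch, "Mach-o Binary" },
    { fat_imgact_sketch,   "Fat Binary" },
    { shell_imgact_sketch, "Interpreter Script" },
    { NULL, NULL }
};

/* Returns the name of the first activator that accepts the header, or NULL. */
static const char *pick_loader(const uint8_t *hdr, size_t len)
{
    for (size_t i = 0; execsw_table[i].imgact != NULL; i++) {
        if (execsw_table[i].imgact(hdr, len) == 0) {
            return execsw_table[i].name;
        }
    }
    return NULL;
}
```

Keeping the formats in a table means adding a binary format is a matter of adding a row, and the dispatch loop itself never needs to know what a Mach-O file looks like.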