EVALUATING COMPLEX AI APPLICATIONS