The best way to index this with Solr is to split out the nested data structures (cpcs) to flat documents and have the patent_id included there. That way an arbitrary combination of partial cpcs can be searched.