GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:46:52

Sequence gi568815592r:99269603_99494179 : 224577 bp : 39.54% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -    261    256    6                               1.05
 1.05 Term -  11851  11562  290  2  2   82   45   399 0.990  29.55
 1.04 Intr -  19855  19717  139  1  1   81  113    26 0.825   3.52
 1.03 Intr -  21646  21604   43  0  1   73   93    43 0.667   0.62
 1.02 Intr -  22218  22056  163  1  1  123   43   175 0.766  14.61
 1.01 Init -  26107  25906  202  1  1   57   94    86 0.489   5.19
 1.00 Prom -  31278  31239   40                              -6.85

 2.06 PlyA -  34355  34350    6                               1.05
 2.05 Term -  41336  41176  161  0  2   74   47   128 0.096   4.42
 2.04 Intr -  54065  53842  224  1  2   45   30   257 0.867  12.45
 2.03 Intr -  56457  56351  107  0  2   79  105   103 0.439   9.19
 2.02 Intr -  73431  73296  136  2  1   81   80    62 0.095   4.35
 2.01 Init -  79770  79505  266  0  2   79   71   155 0.077   9.54
 2.00 Prom -  96207  96168   40                              -3.75

 3.05 PlyA -  96290  96285    6                               1.05
 3.04 Term - 100218  99998  221  1  2  115   38   137 0.976   7.62
 3.03 Intr - 101985 101826  160  0  1   72  108    11 0.909   0.14
 3.02 Intr - 106580 106338  243  2  0   23   78   221 0.901  11.37
 3.01 Init - 107828 107784   45  2  0   52  100    25 0.571   0.86
 3.00 Prom - 110333 110294   40                              -5.85

 4.00 Prom + 118225 118264   40                              -6.15
 4.01 Sngl + 124162 124635  474  0  0   43   43   290 0.956  16.12
 4.02 PlyA + 126062 126067    6                               1.05

 5.13 PlyA - 126402 126397    6                               1.05
 5.12 Term - 132028 130938 1091  2  2   89   35   765 0.873  62.56
 5.11 Intr - 133108 132938  171  2  0   53   44   315 0.994  22.59
 5.10 Intr - 135100 135001  100  1  1   43   71   101 0.883   2.66
 5.09 Intr - 136566 136429  138  0  0   68   30   229 0.915  14.84
 5.08 Intr - 138669 138479  191  1  2   48   78   250 0.924  18.38
 5.07 Intr - 139742 139571  172  2  1    5   -2   159 0.883  -2.81
 5.06 Intr - 141323 141139  185  0  2   59   67   167 0.757  10.29
 5.05 Intr - 143132 142949  184  2  1    1   72   123 0.004   0.44
 5.04 Intr - 155686 155541  146  2  2   33   64   187 0.012   9.88
 5.03 Intr - 167797 167644  154  1  1   60   72   134 0.587   7.72
 5.02 Intr - 174060 173963   98  1  2   36  111    19 0.486  -2.19
 5.01 Init - 176777 176195  583  2  1   42   91   392 0.489  30.29
 5.00 Prom - 177461 177422   40                              -7.45

 6.00 Prom + 177778 177817   40                              -7.95
 6.01 Sngl + 179832 180965 1134  2  0   44   55   437 0.773  32.27
 6.02 PlyA + 181182 181187    6                               1.05

 7.08 PlyA - 183136 183131    6                               1.05
 7.07 Term - 187331 187203  129  2  0   80   47    70 0.037  -0.70
 7.06 Intr - 195145 195002  144  1  0   17  116   103 0.695   5.56
 7.05 Intr - 195534 195478   57  0  0   81   86    42 0.529   1.46
 7.04 Intr - 197161 197070   92  2  2    8   51   144 0.278   1.49
 7.03 Intr - 199016 198935   82  2  1   43   95    68 0.020   1.29
 7.02 Intr - 218693 218598   96  2  0   84  105    42 0.931   4.79
 7.01 Init - 219084 219079    6  0  0   89  110     0 0.836   3.33

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  63909  63749  161  0  2   76  105    99 0.926   9.74
S.002 Term -  79626  79505  122  1  2  103   43   116 0.816   6.26
S.003 Init -  79820  79736   85  2  1   40   40   159 0.817   5.33
S.004 Term - 166244 166114  131  0  2   94   36    95 0.884   2.26

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:99269603_99494179|GENSCAN_predicted_peptide_1|278_aa MPCEGVLVRILQRNRTIRIYYMELAHTIMEADKSQDLQDELASWRPRRDQCPSLKAVRQK EICRRYSGDKKYIMGPKLSTLDATVFGHLAQAMWTLPGTRPERLIKGKPYTLLQVLGLTG AGCSESKLFFDVAHVTAGLDILASSDPPTSVSQVAEITGMNHCAQLTYDIFSFQWVYGDV SPRELINLAMYCERIRRKFWPEWHHDDDNTIYESEESSEGSKTHTPLLDFSFYSRTETFE DEGAENSFSRTPDTDFTGHSLFDSDVDMDDYTDHEQCK >gi568815592r:99269603_99494179|GENSCAN_predicted_CDS_1|837_bp atgccctgcgaaggtgtattagtcaggattcttcagagaaaccgaaccattaggatttat tatatggagttggctcacacaattatggaagctgacaagtcccaagatctgcaggatgag ctggcaagctggagacctaggagggaccagtgtcccagtttgaaggcagttaggcagaag gaaatctgtcgtcgttactcaggtgataagaagtacatcatggggcccaagctttccact cttgacgccactgtctttggacacttggcacaggcaatgtggaccttaccagggacaaga cccgaacggctgatcaaaggcaagccctacactcttcttcaggttttggggcttactggg gctggatgctctgagtctaaactgtttttcgatgttgcacatgtaactgctggtcttgac 
atcctggcctcaagtgatcctcctacctcagtctcccaagtagctgagataacaggtatg aaccactgtgcccagctgacttatgatatttttagctttcaatgggtttatggagatgta tccccacgtgagctgatcaaccttgccatgtactgtgagaggataaggaggaaattttgg ccagagtggcaccacgatgatgacaataccatctatgagtctgaggagagcagcgaaggc agcaaaacccacaccccgctgctggattttagcttttactcaaggacagagacctttgaa gatgagggagcagaaaacagtttttccagaaccccagacacagattttactggacactca ctctttgattcggatgtggacatggatgactatacagaccacgaacagtgcaagtga >gi568815592r:99269603_99494179|GENSCAN_predicted_peptide_2|297_aa MHWGVGFASSRPCVVDLSWNQSISFFGWWAGSEEPFSFYGDIIAFPLQDYGGIMAGLGSD PWWKKTLYLTGGALLAAAAYLLHELLVIRKQQEIDSKDAIILHQFARPNNGVPSLSPFCL KMETYLRMADLPYQASSNAACTHVPLTEVLAVRQAADALHAYTFTLCSARTLAYCQWVDN LNETRKMLSLSGGGPFSNLLRWVVCHITKGIVKREMHGHGIGRFSEEEIYMLMEKDMRSL AGLLDHATAPHRDAHLISTLGLQVLARSCLLTTQQPHTHTFYESEWEKPSEPGSGKG >gi568815592r:99269603_99494179|GENSCAN_predicted_CDS_2|894_bp atgcactggggggttggctttgcttcgtccaggccgtgcgtggtggatctgagctggaac cagagcatctccttcttcggctggtgggccgggtccgaggagcccttctccttttatggg gacatcatcgctttccctttgcaggattacggtgggatcatggcagggctgggctccgat ccctggtggaagaaaaccctttacttgaccgggggagctttgctggccgcagctgcgtat ctgctccacgaactcctggtcattaggaaacagcaagagattgactctaaagatgctatt attttgcatcagtttgcaagacctaacaatggtgttccaagtttatctcctttctgttta aagatggaaacttatttaaggatggctgacttaccgtatcaggcttcctcaaatgcagcc tgtacccacgtgccccttacagaagttcttgcagttcgtcaggcagctgatgctcttcat gcgtacaccttcaccctctgctctgcccggacattagcttattgccagtgggtggacaat ctcaatgagacccggaagatgctctctcttagtggtggtggtcccttcagcaacctgctg aggtgggttgtgtgccacataacgaaaggaattgtgaaacgcgagatgcacggccacggc attggccgcttctccgaggaagagatttacatgctgatggagaaggacatgcggtcttta gcagggcttttggatcatgcaacagcacctcatagagatgcccacctcatctccacatta ggccttcaagtcttggccagaagctgcttgcttacaacacagcagccacacactcatacc ttctatgaaagtgaatgggaaaagccttcagaaccaggcagtggaaaaggatga >gi568815592r:99269603_99494179|GENSCAN_predicted_peptide_3|222_aa MKILDVGCGGGLLTEPLGRLGASVIGIDPVDENIKTAQCHKSFDPVLDKRIEYRVCSLEE IVEETAETFDAVVASEVVEHVIDLETFLQCCCQVLKPGGSLFITTINKTQLSYALGIVFS 
EQIASIVPKGTHTWEKFVSPETLESILESNGLSVQTVVGMLYNPFSGYWHWSENTSLNYA AYAVKSRVQEHPASAEFVLKGETEELQANACTNPAVHEKLKK >gi568815592r:99269603_99494179|GENSCAN_predicted_CDS_3|669_bp atgaagattcttgacgttggctgtggtggtgggctgttaactgaacctctagggcggctt ggggcttcagttattggaatcgaccctgtggatgagaacattaaaacagcacaatgccat aaatcatttgatccagtcctggataagagaatagagtacagagtgtgttccctggaagag attgtggaagagactgcagaaacatttgatgctgttgtagcttctgaagttgtagaacat gtgattgatctagaaacatttttacagtgctgctgtcaagtgttaaaacccggtggttct ttattcattactacaatcaacaaaacacaactttcctatgccttgggaattgttttttca gagcaaattgcaagtattgtaccaaaaggtactcatacatgggagaagtttgtttcacct gaaacactagagagcattctggaatcaaatggtctgtcagttcaaacagtggtaggaatg ctctataaccccttctcaggttactggcattggagtgaaaataccagccttaactatgca gcttatgctgtgaaatccagggtccaggaacacccagcctctgctgagtttgttttaaag ggagaaacagaagagctccaagctaatgcctgcaccaatccagctgtgcatgaaaagctg aagaaatga >gi568815592r:99269603_99494179|GENSCAN_predicted_peptide_4|157_aa MTFRLQHTQSGSPAPGARSSATRDPRPALFQTGPLVLSSFPRRPPTGLCEVYNGERWGQD QALGFALTPREVQRQPTAPPPAHAQRRQRSQLTACEGPPHTHSPAEEIKGRAAFVLQPPG PSTLKNQPPEEPSLRPLHIATNDPTSFPVPPEKRSPT >gi568815592r:99269603_99494179|GENSCAN_predicted_CDS_4|474_bp atgaccttcagactgcagcacacgcagagcgggtcgccggctcctggggcaaggagctca gccacaagagacccgcggcctgcgcttttccagacaggaccactggtcctgagctctttt ccccggcgtccgccaacgggactatgcgaggtttacaatggagagagatggggccaggac caagccttaggtttcgcgctgacccctcgcgaggtccagagacaacccaccgccccaccc ccggcgcatgcgcagagacgccagcgctcgcaattgactgcatgcgaaggacctccgcac acacactcacccgccgaggaaattaagggacgcgcagcttttgtattacagcctccaggc cccagcactcttaaaaaccaacccccggaggagcccagcttacggccactccacatcgcg acaaacgatcccacctcttttccggtccctcccgaaaaaagatccccaacataa >gi568815592r:99269603_99494179|GENSCAN_predicted_peptide_5|1070_aa MNGDSLMFASLMNSESRLNESPTDDSEKEASHSESNVDADSEPSESESASKQTGLFRSSS GSGVQPDGPLYPLSAGKLLYTKETDSGDKEMAEAISELRLSSTVTGDQDFDRENQPLNIS NNLCFLEGKHLRSYSPQNAFQTLSQSYITTSKECSIQSCLYQFTSMELLMGNNKLLCENC TKNKQKYQEETSFAEKKVEGVYTNARKQLLISAVPAVLILHLKRFHQNASVGDKVLYGLY GIVEHSGSMREGHYTAYVKVRTPSRKLSEHNTKKKNVPEVEASKALERCYRNGGDKGVPE 
LEWVSDGEQLYVGTCRTITTFLVSVAKIDWAALAQAWIAQREASGQQSMVEQPPGMMPNG QDMSTMESGPNNHGNFQGDSNFNRMWQPDQPWMPPTPGPMDIVPPSEDSNSQDSGEFAPD NRHIFNQNNHNFGGPPDNFAVGPVNQFDYQHGAAFGPPQGGFHPPYWQPGPPGPPAPPQN RRERPSSFRDRQRSPIALPVKQEPPQIDAVKRRTLPAWIREGLEKMEREKQKKLEKERME QQRSQLSKKEKKATEDAEGGDGPRLPQRSKFDSDEEEEDTENVEAASSGKVTRSPSPVPQ EEHSDPEMTEEEKEYQMMLLTKMLLTEILLDVTDEEIYYVAKDAHRKATKGGLGGYGSGD SEDERSDRGSESSDTDDEELRHRIRQKQEAFWRKEKEQQLLHDKQMEEEKQQTERVTKEM NEFIHKEQNSLSLLEAREADGDVVNEKKRTPNETTSVLEPKKEHKEKEKQGRSRSGSSSS GSSSSNSRTSSTSSTVSSSSYSSSSGSSRTSSRSSSPKRKKRHSRSRSPTIKARRSRSRS YSRRIKIESNRARVKIRDRRRSNRNSIERERRRNRSPSRERRRSRSRSRDRRTNRASRSR SRDRRKIDDQRGNLSGNSHKHKGEAKEQERKKERSRSIDKDRKKKDKEREREQDKRKEKQ KREEKDFKFSSQDDRLKRKRESERTFSRSGSISVKIIRHDSRQDSKKSTTKDSKKHSGSD SSGRSSSESPGSSKEKKAKKPKHSRSRSVEKSQRSGKKASRKHKSKSRSR >gi568815592r:99269603_99494179|GENSCAN_predicted_CDS_5|3213_bp atgaatggggattctttaatgtttgccagcctcatgaattctgagtcacgtctgaatgaa agccctactgatgacagtgaaaaagaagccagccattctgaaagcaatgttgatgctgac agtgagccttcagaatctgaaagtgcttcaaagcagactgggctgttcagatccagtagt ggatccggtgtgcagccagatggacccctttaccctctgtcagcaggtaaactgctgtac accaaggagactgacagtggtgataaggaaatggcagaagctatttctgaacttcgtttg agcagcactgtaactggagatcaagattttgacagagaaaatcagccactaaatatttca aataatttatgttttttagaggggaagcatttgaggtcttatagtccccaaaatgctttt cagaccctttctcagagctatataactacttctaaagaatgttcaattcagtcctgtctc taccagtttacatctatggaattactaatggggaataataagcttctatgtgagaattgt actaaaaacaaacagaagtaccaagaagaaaccagttttgcagaaaagaaagtagaagga gtttatactaatgccaggaagcaattgctcatttctgctgttccagctgtcctaattctc cacctgaaaagatttcatcagaatgcaagtgtgggagataaagttctctacggtctctat ggcatagtggaacatagtggctcgatgagagaaggccactacactgcttatgtgaaagtg agaacaccctccaggaaattatcggaacataacactaaaaagaaaaatgtgcctgaagta gaagcatcgaaagcgttggagaggtgttaccggaacggcggcgacaagggtgttcccgaa ctagagtgggtaagtgatggagaacaattatacgtgggcacttgccggaccattacaacg ttcttggtttctgtggcgaagattgattgggctgcattggcccaagcttggattgcccaa agagaagcttcaggacagcaaagcatggtagaacaaccaccaggaatgatgccaaatgga 
caagatatgtctacaatggaatctggtccaaacaatcatgggaatttccaaggggattca aacttcaacagaatgtggcaaccagatcagccatggatgccaccaacaccaggcccaatg gacattgttcctccttctgaagacagcaacagtcaggacagtggggaatttgcccctgac aacaggcatatatttaaccagaacaatcacaactttggtggaccacccgataattttgca gtggggccagtgaaccagtttgactatcagcatggggctgcttttggtccaccgcaaggt ggatttcatcctccttattggcaaccaggacctccaggacctccagcacctccccagaat cgaagagaaaggccatcatcattcagggatcgtcagcgttcacctattgcacttcctgtg aagcaggagcctccacaaattgacgcagtaaaacgcaggactcttcccgcttggattcgc gaaggtcttgaaaaaatggaacgtgaaaagcagaagaaattggagaaagaaagaatggaa caacaacgttcacaattgtccaaaaaagaaaaaaaggccacagaagatgctgaaggaggg gatggccctcgtttacctcagagaagtaaatttgatagtgatgaggaagaagaagacact gaaaatgttgaggctgcaagtagtgggaaagtcaccagaagtccatccccagttcctcaa gaagagcacagtgaccctgagatgactgaagaggagaaagagtatcaaatgatgttgctg acaaaaatgcttctaacagaaattctgctggatgtcacagatgaagaaatttattacgta gccaaagatgcacaccgcaaagcaacgaaaggtggactgggtggttatggatcaggagac agtgaagatgagaggagtgacagaggatctgagtcatctgacactgatgatgaagaatta cggcatcgaatccggcaaaaacaggaagctttttggagaaaagaaaaagaacagcagcta ttacatgataaacagatggaagaagaaaagcagcaaacagaaagggttacaaaagagatg aatgaatttatccataaagagcaaaatagtttatcactactagaagcaagagaagcagac ggtgatgtggttaatgaaaagaagagaactccaaatgaaaccacatcagttttagaacca aaaaaagagcataaagaaaaagaaaaacaaggaaggagtaggtcgggaagttctagtagt ggtagttccagtagcaatagcagaactagtagtactagtagtactgtctctagctcttca tacagttctagctcaggtagtagtcgtacttcttctcggtcttcttctcctaaaaggaaa aagagacacagtaggagtagatctccaacaatcaaagctagacgtagcaggagtagaagc tattctcgcagaattaaaatagagagcaatagggctagggtaaagattagagatagaagg agatctaatagaaatagcattgaaagagaaagacgacgaaatcggagtccttcccgagag agacgtagaagtagaagtcgctcaagggatagacgaaccaatcgtgccagtcgcagtagg agtcgagataggcgtaaaattgatgatcaacgtggaaatcttagtgggaacagtcataag cataaaggtgaggctaaagaacaagagaggaaaaaggagaggagtcgaagtatagataaa gataggaaaaagaaagacaaagaaagggaacgtgaacaggataaaagaaaagagaaacaa aaaagggaagaaaaagattttaagttcagtagtcaggatgatagattaaaaaggaaacga gaaagtgaaagaacattttctaggagtggttctatatctgttaaaatcataagacatgat 
tctagacaggatagtaagaaaagtactaccaaagatagtaaaaaacattcaggctctgat tctagtggaaggagcagttctgagtctccaggaagtagcaaagaaaagaaggctaagaag cctaaacatagtcgatcgcgatccgtggagaaatctcaaaggtctggtaagaaggcaagc cgcaaacacaagtctaagtcccgatcaaggtag >gi568815592r:99269603_99494179|GENSCAN_predicted_peptide_6|377_aa MVKGSIQQEELTILNIYAPNTGAPNTGAPTFIKQVPRDLQRNLDSHTIIMGDFNTPLSTL DRSMRQKVNKDSQELNSALHQVDLVDMNRTLHPKSTEYTFFSAHHTYSKIDHTVGSQALL SKCKRTEIITNCLSDHSAIKLELRIKKLTQNRSTTWKLKNLLLNDYWLHNEMKAEIKMLF ETNENKDTTYQNLWDTFKAVCRGKFIVLNAHKRKQERSKIDTLTSQLKELEKQEQTNSKA SRRQEITKIRAELKEIETQKKTLQKINESRSWFFEKINKIDKPLARQIKKKREKNQIGTI KSDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLHTYTLPRLNQEELESLNRSI TGSEIEAIINSLPIKKS >gi568815592r:99269603_99494179|GENSCAN_predicted_CDS_6|1134_bp atggtaaagggatcaattcaacaagaagagctaactatcctaaatatatatgcacccaat acaggagcacccaatacaggagcacccacattcataaagcaagtccccagagacctacaa agaaacttagactcccacacaataataatgggagactttaacaccccactgtcaacatta gacagatcaatgagacagaaagttaacaaggatagccaggaattgaactcagctctgcat caagtggacctagtagacatgaacagaactctccaccccaaatcaacagaatatacattc ttctcagcacatcacacttattccaaaattgaccacacagttggaagtcaagcactcctc agcaaatgtaaaagaacagaaattataacaaattgtctctcagaccacagtgcaatcaaa ctagaactcaggattaagaaactcactcaaaaccgctcaactacatggaaactgaaaaac ctgctcctgaatgactactggttacataacgaaatgaaggcagaaataaagatgctcttt gaaaccaatgagaacaaagacacaacataccagaatctatgggacacatttaaagcagtg tgtagagggaaatttatagtactaaatgcccacaagagaaagcaggaaagatctaaaatt gacaccctaacatcacaattaaaagaactagagaagcaagagcaaacaaattcaaaagct agcagaaggcaagaaataactaagatcagagcagaactgaaagagatagagacacaaaaa aaaacccttcaaaaaatcaatgaatccaggagctggttttttgaaaagatcaacaaaatc gataaaccactagcaagacaaataaagaagaaaagagagaagaatcaaataggcacaata aaaagtgataaaggggatatcaccactgatcccacagaaatacaaactaccatcagagaa tactataaacacctctacgcaaataaactagaaaacctagaagaaatggataaattcctc cacacatacaccctcccaagactaaaccaggaagaacttgaatccctgaatagatcaata acaggctctgaaatagaggcaataattaatagcctaccaatcaaaaaaagttga >gi568815592r:99269603_99494179|GENSCAN_predicted_peptide_7|201_aa 
MQNLAQTYTLTDLMNEIKESSTKLKIFPSSDSQLRIQASILKAFNNPTTKTADDETRKKV KAYGKEGVKMNFIDRIFIGELTSTVMCEECANISTVKDPFIDISLPIIEERVSKPLLWGR MNKYRSLRETDHDRYSGNVTIENIHQPRAAKKHSSSKDKVCSLLRKRIQRYFSHLLLKEE KYGSVPPAHQRSEFKVISPIP >gi568815592r:99269603_99494179|GENSCAN_predicted_CDS_7|606_bp atgcagaacttggcacagacttatactcttactgatctgatgaatgagatcaaagaaagt agtacaaaactcaagatttttccttcctcagactctcagctgcgaatacaagctagcatt ctaaaagcatttaacaacccaactactaaaactgctgatgatgaaactagaaaaaaagtc aaagcatatggaaaagaaggtgtgaaaatgaacttcatagatcggatctttattggtgaa ttaactagcacggtcatgtgtgaagaatgtgcaaatatctccacggtgaaagatccattc attgatatttcacttcctataatagaagaaagggtttcaaaacctttactttggggaaga atgaataaatatagaagtttacgggagacagatcatgatcgatacagtggcaatgttact atagaaaatattcatcaacctagagctgccaagaagcattcttcatctaaagataaggta tgttccttgctgagaaaaagaattcagcgatatttctcccatttgcttttgaaagaagag aaatatggctctgttccgccagctcaccagaggtcagagtttaaggttatctctcctatt ccctga