GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:09:26

Sequence gi568815597r:167819979_168035841 : 215863 bp : 38.86% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.25 PlyA -     43     38    6                               -0.45
 1.24 Term -    347     82  266  0  2   79   43   270 0.207  16.29
 1.23 Intr -   3145   3030  116  0  2  107  119    74 0.996  11.57
 1.22 Intr -   4594   4498   97  2  1   86   68    44 0.981   0.35
 1.21 Intr -   4877   4673  205  2  1  122   82   149 0.953  15.55
 1.20 Intr -  13184  13009  176  0  2  113   86   121 0.984  13.14
 1.19 Intr -  14099  13997  103  2  1   74  100    40 0.980   2.63
 1.18 Intr -  16562  16331  232  1  1   28  103   184 0.350  10.75
 1.17 Intr -  17340  17271   70  1  1   96   80    49 0.331   2.22
 1.16 Intr -  25875  25585  291  1  0  108   25   309 0.559  22.78
 1.15 Intr -  26285  26007  279  0  0   78   92   194 0.995  15.33
 1.14 Intr -  28511  28383  129  0  0  104  116    86 0.994  12.75
 1.13 Intr -  34511  34375  137  1  2   95   89     0 0.951   0.09
 1.12 Intr -  36461  36187  275  2  2   75   99   250 0.962  20.21
 1.11 Intr -  41085  40893  193  2  1   83   93   110 0.174   9.57
 1.10 Intr -  58657  58468  190  2  1   81   65   135 0.381   8.22
 1.09 Intr -  60213  60137   77  2  2   53   69    36 0.681  -3.56
 1.08 Intr -  60631  60513  119  1  2  110   81    38 0.869   3.64
 1.07 Intr -  63650  63459  192  2  0   58  115   166 0.962  15.07
 1.06 Intr -  73963  73875   89  2  2  117   68    85 0.926   8.17
 1.05 Intr -  76713  76617   97  0  1   83  106    18 0.151   1.76
 1.04 Intr -  79650  79445  206  1  2   78   98   282 0.998  26.20
 1.03 Intr -  81827  81684  144  0  0   47   87   125 0.586   7.63
 1.02 Intr -  82076  82038   39  0  0   94   80    43 0.564   1.38
 1.01 Init -  84002  83909   94  2  1   61   89    77 0.691   5.69
 1.00 Prom -  85675  85636   40                               -4.55

 2.06 PlyA -  88133  88128    6                                1.05
 2.05 Term -  89909  89826   84  2  0  126   42     6 0.337  -3.53
 2.04 Intr -  91715  91548  168  2  0   58   75   164 0.693  11.32
 2.03 Intr - 104559 104519   41  1  2  132  106   -15 0.014   1.72
 2.02 Intr - 115920 115755  166  0  1   46   95   293 0.990  24.41
 2.01 Init - 116212 116156   57  1  0   99   73    78 0.790   6.80
 2.00 Prom - 120248 120209   40                               -7.55

 3.00 Prom + 121550 121589   40                               -7.45
 3.01 Init + 124078 124143   66  0  0  101   72    73 0.723   6.12
 3.02 Intr + 146651 146743   93  1  0  110   98     0 0.768   2.44
 3.03 Intr + 154852 155037  186  0  0  117   64   113 0.976  10.76
 3.04 Intr + 167517 167630  114  2  0  101   58    48 0.832   2.82
 3.05 Intr + 171226 171361  136  0  1   65  110   155 0.999  14.62
 3.06 Intr + 173248 173462  215  2  2  124   92   158 0.999  17.31
 3.07 Intr + 182504 182597   94  1  1   50   48   155 0.786   6.32
 3.08 Intr + 183892 184011  120  2  0   59   76    59 0.561   1.35
 3.09 Intr + 184555 184815  261  2  0   32   81   171 0.803   7.34
 3.10 Term + 195803 195981  179  0  2   69   35   154 0.725   5.07
 3.11 PlyA + 196671 196676    6                                1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  79062  78901  162  0  0   70   42   125 0.834   3.05
S.002 Term - 115567 115431  137  2  2   51   47   130 0.843   2.40
S.003 Init + 116934 117030   97  2  1   71   80   114 0.878   9.48

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:167819979_168035841|GENSCAN_predicted_peptide_1|1271_aa
MTEKFSSAMYMDRGAEQLVEILNYHISAIVEKVLIFGGDILKFAGDALLALWRVERKQLK
NIITVVIKCSLEIHGLFETQEWEEGLDIRVKIGLAAGHISMLVFGDETHSHFLVIGQAVD
DVRLAQNMAQMNDVILSPNCWQLCDRSMIEIESVPDQRAVKVNFLKPPPNFNFDEFFTKC
TTFMHYYPSGEHKNLLRLACTLKPDPELEMSLQKYVMESILKQIDNKQLQGYLSELRPVT
IVFVNLMFEDQDKAEEIGPAIQDAYMHITSVLKIFQGQINKVFMFDKGCSFLCVFGFPGE
KVPDELTHALECAMDIFDFCSQVHKIQTVSIGVASGIVFCGIVGHTVRHEYTVIGQKVNL
AARMMMYYPGIVTCDSVTYNGSNLPAYFFKELPKKVMKGVADSGPLYQYWGRTEKVIIAI
SLNKISFHQTFYTIQMFMANVLGLDTCKHYKERQTNLRNKVMTLLDEKFYCLLNDIFHVQ
IVKEERIIFIIDEAQFVDSTSWRFMEKLIRTLPIFIIMSLCPFVNIPCAAARAVIKNRNT
TYIVIGAVQPNDISNKICLDLNVSCISKELDSYLGEGSCGIPFYCEELLKNLEHHEVLVF
QQTESEEKTNRTWNNLFKYSIKLTEKLNMVTLHSDKESEEVCHLTSGVRLKNLSPPTSLK
EISLIQLDSMRLSHQMLVRCAAIIGLTFTTELLFEILPCWNMKMMIKTLATLVESNIFYC
FRNGKELQKALKQNDPSFEVHYRSLSLKPSEGMDHGEEEQLRELENEVIECHRIRFCNPM
MQKTAYELWLKDQRKAMHLKCARFLEEDAHRCDHCRGRDFIPYHHFTVNIRLNALDMDAI
KKMAMSHGFKTEEKLILSNSEIPETSAFFPENRSPEEIREKILNFFDHVLTKMKTSDEDI
IPLESCQCEEILEIVILPLAHHFLALGENDKALYYFLEIASAYLIFCDNYMAYMYLNEGQ
KLLKTLKKDKSWSQTFESATFYSLKGLFQYGPDSACQENAEEGTEAPQPNLSLQLNLLVS
PYPCREKQTLSLCESAGPREPTSRIIKAYLDYSLYHHLAGYKGVWFKYEVMAMEHIFNLP
LKGEGIEIVAYVAETLVFNKLIMGHLDLAIELGSRALQMWALLQNPNRHYQSLCRLSRCL
LLNSRYPQLIQVLGRLWELSVTQEHIFSKAFFYFVCLDILLYSGRLVPFPETWLGGRRCG
ASSPVVATTMDQDPVGPVERGEAVAASGAAAAAAFGESAGQEIKGSKKLSHGPKGNVDVR
TTIAKFYLKDE

>gi568815597r:167819979_168035841|GENSCAN_predicted_CDS_1|3816_bp
atgactgagaagttcagcagtgccatgtacatggacagaggggctgagcagttggtggag
atcctcaactaccacataagtgcaatagtggagaaagtgttgatttttggaggagacatc
ctgaaatttgcaggtgatgcactgctagccctgtggagggtggagcgaaagcagctgaaa
aacattatcacagtggtaattaaatgtagcctggagatccatggattgtttgagacccag
gagtgggaagaaggcctagacatccgagtcaagataggactggctgctggccacatcagc
atgttggtctttggagatgaaacacacagccactttctggtgattggtcaggcagtggac
gatgtgcgccttgcccagaacatggctcagatgaatgatgttattctgtcaccaaactgc
tggcagctctgtgaccggagcatgattgaaattgagagtgttccagatcagagagcagtt
aaggttaacttcttaaaaccaccccccaattttaattttgatgaatttttcacaaagtgt
acgaccttcatgcattattatccttctggtgagcacaaaaacctcctgaggcttgcatgc
acgctgaagcctgatcctgaactggagatgtccctacaaaagtatgtgatggaaagcatt
ttgaagcagattgataacaaacagcttcagggctatttatctgagcttcgcccagtgacg
attgtgtttgtgaacctgatgtttgaagaccaagacaaagcagaagagataggcccagcc
atccaggatgcctatatgcacatcacttctgtcctgaagatcttccaaggccaaatcaat
aaagtcttcatgtttgacaagggctgctctttcctctgtgtctttggcttccctggggaa
aaggtacctgacgagctcactcatgctctggaatgtgctatggatatatttgacttctgc
tctcaagtccacaaaatccaaactgtatccatcggtgttgccagtgggattgtcttctgt
gggatcgttggacacactgtgagacacgagtacacagtcattggtcaaaaagtcaactta
gctgccaggatgatgatgtactacccaggaattgtgacctgcgactctgtcacctacaat
gggagcaacctaccagcgtacttttttaaagagcttccaaagaaagttatgaaaggtgtt
gcagattctggaccattgtatcagtattggggccgtactgagaaagtgattattgccatt
tcattgaataagatcagcttccatcaaactttctataccatccagatgttcatggccaat
gtcctaggcctagacacttgtaaacattataaagaacgacagaccaaccttcgaaataaa
gtcatgacactgttggatgaaaagttctactgtcttcttaatgacattttccatgttcag
atagtgaaagaggaaaggattatttttatcattgatgaggcccagtttgtggattcgacc
tcctggagatttatggagaagcttatccggactcttcctatcttcatcattatgtccctg
tgtcccttcgttaacattccctgtgcagctgccagggccgtaataaagaacaggaacacc
acctacattgtcattggtgcagtacagcctaacgacatctccaacaagatctgtcttgac
ctcaatgtgagctgcatctccaaagaactggactcgtacctgggggagggaagctgtggg
attccattttactgtgaagaattgcttaaaaacctggaacatcatgaggtactcgttttc
caacaaacggagtctgaggaaaagacaaataggacctggaataacctgttcaagtattcc
attaagctaacagagaagttaaacatggttactctccatagtgataaggaaagtgaagaa
gtctgtcacctcacaagtggtgtcagactgaaaaacctgtcacctccaacgtcattaaaa
gaaatctctctgatccagctggatagcatgagactttcccaccaaatgctggtgagatgt
gctgccatcattggcctgaccttcaccactgagttgttgtttgagattctcccctgttgg
aatatgaagatgatgatcaagaccctggcaaccctagtggaatctaacattttttattgt
ttccggaatggcaaggagcttcaaaaggccctgaaacagaatgatccctcatttgaggtg
cactatcgttccttgtctctgaagcccagtgaagggatggatcacggtgaagaggaacag
cttcgtgaactggagaatgaggtgatcgagtgccacaggattcgattctgtaaccctatg
atgcagaaaacagcctacgagctgtggctcaaggaccagagaaaagccatgcacttgaaa
tgtgcccgctttttagaagaagatgcccacagatgtgaccactgccgaggcagggacttc
attccctatcatcacttcacagtgaatattcggctcaacgctttagacatggatgccatt
aaaaagatggctatgtctcatggatttaaaactgaagaaaagcttatcttgtccaactca
gagattcctgagacatctgcattttttcctgaaaatcgcagtcctgaagaaataagagaa
aagatcttgaatttctttgaccacgttttaacaaaaatgaagacatctgacgaagacatt
atccctctggaatcttgccagtgtgaagaaatcctagagattgtcatcttgcctctggcc
caccattttctggctttgggagaaaatgacaaagccttatattacttcttagaaattgca
tctgcttatctcatcttttgtgataactacatggcatacatgtatttgaatgaaggacag
aagttgctaaaaactctcaagaaggacaaatcttggagccagacatttgagtctgccacc
ttttacagcctcaaaggtctgtttcaatatgggccagatagtgcttgccaagaaaatgct
gaggaaggcactgaagctcctcaaccgaatctttccttacaacttaatctccttgtttct
ccatatccatgtcgagaaaaacagacactttcattatgtgaatcggcaggcccaagagag
cccacctccaggatcattaaggcttacctagactattcgctataccaccacctggctggc
tacaaaggtgtgtggttcaaatatgaagtcatggccatggagcacatcttcaacctcccc
ctgaaaggcgagggcattgaaatcgtggcatacgtggctgagacactggtcttcaacaag
ctcataatgggacacctggatttggccattgagttaggctcccgagcccttcagatgtgg
gcactgctccagaatcccaaccgacattatcagtccctctgcagacttagcagatgtctc
cttctgaacagcagatacccgcaattgatccaggtgctggggcggctgtgggagctttct
gtaacacaggaacacatcttcagcaaggcatttttctattttgtctgcttggacatcctg
ctttattctggccggctagtccccttcccagagacttggctaggcggccggcgctgcggc
gccagcagccctgtggtggcgacgacgatggaccaggacccagtgggccctgtggaacga
ggagaagccgtcgcagcctcgggagctgcggccgccgcggcattcggggaatctgcgggg
caggaaattaagggatccaaaaagctgagtcatggtccaaaaggaaacgttgatgtcagg
acaaccatagccaaattttatctcaaggatgagtaa

>gi568815597r:167819979_168035841|GENSCAN_predicted_peptide_2|171_aa
MAGSSACASALPAGRSADLPGTRTQPRSHPAANDPSAAMSAAGARGLRATYHRLLDKVEL
MLPEKLRPLYNHPAGPRTVFFWAPIMKWRGYTRQQFCHESTLNKGDGVIYNLTRSPYCPL
AGTGPHILYLSRSASNLELLKEAKLLLPLSLQRQETVREMACLLTTTYLAR

>gi568815597r:167819979_168035841|GENSCAN_predicted_CDS_2|516_bp
atggcgggcagctcggcctgcgcaagcgcgctgccagcagggcgcagcgcagacttgcca
gggacgagaacacagccacgctcccacccggctgccaacgatccctcggcggcgatgtcg
gccgccggtgcccgaggcctgcgggccacctaccaccggctcctcgataaagtggagctg
atgctgcccgagaaattgaggccgttgtacaaccatccagcaggtcccagaacagttttc
ttctgggctccaattatgaaatggagagggtatactcgccagcagttttgccacgagagt
acactgaacaaaggagacggggtcatttataacctgacacgttcaccctactgcccattg
gccggaacaggacctcacattctgtatttgtcccgatcggctagcaacttagaactttta
aaagaggcaaagcttttacttcctctttctcttcagagacaagagacagtaagagaaatg
gcctgtctcctcaccaccacttacctagcaagatga

>gi568815597r:167819979_168035841|GENSCAN_predicted_peptide_3|487_aa
MASIPRPCDLPASASQSAGITGVNTICWNDTGEYILSGSDDTKLVISNPYSRKVLTTIRS
GHRANIFSAKFLPCTNDKQIVSCSGDGVIFYTNVEQDAETNRQCQFTCHYGTTYEIMTVP
NDPYTFLSCGEDGTVRWFDTRIKTSCTKEDCKDDILINCRRAATSVAICPPIPYYLAVGC
SDSSVRIYDRRMLGTRATGNYAGRGTTGMVARFIPSHLNNKSCRVTSLCYSEDGQEILVS
YSSDYIYLFDPKDDTARELKTPSAEERREELRQPPVKRLRLRGDWSDTGPRARPESERER
DGEQSPNVSLMQRMSDMLSRWFEEASEVAQSNRGRGRSRPRGGTSQSDISTLPTVPSSPD
LEVSETAMEVDTPAEQFLQPSTSSTMSAQAHSTSSPTESPHSTPLLSSPDSEQRQSVEAS
GHHTHHQSEFLRGPEIALLRKRLQQLRLKKAEQQRQQELAAHTQQQPSTSDQSSHEGSSQ
DPHASGY

>gi568815597r:167819979_168035841|GENSCAN_predicted_CDS_3|1464_bp
atggcctcgatcccccgaccttgtgatctgcccgcctcggcctcccaaagtgctgggatc
acaggggttaatacaatctgttggaatgacactggagaatatattttatctggctcagat
gacaccaaattagtaattagtaatccttacagcagaaaggttttgacaacaattcgttca
gggcaccgagcaaacatatttagtgcaaagttcttaccttgtacaaatgataaacagatt
gtatcctgctctggagatggagtaatattttataccaacgttgagcaagatgcagaaacc
aacagacaatgccaatttacgtgtcattatggaactacttatgagattatgactgtaccc
aatgacccttacacttttctctcttgtggtgaagatggaactgttaggtggtttgataca
cgcatcaaaactagctgcacaaaagaagattgtaaagatgatattttaattaactgtcga
cgtgctgccacgtctgttgctatttgcccaccaataccatattaccttgctgttggttgt
tctgacagctcagtacgaatatatgatcggcgaatgctgggcacaagagctacagggaat
tatgcaggtcgagggactactggaatggttgcccgttttattccttcccatcttaataat
aagtcctgcagagtgacatctctgtgttacagtgaagatggtcaagagattctcgttagt
tactcttcagattacatatatctttttgacccgaaagatgatacagcacgagaacttaaa
actccttctgcggaagagagaagagaagagttgcgacaaccaccagttaagcgtttgaga
cttcgtggtgattggtcagatactggacccagagcaaggccggagagtgaacgagaacga
gatggagagcagagtcccaatgtgtcattgatgcagagaatgtctgatatgttatcaaga
tggtttgaagaagcaagtgaggttgcacaaagcaatagaggacgaggaagatctcgaccc
agaggtggaacaagtcaatcagatatttcaactcttcctacggtcccatcaagtcctgat
ttggaagtgagtgaaactgcaatggaagtagatactccagctgaacaatttcttcagcct
tctacatcctctacaatgtcagctcaggctcattcgacatcatctcccacagaaagccct
cattctactcctttgctatcttctccagacagtgaacaaaggcagtctgttgaggcatct
ggacaccacacacatcatcagtctgaatttttaaggggccctgagatagctttgcttcgt
aagcgcctgcaacaactgaggcttaagaaggctgagcagcagaggcagcaagagctagct
gcacatacccagcaacagccttccacttctgatcagtcttctcatgagggctcttcacag
gaccctcatgcttcaggttattaa