GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:55:31 Sequence gi568815596r:42662911_42864179 : 201269 bp : 44.76% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 316 440 125 0 2 69 68 48 0.268 0.55 1.02 Intr + 2659 2749 91 1 1 64 79 56 0.390 2.30 1.03 Intr + 19491 19679 189 2 0 80 78 107 0.572 8.68 1.04 Intr + 45964 46186 223 0 1 105 64 61 0.193 3.10 1.05 Intr + 56078 56164 87 0 0 113 80 3 0.319 1.94 1.06 Intr + 59979 60125 147 1 0 55 80 52 0.298 1.31 1.07 Intr + 64442 64576 135 0 0 100 68 79 0.031 7.64 1.08 Intr + 91309 91338 30 2 0 108 94 65 0.149 7.20 1.09 Term + 98167 98564 398 2 2 45 55 122 0.028 -0.16 1.10 PlyA + 99583 99588 6 1.05 2.15 PlyA - 99611 99606 6 1.05 2.14 Term - 101320 99998 1323 1 0 32 55 1293 0.539 111.97 2.13 Intr - 104237 104116 122 0 2 125 54 97 0.877 10.21 2.12 Intr - 104605 104497 109 1 1 112 69 30 0.961 3.36 2.11 Intr - 104767 104702 66 1 0 111 8 77 0.654 1.10 2.10 Intr - 105018 104818 201 0 0 47 64 124 0.957 5.38 2.09 Intr - 106948 106803 146 2 2 34 95 145 0.838 9.80 2.08 Intr - 107276 107233 44 1 2 140 81 57 0.999 8.08 2.07 Intr - 107672 107583 90 1 0 83 38 146 0.817 8.21 2.06 Intr - 120510 120404 107 0 2 101 99 100 0.620 11.41 2.05 Intr - 120957 120874 84 0 0 97 78 107 0.969 10.62 2.04 Intr - 125697 125619 79 2 1 60 92 140 0.328 11.15 2.03 Intr - 130462 130358 105 0 0 76 75 84 0.308 5.23 2.02 Intr - 133162 133091 72 0 0 125 81 19 0.296 3.42 2.01 Init - 134558 134446 113 2 2 46 59 107 0.495 3.28 2.00 Prom - 142460 142421 40 -6.46 3.04 PlyA - 145043 145038 6 1.05 3.03 Term - 147592 147359 234 1 0 47 49 205 0.857 8.82 3.02 Intr - 151208 151150 59 0 2 89 82 63 0.896 4.40 3.01 Init - 158730 158592 139 0 1 46 87 117 0.970 7.70 3.00 Prom - 161933 161894 40 -7.26 4.00 Prom + 162410 162449 40 -6.16 4.01 Init + 162730 162813 84 0 0 66 101 -6 0.507 -0.68 4.02 Intr + 163113 163196 84 2 0 79 72 29 0.306 0.42 4.03 Term + 164729 165004 276 1 0 70 33 223 0.536 10.46 4.04 PlyA + 165322 165327 6 -3.64 5.09 PlyA - 165485 165480 6 -0.45 5.08 Term - 166006 165924 83 2 2 68 47 118 0.649 3.56 5.07 Intr - 166245 166087 159 1 0 69 86 105 0.329 8.36 5.06 Intr - 187389 187181 209 2 2 74 52 82 0.723 1.92 5.05 Intr - 187826 187677 150 1 0 104 61 85 0.793 6.68 5.04 Intr - 188474 188304 171 1 0 28 110 73 0.260 2.56 5.03 Intr - 191375 191309 67 0 1 89 34 79 0.301 0.56 5.02 Intr - 193527 193466 62 2 2 58 105 73 0.361 4.38 5.01 Intr - 200667 200558 110 0 2 59 94 75 0.213 4.28 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:42662911_42864179|GENSCAN_predicted_peptide_1|474_aa MVESSRGSWADLPEVERACVGSWFTAAAAHCFPLLVNVQDSRARSALPWLEEQNVMSIVR ELDRSLRPAKVKFHAMDTLYRHSYDLSSAISVLVPLGGPVLCRDEMEEWSASEASLFEEA LEKYGKDFNDIRQDFDPRVRSHVSRQAMQGMPVRNTGSPKSAVKTRQAFFLHTTYFTKFA RQVCKNTLRLRQAARRPFVAINYAAIRAEYADRHAELSGSPLKSKSTRKPLACIIGYLEI HPAKKPNVIRSTPSLQTPTTKRMLTTPNHTSLSILGKRNYSHHNGLDGCRSSSVPYAVIV VSSPISSTVDIFTASKPGITYYVCPTWLTPSGSKPADEELLPDGELSPEKGTVGWGHSSD QGVSPTSMWRQPEACFAREHPTPFREGLANDGKAGKGGVTQTVSEEAVLRVKMGSKENPG QVRSAGLNPVTSIYPGTPQGQPLLSPLMGKDRTADMPANHGSICPFPFHQVMKE >gi568815596r:42662911_42864179|GENSCAN_predicted_CDS_1|1425_bp atggtggagagttctcgtggcagctgggctgacttgcctgaggtagagagggcctgtgtg ggctcttggttcacagctgcagcagctcactgcttcccactgctggtgaatgtgcaggac agcagagcaaggtctgcactcccctggcttgaagaacagaatgtcatgagcattgtgagg gagctggatcgaagccttagacctgccaaagtgaagtttcacgctatggatacattgtat agacacagctatgatttgagcagtgccattagtgtcttagtaccactcggaggacctgtt ttatgcagagatgaaatggaggaatggtcagcctctgaagctagcttatttgaagaggca ctggaaaaatatggcaaagacttcaatgacatacggcaagattttgaccctcgtgttaga agtcacgtgtcccgccaggccatgcagggaatgccagtccgaaacactgggagtccaaag tctgcagtgaagacccgccaagctttcttccttcatactacatatttcacaaaatttgct cgtcaggtctgcaaaaataccctccggctgcggcaggcagcaagacggccgtttgttgct attaattatgctgccattagggcagaatatgcagacagacatgctgaactatctggaagt ccactgaaaagcaaaagcactaggaagcctttggcatgtatcattgggtatttagagatc catcctgcaaagaaacctaatgtaattcgatctacaccaagcctgcaaaccccaactacc aagcggatgctaacaactccaaatcacacatctctgagcattctggggaaaagaaactac agtcatcacaatggtctggatggatgtcggtccagcagtgtgccgtatgctgtcattgtg gtttccagtcccatctcctccactgtggacatttttactgctagcaaaccaggtatcacc tactatgtttgccctacttggttaactccaagtggctccaaaccagctgatgaagaactg ctgccagatggagaactaagtccagagaagggtacagtgggctggggccatagttcggac cagggagtcagtcctacctccatgtggcggcagcctgaagcctgcttcgcgagggagcac ccaaccccgtttagagaaggccttgcaaatgatggaaaggcggggaagggtggtgtgacc cagactgtcagtgaggaagcagtcctcagggtcaagatggggagcaaggagaatccaggg caggttaggagtgcgggcctgaaccctgtgaccagcatctacccaggaactcctcaaggg cagcctctgctctctcccttgatgggaaaagacaggactgcagacatgcctgcaaaccat gggagtatctgtccatttcccttccatcaggtcatgaaggagtga >gi568815596r:42662911_42864179|GENSCAN_predicted_peptide_2|886_aa MDPPKIPHLKEKPYFGMRKMAVRWHHAENLVDRPQRGKTLTQASTHGGSSVPGDEISSWE TCCVRTEYVPKTPKEFSTGLGVHCDTARTAAARLGLRHQEQLKVMFIGGPNTRKDYHIEE GEEVFYQLEGDMVLRVLEQGKHRDVVIRQGEIFLLPARVPHSPQRFANTVGLVVERRRLE TELDGLRYYVGDTMDVLFEKWFYCKDLGTQLAPIIQEFFSSEQYRTGKPIPDQLLKEPPF PLSTRSIMEPMSLDAWLDSHHRELQAGTPLSLFGDTYETQVIAYGQGSSEGLRQNVDVWL WQLASGAQGQGCRTPCQRASARSSALGSHTGPTLGALGPRQTQTFTQEGSSVVTMGGRRL SLAPDDSLLVCLGANTRLCGPVCDPGPCLQEAPGVTLLPWPEAATDSVQHFHTKKALNKG FLRNAHLGVCRRQSSLAFPCAPQDHVEHRVFLSGCENADENPRMLCHRGGQLIVPIIPLC PEHSCRGRRLQNLLSGPWPKQPMELHNLSSPSPSLSSSVLPPSFSPSPSSAPSAFTTVGG SSGGPCHPTSSSLVSAFLAPILALEFVLGLVGNSLALFIFCIHTRPWTSNTVFLVSLVAA DFLLISNLPLRVDYYLLHETWRFGAAACKVNLFMLSTNRTASVVFLTAIALNRYLKVVQP HHVLSRASVGAAARVAGGLWVGILLLNGHLLLSTFSGPSCLSYRVGTKPSASLRWHQALY LLEFFLPLALILFAIVSIGLTIRNRGLGGQAGPQRAMRVLAMVVAVYTICFLPSIIFGMA SMVAFWLSACRSLDLCTQLFHGSLAFTYLNSVLDPVLYCFSSPNFLHQSRALLGLTRGRQ GPVSDESSYQPSRQWRYREASRKAEAIGKLKVQGEVSLEKEGSSQG >gi568815596r:42662911_42864179|GENSCAN_predicted_CDS_2|2661_bp atggatcctccgaaaatcccacacctgaaagagaaaccttattttggcatgcggaaaatg gcagtgcgctggcatcatgctgaaaatctggtggacaggccccaaagaggaaagacacta acccaggcttctacacatggtgggagcagcgttccaggtgatgagatcagcagctgggag acctgctgcgtgcgaactgagtatgttcctaagacgcctaaggagttcagcacggggctc ggggtccactgcgacactgcgcggacagcggccgctcggctgggcctcaggcaccaggag cagctcaaagtcatgttcatcggaggccccaacaccaggaaggactatcacatcgaagag ggtgaagaggtattttaccagctggagggagacatggttctccgagtcctggagcaaggg aaacaccgggatgtggtcattcggcagggagagatattcctcctgcctgccagggtgccc cactcaccacagaggtttgccaacaccgtggggctggtggttgagcgaaggcggctggag accgagctagatgggctcaggtactatgtgggcgacaccatggacgttctgtttgagaag tggttctactgcaaggacctcggcacgcagttggcccccatcatccaggagttcttcagc tctgagcagtacagaacaggaaagcccatccctgaccagctgctcaaggagccaccattc cctctgagcacacgatccatcatggagcccatgtccctggatgcctggctggacagccac cacagggagctgcaggcaggcacaccactcagcctgtttggggacacctatgagacccag gtgatcgcctatgggcaaggcagcagcgaaggcctgagacagaatgtggacgtgtggctg tggcagctggcaagtggggcccagggacagggttgcagaaccccctgccagagggcatca gcccgctcctcggccctgggctcccacacggggcccactcttggggccttggggcccagg cagactcagacattcacccaggagggctcctcggtggtgacaatggggggacggcgcctg agcctggcccctgatgacagcctcctggtatgcctgggagcgaacacaaggctctgtggc cctgtctgtgacccaggaccctgcctgcaagaagcccctggggtgaccctcttgccatgg cctgaagcagccacagactcagtgcagcacttccacaccaagaaggccctcaataaaggc ttcctgaggaacgcacacctgggggtctgccgcaggcagtcatcactggccttcccatgt gccccccaggaccatgtggaacacagggtgtttctcagtggctgcgagaatgctgatgaa aaccccaggatgttgtgtcaccgtggtggccagctgatagtgccaatcatcccactttgc cctgagcactcctgcaggggtagaagactccagaaccttctctcaggcccatggcccaag cagcccatggaacttcataacctgagctctccatctccctctctctcctcctctgttctc cctccctccttctctccctcaccctcctctgctccctctgcctttaccactgtggggggg tcctctggagggccctgccaccccacctcttcctcgctggtgtctgccttcctggcacca atcctggccctggagtttgtcctgggcctggtggggaacagtttggccctcttcatcttc tgcatccacacgcggccctggacctccaacacggtgttcctggtcagcctggtggccgct gacttcctcctgatcagcaacctgcccctccgcgtggactactacctcctccatgagacc tggcgctttggggctgctgcctgcaaagtcaacctcttcatgctgtccaccaaccgcacg gccagcgttgtcttcctcacagccatcgcactcaaccgctacctgaaggtggtgcagccc caccacgtgctgagccgtgcttccgtgggggcagctgcccgggtggccgggggactctgg gtgggcatcctgctcctcaacgggcacctgctcctgagcaccttctccggcccctcctgc ctcagctacagggtgggcacgaagccctcggcctcgctccgctggcaccaggcactgtac ctgctggagttcttcctgccactggcgctcatcctctttgctattgtgagcattgggctc accatccggaaccgtggtctgggcgggcaggcaggcccgcagagggccatgcgtgtgctg gccatggtggtggccgtctacaccatctgcttcttgcccagcatcatctttggcatggct tccatggtggctttctggctgtccgcctgccgatccctggacctctgcacacagctcttc catggctccctggccttcacctacctcaacagtgtcctggaccccgtgctctactgcttc tctagccccaacttcctccaccagagccgggccttgctgggcctcacgcggggccggcag ggcccagtgagcgacgagagctcctaccaaccctccaggcagtggcgctaccgggaggcc tctaggaaggcggaggccatagggaagctgaaagtgcagggcgaggtctctctggaaaag gaaggctcctcccagggctga >gi568815596r:42662911_42864179|GENSCAN_predicted_peptide_3|143_aa MLLNDQWVNEEIKKKIEKLLETNDNGNSISKPMGYSKSSTSREVYSCCVTPSSDEPFFIQ AFYKAKGALAPAAGLGRACAFAARAQARAFCDFAPSSRLFRSRSSRGSLRVSPSGMEGPP SLAVTGGPRGPCKERSAQPRASP >gi568815596r:42662911_42864179|GENSCAN_predicted_CDS_3|432_bp atgctcctgaatgaccagtgggtcaatgaagaaattaagaagaaaattgaaaaacttctt gaaacaaatgataatggaaacagtatatcaaagcctatgggatacagcaaaagtagtact agcagggaagtttatagctgctgtgtcacccctagctctgatgagcccttcttcatccaa gcattctacaaagccaagggtgcgctggcgcccgcggcgggcctcggcagagcatgcgcg ttcgcggcgcgagcccaggcccgagccttctgcgacttcgcgccgagctcccggctgttc cgcagccgatcgagccggggaagcctgcgagtgtcgccctcggggatggaggggccgcct agcctggccgtgacgggagggccacgggggccctgcaaagagcgcagcgcgcagccccgg gcctcgccgtag >gi568815596r:42662911_42864179|GENSCAN_predicted_peptide_4|147_aa MVNFNQIDTSVFLFYVSYTLTSYCLASNRNTKKYCNYGYGRFSSFFKYRTQLWFLPARPK HRERQEGAFATPVSLAPALTVDYYKDFCGPALSRRFWPCRRTSTTRWRCGSAIEGSSRSR RQAGPATGPASDSDFIRTLRGNVGAYP >gi568815596r:42662911_42864179|GENSCAN_predicted_CDS_4|444_bp atggtgaattttaaccaaattgatacctctgtattcttattttatgtttcctatactctt acatcatactgcttggcaagtaatagaaatacgaagaaatactgcaattatggttatggg agattttctagcttcttcaagtacaggacacaactatggttcctaccagcaaggcccaaa caccgggaacggcaagagggtgcatttgccactcccgtgtcgctagcaccggccttgaca gtggactattacaaggatttctgtgggcccgctctgagccgccgtttctggccgtgcaga cggacttctaccacacggtggcgctgcggctctgccatagaaggctcctctcggtccagg agacaggcagggccggccacaggaccagcctccgacagcgatttcatccgcaccctgcgg ggcaacgtgggggcctatccgtag >gi568815596r:42662911_42864179|GENSCAN_predicted_peptide_5|336_aa WVMHPHSSHATSNDIRGRRILPHTTELATCPASANGRDSIRFLGRMIEQGVTAQEVGGSK AHLVPKESSLGNDLNSRREGSLLGNAAPPTQHQTKPHEGAIQLADGPCGEMPAALCLRQA GHAHQALCMNHPSPPNRLLKPKPLKIAPSNSQNLTLPFSKLLQQPPYIHVANSSWRPLDE QRAQSRDEATVLGGWEGPWHPKGDEGGGPGLIHTTGAVSAEGNAPSDLSHSLVGHEELSL EKGYVHAVEFYATLKIRHFRSLLPMTPEMNWVKNGPLPYTLHVWERPEVEPAHAKAMLPP AGATKRHSKASYLFPHLVIQYAGWNKNIIPFYGRDN >gi568815596r:42662911_42864179|GENSCAN_predicted_CDS_5|1011_bp tgggtcatgcatccccactctagccatgcgacctccaatgacatccgtggaaggcgcatc cttccccacacgactgaactggccacatgcccagcctcggccaatggcagagactctatc cgcttcctaggacggatgattgagcagggcgtgaccgcccaggaggtgggaggcagcaag gcccacttggttccaaaggaaagttctctgggcaatgacctcaacagccgcagggaagga tccctcctgggcaatgctgcaccccccacccaacaccagacaaagccccatgaaggagct attcagcttgcagatgggccgtgtggagaaatgccagctgccctctgcctccgccaggca ggccatgcacaccaagccctctgcatgaaccatccaagcccccccaacaggcttctaaaa ccgaagcccctcaagattgcccccagcaactcccagaacctcactcttcccttctcaaag ctgctacaacaaccaccttacatccatgtggcaaactcatcctggaggccactggatgag caacgggctcagagcagagatgaagcaacggtcctgggagggtgggaagggccctggcat cccaaaggggatgagggaggaggaccggggctcatccacacaactggagctgtctccgca gaagggaatgcgcccagtgacctgagccactctctggtgggccacgaggagctctccttg gagaaaggttatgttcatgcggtggaattctacgcaacgttgaaaataaggcatttcaga tccctccttcccatgacccctgaaatgaattgggtgaagaatggtcccctgccctacacg ctgcatgtctgggagaggccagaggtggagccggcacatgcaaaggcaatgctgccacct gctggcgccacgaagagacacagcaaagcttcttacctgttccctcatttagtcatccag tatgccggctggaacaagaacatcatcccattttatggacgagacaactga