GENSCAN 1.0 Date run: 3-Nov-116 Time: 09:50:41 Sequence gi568815593f:10277796_10533681 : 255886 bp : 44.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 1506 1501 6 1.05 1.05 Term - 2837 2658 180 2 0 61 43 113 0.760 1.71 1.04 Intr - 4493 4402 92 0 2 99 105 55 0.994 8.01 1.03 Intr - 8701 8559 143 0 2 115 54 25 0.977 1.80 1.02 Intr - 10734 10627 108 2 0 71 95 66 0.962 5.00 1.01 Init - 12967 12753 215 1 2 65 60 180 0.984 11.12 1.00 Prom - 17830 17791 40 -3.46 2.00 Prom + 25479 25518 40 -3.36 2.01 Init + 41589 41767 179 2 2 62 77 94 0.203 4.43 2.02 Intr + 46297 46454 158 1 2 27 85 60 0.061 -0.75 2.03 Intr + 50563 50594 32 2 2 62 94 28 0.045 -1.35 2.04 Intr + 55515 55645 131 2 2 44 25 135 0.103 2.29 2.05 Intr + 62311 62436 126 1 0 67 53 106 0.063 4.79 2.06 Intr + 95785 95878 94 1 1 111 43 91 0.035 6.87 2.07 Intr + 97413 97790 378 2 0 -8 49 256 0.006 7.26 2.08 Intr + 100003 100099 97 0 1 59 115 32 0.193 2.58 2.09 Intr + 104005 104148 144 2 0 74 84 24 0.238 0.85 2.10 Intr + 109199 109271 73 0 1 83 92 60 0.819 4.36 2.11 Intr + 112537 112705 169 1 1 96 78 -9 0.714 -1.15 2.12 Intr + 113747 113936 190 1 1 60 57 258 0.796 19.06 2.13 Intr + 122989 123047 59 2 2 83 94 10 0.670 -0.30 2.14 Intr + 125612 125746 135 1 0 114 107 21 0.575 7.36 2.15 Intr + 129307 129407 101 0 2 101 111 -15 0.163 1.01 2.16 Intr + 133538 133742 205 2 1 53 73 75 0.129 1.70 2.17 Intr + 136638 136707 70 2 1 101 80 22 0.105 1.45 2.18 Intr + 137693 137874 182 0 2 72 80 58 0.104 2.99 2.19 Intr + 139475 139609 135 1 0 99 99 -11 0.093 1.86 2.20 Intr + 145940 146029 90 1 0 67 115 3 0.673 1.09 2.21 Intr + 148595 148727 133 1 1 93 84 93 0.998 9.62 2.22 Intr + 152098 152233 136 2 1 104 108 96 0.989 12.83 2.23 Term + 155799 155889 91 0 1 59 39 122 0.956 1.59 2.24 PlyA + 157012 157017 6 1.05 3.00 Prom + 160951 160990 40 -5.56 3.01 Init + 164373 164503 131 2 2 81 86 216 0.559 20.32 3.02 Intr + 170465 170588 124 2 1 52 80 55 0.734 1.69 3.03 Intr + 172157 172318 162 1 0 66 63 87 0.218 4.17 3.04 Intr + 183389 183564 176 1 2 55 92 183 0.325 14.14 3.05 Term + 202801 202987 187 1 1 67 54 182 0.785 9.66 3.06 PlyA + 207791 207796 6 1.05 4.00 Prom + 214488 214527 40 -5.86 4.01 Init + 218087 218099 13 1 1 114 96 1 0.714 4.07 4.02 Intr + 224914 225130 217 2 1 90 66 101 0.299 5.66 4.03 Term + 246996 247170 175 0 1 51 55 125 0.016 2.73 4.04 PlyA + 247497 247502 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 139478 139609 132 1 0 63 99 117 0.840 8.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:10277796_10533681|GENSCAN_predicted_peptide_1|245_aa MANEAYPCPCDIGHRLEYGGLGREVQVEHIKAYVTKSPVDAGKAVIVIQDIFGWQLPNTR YIADMISGNGYTTIVPDFFVGQEPWDPSGDWSIFPEWLKTRNAQKIDREISAILKYLKQQ CHAQKIGIVGFCWGGTAVHHLMMKYSEFRAGVSVYGIVKDSEDIYNLKNPTLFIFAENDV VIPLKDVSLLTQKLKEHCKVEYQIKTFSGQTHGFVHRKREDCSPADKPYIDEARRNLIEW LNKYM >gi568815593f:10277796_10533681|GENSCAN_predicted_CDS_1|738_bp atggctaacgaagcttatccttgtccgtgtgacattggccacagacttgagtatggaggg ctaggccgtgaagttcaagtcgagcacatcaaggcttatgtcaccaaatcccccgttgat gcaggcaaagctgtgattgtcattcaagatatatttggctggcagttgcccaataccaga tatatagctgacatgatctcaggaaatggatacacaaccattgttccagacttctttgta gggcaagagccttgggacccctctggcgactggtctatcttccctgagtggctgaaaaca agaaatgcccagaagatcgatagagagatcagtgctatcttgaagtatctgaaacaacag tgtcatgcccagaaaattggcatcgtgggattctgctggggtggaactgctgtccatcat ttgatgatgaaatactcagaattcagggcaggggtgtccgtctatggcattgtcaaggat tctgaagacatttacaatttaaagaaccccactttgttcatttttgctgaaaatgatgtt gtgattccactcaaggacgtatctttgctgactcagaagttgaaagaacactgcaaagtt gaatatcaaattaaaacattttctgggcagactcatgggttcgtgcatcggaagagagaa gattgctcacctgcagacaagccctacattgacgaggccagaaggaatttaattgagtgg ctgaacaagtacatgtag >gi568815593f:10277796_10533681|GENSCAN_predicted_peptide_2|1035_aa MAGLARSEGGVFRPLTSKDESEDMKILTVHPSEICQNDFVASSLLSNYSAASSPLKQGCR VPKLYHDIASSLCKEISLTELNIEAGFLSLCSKVAGFKEMDSGSNAYSNSSLGEPMKGVS SNQRQSTPAVLGAPEAKHIRTPNPLDRRGPAIWVRFPKVAADLRPRGLLWTLWFGFLDWL LQIVGHTSAVIYGLAAVCIFCLSMQQLGRQQCASRTVFAIPQAIVNGRPVTMFLEGSVPR LEFRVGMGLVGLHSEQPAGPAGPGSERLGTRASGCRGCTGSPSSAGPPALCSISCRALAA FPLGRARDLQPAMPEPPTHSMGSCAARASPTSTTLCSTAPSPIDHPRAEECQRMARDWQA APPAARTYVECVGQKEHLRNRFIILVYVLAVLSLSIKNVYSPDMPSRLPIQDIFAGLVTS IGTAIRYWFHYTLVAFAWLGVVPLTACRIYKCLFTGSVSSLLTLPLDMLSTENLLADCLQ GCFVVTCTLCAFISLVWLREQIVHGGAPIWLEHAAPPFNAAGHHQNEAPAGGNGAENVAA DQPANPPAENAVVGENPDAQDDQAEEEEEDNEEEDDAGVEDAADANNGAQAFCPYHIGHF SLVGLGFEEHEMFDATLKDRELSFQSAPGTTMFLHWLVGMVYVFYFASFILLLREIVFGS IVLLMLWLPIRIIKSVLPNFLPYNVMLYRDLHSYLLGDQEENENSANQQVNNNQHARNNN AIPVVGEGLHAAHQAILQQGGPVGFQPYRRPLNFPLRIFLLIVFMCITLLIASLICLTLP VFAGRWLMSFWTGTAKIHELYTAACGLYVCWLTIRAVTVMVAWMPQGRRVIFQKVKEWSL MIMKTLIVAVLLAGVVPLLLGLLFELVIVAPLRVPLDQTPLFYPWQDWALGVLHAKIIAA ITLMGPQWWLKTVIEQVYANGIRNIDLHYIVRKLAAPVISVLLLSLCVPYVIASGVVPLL GVTAEMQNLVHRRIYPFLLMVVVLMAILSFQVRQFKRLYEHIKNDKYLVGQRLVNYERKS GKQGSSPPPPQSSQE >gi568815593f:10277796_10533681|GENSCAN_predicted_CDS_2|3108_bp atggcagggttagcacgatccgagggtggagttttcaggcctctgacatcaaaagatgag tcagaggacatgaaaatcctcactgtgcatccttcagagatctgccagaacgacttcgtg gccagcagtctcttatctaactactccgcagctagtagtcctcttaagcaaggatgccgt gttcctaaactgtaccatgacattgcaagctcattgtgcaaagagatcagtcttactgag ttgaatatagaggcgggatttctcagtctctgcagcaaggtggctgggttcaaagaaatg gattctggatccaatgcttattcaaacagcagccttggagaacccatgaaaggagtatca tccaatcagcgccagagcaccccagccgtcctgggagcccccgaagccaagcatattcga actccgaatccgctcgatcgccggggacctgccatctgggttcggttccccaaggtcgct gccgaccttagaccgcggggattgctgtggacgttgtggtttggcttcctggactggctg ctgcagattgtgggtcacacttctgctgtcatctatgggctggctgctgtctgtatattc tgtctctctatgcagcagttgggcaggcagcaatgtgccagcaggactgtctttgccata ccccaggccatagtcaatggccgacccgtcacgatgttcctggagggcagcgtcccacgg ctggagttccgggtgggcatgggcttggtgggcctgcactcggagcagccggccggccct gccggcccaggcagtgagagacttggcacccgggccagcggctgcagagggtgtactggg tcccccagcagtgccggcccaccggcgctgtgctcgatttcttgccgggccttagctgcc ttcccgctgggcagggctcgggacctgcagcccgccatgcctgagcctcccacccactcc atgggctcctgtgcggcccgagcctccccgacgagcaccaccctctgctccacggcgccc agtcccatcgaccacccaagggctgaggagtgccagcgcatggcgcgggactggcaggca gctccacctgcagcccggacatatgtagagtgtgtcggtcagaaggaacacctgagaaac cgctttatcatccttgtgtatgtactggcagtattaagtttatccatcaagaatgtttat tctccagatatgccttcacggcttccaattcaagacatatttgctggactggttacaagt attggcactgcaatacgatattggtttcattatacacttgtggcctttgcatggttggga gttgttcctcttacagcatgccgcatctacaagtgcttgtttactggctccgtgagctca ctactgacgctgccattagatatgctgtcaacggaaaatttgttggcagattgtttgcag ggttgttttgtggtgacgtgcacactgtgtgcattcatcagcctggtgtggttgagagag cagatagtccatgggggagcaccaatttggttggagcatgctgccccaccgttcaatgct gcggggcatcaccaaaatgaggctccagcaggaggaaatggtgcagaaaatgttgctgct gatcagcctgctaacccaccagctgagaacgcagtggtgggggaaaaccctgatgcccag gatgaccaggcagaagaggaggaggaggacaatgaggaggaagatgacgctggtgtggag gatgcggcagatgctaataacggagcccaggcattttgcccttaccatattggtcatttc tcccttgttggtttgggatttgaagaacacgaaatgtttgatgctactctgaaagatcga gaactgagctttcagtcggctccaggtactaccatgtttctgcattggctagtgggaatg gtatatgtcttctactttgcctccttcattctactactgagagagattgtctttggctcc attgtcctcctgatgctttggcttcctatacgtataattaagagtgtgctgcctaatttt cttccatacaatgtcatgctctacagggatcttcattcttatttattgggagaccaggaa gaaaatgaaaacagtgcaaatcaacaagttaacaataatcagcatgctcgaaataacaac gctattcctgtggtgggagaaggccttcatgcagcccaccaagccatactccagcaggga gggcctgttggctttcagccttaccgccgacctttaaattttccactcaggatatttctg ttgattgtcttcatgtgtataacattactgattgccagcctcatctgccttactttacca gtatttgctggccgttggttaatgtcgttttggacggggactgccaaaatccatgagctc tacacagctgcttgtggtctctatgtttgctggctaaccataagggctgtgacggtgatg gtggcatggatgcctcagggacgcagagtgatcttccagaaggttaaagagtggtctctc atgatcatgaagactttgatagttgcggtgctgttggctggagttgtccctctccttctg gggctcctgtttgagctggtcattgtggctcccctgagggttcccttggatcagactcct cttttttatccatggcaggactgggcacttggagtcctgcatgccaaaatcattgcagct ataacattgatgggtcctcagtggtggttgaaaactgtaattgaacaggtttacgcaaat ggcatccggaacattgaccttcactatattgttcgtaaactggcagctcccgtgatctct gtgctgttgctttccctgtgtgtaccttatgtcatagcttctggtgttgttcctttacta ggtgttactgcggaaatgcaaaacttagtccatcggcggatttatccatttttactgatg gtcgtggtattgatggcaattttgtccttccaagtccgccagtttaagcgcctttatgaa catattaaaaatgacaagtaccttgtgggtcaacgactcgtgaactacgaacggaaatct ggcaaacaaggctcatctccaccacctccacagtcatcccaagaataa >gi568815593f:10277796_10533681|GENSCAN_predicted_peptide_3|259_aa MPLPDTMFCAQQIHIPPELPDILKQFTKAAIRTQPADVLRWSAGYFSALSRGDPLPVKDR MEMPTATQKTDTGLTQGLLKVLHKQCHHKRYVELTDLEQKWKNLCLPKEKFKALLQLDPC ENKIKWINFLALGCSMLGGSLNTALKHLCEILTDDPEGGPARIPFKTFSYVYRYLARLDS DVSPLETESYLASLKENMHASECGLMLSVSSSVSGSSECGPRTRSISIPWESLGNAESWD PAQTQSESDFQAARVLIDV >gi568815593f:10277796_10533681|GENSCAN_predicted_CDS_3|780_bp atgccgcttcccgacaccatgttctgcgctcagcagatccacattcccccggagctgccg gacatcctgaagcaattcaccaaggctgccatccgcacccagccggccgacgtgctgcgg tggtccgcgggctatttttcagctctgtcgagaggagatccacttcctgtaaaggacaga atggaaatgcccacggcaacccagaaaacagacacaggcctgactcaaggactcctgaaa gttttgcacaagcagtgtcaccacaagcggtatgtggaattaacagatcttgagcagaag tggaagaacttgtgcctgccgaaggaaaaattcaaagcgctcttacaactggatccttgt gaaaacaaaatcaagtggataaactttttagcgcttggatgcagcatgcttggtgggtcc ttgaacactgcgctgaagcacctgtgcgagatcctcacggacgatccggagggcgggccc gctcgcatccccttcaagacgttttcctacgtttaccgctacttggccagattagactca gatgtgtctcccttggagacggaatcctaccttgcctctctaaaggaaaatatgcatgcc agcgagtgtggccttatgttgagtgtgtcctccagtgtgagtggctcctccgagtgtggt ccaaggacccgcagcatcagcatcccctgggagagccttggaaatgcagaatcttgggat ccagctcagacccagtcagaatcagatttccaggcagctcgtgtgctcattgatgtctga >gi568815593f:10277796_10533681|GENSCAN_predicted_peptide_4|134_aa MPGQASALFTPDGRQRWSQESSAHDCAPGGSTSVCTQKAHRRDVVACKALLPARLHGRLH RLLGNDSRSKDEGYSARGLAIAKPPHVAFPEDSTITDLTVIGPIRRQLWTPDGPGVEPHC WWLSSLAHFLPQHL >gi568815593f:10277796_10533681|GENSCAN_predicted_CDS_4|405_bp atgcccggccaggcttccgccttattcacccctgatgggaggcagaggtggagccaggag agctctgctcatgactgtgctccaggaggctccacctccgtgtgcacccagaaggcccac aggagagatgttgtagcctgcaaagcactactgcctgcacgactgcacggccggctccac aggctcctcggaaacgacagcaggagcaaagatgagggctacagtgccagggggctggcc attgccaagcctccccacgtggcctttccagaagacagtaccatcactgacctcactgtc attggccccattcgccggcagctctggacaccagatgggcctggcgtggagcctcattgc tggtggttaagcagtttagctcacttcttacctcagcatctttga