GENSCAN 1.0    Date run: 5-Nov-116    Time: 00:11:00

Sequence gi568815592r:34188756_34492362 : 303607 bp : 46.67% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1023   1093   71  2  2   80   92   113 0.473  11.42
 1.02 Intr +   1333   1569  237  1  0   24    8   270 0.652   9.33
 1.03 Intr +   1711   1957  247  1  1  -38   33   395 0.838  18.86
 1.04 Intr +   2075   2288  214  1  1  -38  -12   258 0.002   1.49
 1.05 Term +   7634   8052  419  0  2   12   48   251 0.800   8.94
 1.06 PlyA +   8624   8629    6                               1.05

 2.00 Prom +  14911  14950   40                              -2.86
 2.01 Init +  30686  30965  280  1  1   48   -9   211 0.458   4.57
 2.02 Intr +  34672  34709   38  2  2  142   95     2 0.606   4.28
 2.03 Intr +  35545  35641   97  0  1  100   26    61 0.134   0.68
 2.04 Intr +  43406  43539  134  0  2   96   76    48 0.934   4.76
 2.05 Intr +  44083  44138   56  0  2   82   74    39 0.721  -0.32
 2.06 Intr +  46070  46265  196  2  1   73   48   113 0.928   5.22
 2.07 Intr +  47349  47418   70  2  1   93   86    52 0.978   4.25
 2.08 Intr +  48127  48208   82  2  1   73   69    38 0.500  -0.90
 2.09 Intr +  48449  48562  114  2  0  105  101    46 0.499   7.16
 2.10 Intr +  52016  52127  112  2  1   81  117   104 0.500  12.98
 2.11 Intr +  53957  54040   84  1  0  127   80    46 0.995   7.72
 2.12 Intr +  55764  55860   97  2  1  138   72    31 0.965   6.08
 2.13 Term +  56224  56327  104  2  2   97   48    52 0.959   0.54
 2.14 PlyA +  57452  57457    6                               1.05

 3.07 PlyA -  57659  57654    6                               1.05
 3.06 Term -  58113  58048   66  0  0  103   43    93 0.987   4.24
 3.05 Intr -  58394  58289  106  1  1  117  105   119 0.999  16.72
 3.04 Intr -  58737  58712   26  0  2  114  101    -9 0.950  -0.18
 3.03 Intr -  59003  58926   78  2  0   90   79    98 0.721   8.85
 3.02 Intr -  60477  60224  254  1  2   60   94   127 0.903   7.65
 3.01 Init -  61883  61835   49  2  1   86   58    43 0.853   0.21
 3.00 Prom -  65390  65351   40                              -4.76

 4.03 PlyA -  66007  66002    6                               1.05
 4.02 Term -  74732  74550  183  2  0   54   55   381 0.991  28.94
 4.01 Init -  74918  74817  102  2  0   98   -5   254 0.463  17.34
 4.00 Prom -  75530  75491   40                              -6.16

 5.00 Prom +  79065  79104   40                              -4.56
 5.01 Init +  80539  80700  162  0  0   97   66   117 0.674   9.40
 5.02 Intr +  83994  84096  103  2  1  108   67    43 0.623   3.95
 5.03 Term +  95801  95877   77  0  2   88   44   122 0.857   5.80
 5.04 PlyA +  96978  96983    6                               1.05

 6.09 PlyA -  97328  97323    6                               1.05
 6.08 Term - 100176  99998  179  1  2  125   42    94 0.983   6.35
 6.07 Intr - 104780 104696   85  2  1  113   99    72 0.937  10.19
 6.06 Intr - 153217 153107  111  1  0   86   69    84 0.144   6.88
 6.05 Intr - 203668 203509  160  0  1    3   78   307 0.001  21.19
 6.04 Intr - 203928 203739  190  1  1   70  -78   184 0.001  -1.36
 6.03 Intr - 236085 235914  172  0  1   77   86   163 0.942  14.52
 6.02 Intr - 236539 236317  223  0  1   46   91   239 0.475  18.13
 6.01 Init - 237323 237277   47  2  2   40  105    62 0.368   2.98
 6.00 Prom - 241832 241793   40                              -2.36

 7.00 Prom + 244851 244890   40                              -4.66
 7.01 Init + 244902 244971   70  2  1   64   70   101 0.967   7.11
 7.02 Intr + 247266 247377  112  1  1   74   31    35 0.281  -4.16
 7.03 Intr + 249849 249920   72  0  0  107   67    72 0.343   5.52
 7.04 Intr + 250366 250461   96  1  0   69   69   118 0.286   7.12
 7.05 Intr + 259705 259743   39  1  0  125  103    31 0.772   5.64
 7.06 Term + 260461 260575  115  1  1   53   42    99 0.718  -0.16
 7.07 PlyA + 261234 261239    6                               1.05

 8.00 Prom + 261893 261932   40                              -6.56
 8.01 Init + 266991 266997    7  2  1  114  100     0 0.372   4.99
 8.02 Intr + 277369 277515  147  2  0  133   91    80 0.849  13.01
 8.03 Intr + 283960 284002   43  2  1  138  116    -5 0.498   4.60
 8.04 Intr + 297514 297649  136  1  1   78  111    40 0.033   5.87
 8.05 Term + 299443 299598  156  0  0   91   43    46 0.033  -1.67
 8.06 PlyA + 300125 300130    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 124706 124811  106  2  1   30   50   163 0.824   4.58
S.002 Init + 128727 128784   58  2  1   75   61    90 0.836   6.47
S.003 Init - 183349 183217  133  1  1   94   47    79 0.826   4.70
S.004 Sngl - 203889 203770  120  0  0  105   54   325 0.977  21.25

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:34188756_34492362|GENSCAN_predicted_peptide_1|395_aa
MSFTTRSTTFSTNYQSLDSMQPPRVRSLETENRRLESKIQEYLEKKGPQVRDWGHYFKTM
EDLRAQIFANSVNNASIILQIDNPHLAADETELAMCQSGERHLSGLTVEVDAPKSQDLAK
IMADIQAQYDKLSQKNREKLNQYCSHQTEESTTVVTTQSAKIRAAEMTLMKLRRTVQSLE
INLDSTRAEEQRQAQEYKALLNIQVKLEAEMATYHHLLEDGKDFNLGMPWTAAIPCKRPP
SNRPPSPRRQDSGWQSGAAKRATTARGRQAATRDSQGQAHAGHQPHYSSASQVGTHGARA
STLLTAASRIPLNSCPHGNPSLQGSRLPGIPVRPQRRPCGRSLGTAAAHQVCCTPSVRGP
SECTLRLLHCERERGERNMDPKTLALAELRLGPMP

>gi568815592r:34188756_34492362|GENSCAN_predicted_CDS_1|1188_bp
atgagcttcaccactcgctctaccaccttctccaccaactaccagtccctggactcaatg
cagccgcccagagtgaggagcctggagactgagaatcggaggctggagagcaaaatccag
gagtatctggagaagaagggaccccaggtcagagattgggggcattacttcaagaccatg
gaggacctgagggctcagatcttcgcaaattctgtgaacaatgccagcatcattctgcag
attgacaatccccatcttgccgctgatgagacagagctggccatgtgccagtctggagag
cgacatctctctgggttaaccgtggaggtagatgcccccaaatcacaggatcttgccaag
atcatggcagacatccaggcccaatatgacaagctgtctcagaagaatcgagagaagctg
aaccagtactgctcccaccagactgaggagagcaccacagtggtcaccacgcagtctgcc
aagatcagagctgctgagatgacgctcatgaagctgagacgtacagtccagtccttggag
atcaacctggactcaacccgggcagaggagcagcgccaggcccaggagtacaaggccctg
ctgaatatccaggtcaagctggaggccgagatggccacctaccaccacctgctggaagac
ggcaaggacttcaatcttgggatgccctggacagcagcaattccatgcaaacgaccacca
tccaacagaccaccctccccccgccgccaggatagtggatggcaaagtggcgcggcgaag
cgggccacgaccgcgagggggcgccaggcagccacgcgagactcccagggtcaggcccac
gcgggccaccagccgcactactcttccgcttcccaagtcggcacacacggcgcccgtgcc
tccacgctgctgactgctgcaagtcgaatccctctcaactcctgtccgcacgggaacccc
agtttacaggggagccggctgcctggcatcccggtgcggccacagaggcgtccttgtgga
aggagcctggggaccgctgcggcccatcaagtgtgctgcacgccctccgtgcgcggcccg
tcggagtgcaccttgcgcctcctgcattgtgaaagggagcggggagaacggaacatggac
ccaaagacactggcactggcagagctccgcctggggccgatgccttag

>gi568815592r:34188756_34492362|GENSCAN_predicted_peptide_2|487_aa
MGDVEKGKKIFIMKRSQCHIVEKGGKHKTGPNLHGLFGRKTGQAPGHSYIATIKNKDIIW
GEDTLMEYLENPKKYIPGTKMIFVGIKKEEAGHGRLHGCCSTESQLTAARTPPAGSRYAA
AARTPGSGLRVPVGLGERGWHYSMYHAGGIACVLVYPLVHPTQCKPLEDRNVATHFKATP
LVVISSGVGEELGFLGDLALKKGGHTSHTDGERCTRERGRTGPSPHLPSRQAASPVPGPS
SLRGLGEREVEGRAALGGSPSPDLGPKSQRKGPGPQVGLRISRTPSKSPQAPLSRCCAPL
IGTPSRGYFWRWRGSKKASAFATSGGRGGARPVLSAQHRRSRQPGARTAGRRPSSREGKM
SESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPKEPSEVPTPKRPRGRPKGSKNKGAAKT
RGVSHILKLIAALSSALLPSLGLGAKGAWLLALGPPSPPPLAATPIFHLCPHHHTTQHTS
RCRAPMG

>gi568815592r:34188756_34492362|GENSCAN_predicted_CDS_2|1464_bp
atgggtgatgttgagaaaggcaagaagatttttattatgaagcgttcccagtgccacatt
gttgaaaagggaggcaagcataagactgggccaaatctccatggtctcttcgggcggaag
acaggtcaggcccctggacactcttacatagccaccattaagaacaaagacatcatctgg
ggagaggatacactgatggagtatttggagaatcccaagaagtacatccctggaacaaaa
atgatctttgtcggcattaagaaggaagaggctgggcacggtcggctgcatgggtgctgc
tcaacagaaagccagctgacagcagcacgcacacccccggccggctcacggtacgcggcc
gctgcccgcactcccgggtcgggcctgcgcgtccctgtggggttgggcgagcggggctgg
cattacagcatgtaccacgctggtggaattgcctgtgtacttgtctatcccctcgtgcac
cccactcaatgcaagccccttgaggacaggaatgtagctacccatttcaaggccacaccc
ttggtggtgatttcttcaggggttggagaggagctgggcttccttggggatttagccctg
aagaagggcggccacacgtcccacacggacggggagagatgtacccgagagagagggcgg
acaggaccctctccgcacctccccagccgccaggccgccagcccggtgcccgggcctagc
agcctccgcggccttggagagcgcgaagtggaggggcgcgcggctctggggggcagcccg
agcccagacctgggtcccaagtcccaacggaagggcccaggtccccaagtgggcctgcgt
atctccagaacaccatctaagtcacctcaagctcccctgagccggtgctgcgctcctcta
attgggactccgagccggggctatttctggcgctggcgcggctccaagaaggcatccgca
tttgctaccagcggcggccgcggcggagccaggccggtcctcagcgcccagcaccgccgc
tcccggcaacccggagcgcgcaccgcaggccggcggccgagctcgcgagaagggaagatg
agtgagtcgagctcgaagtccagccagcccttggcctccaagcaggaaaaggacggcact
gagaagcggggccggggcaggccgcgcaagcagcctccgaaggagcccagcgaagtgcca
acacctaagagacctcggggccgaccaaagggaagcaaaaacaagggtgctgccaagacc
cggggagtcagtcacatcctgaagctcattgctgccctgagctctgccctcctgccctcc
ctgggcctgggggccaagggggcttggctcctggctctgggcccaccatcaccaccgcct
ctggccgccacccccatcttccacctgtgccctcaccaccacactacacagcacaccagc
cgctgcagggctcccatgggctga

>gi568815592r:34188756_34492362|GENSCAN_predicted_peptide_3|192_aa
MGFHLFGQAGLELLTSALGCPLPPPGGPCKPRCPRLRTAHPRLRIGGSRNLQATGFPLPD
WLRPGAEARPISPGPLITRLTAFPGPGSQPRAFLSGLQRREANSDSMVGYVLGPFFLITL
VGVVVAVVMYVQKKKRVDRLRHHLLPMYSYDPAEELHEAEQELLSDMGDPKVVHGWQSGY
QHKRMPLLDVKT

>gi568815592r:34188756_34492362|GENSCAN_predicted_CDS_3|579_bp
atggggtttcacctctttggccaggctggtctcgaactcctgacatcagccctcggctgc
cccctgccgccgccgggcggaccctgcaagccccgctgtccccgccttcgcaccgcccac
ccccgcctccggattggcggctccagaaatctccaggccaccggctttccgctaccggat
tggctgcgtccgggtgctgaggcccggcccatttccccgggtcctttgatcacgcgcctg
acggcttttccggggcccgggagccaaccgagggcgttcctgtcggggctgcagcggcgg
gaggccaacagcgactccatggtgggctatgtgttggggcccttcttcctcatcaccctg
gtcggggtggtggtggctgtggtaatgtatgtacagaagaaaaagcgggtggaccggctg
cgccatcacctgctccccatgtacagctatgacccagctgaggaactgcatgaggctgag
caggagctgctctctgacatgggagaccccaaggtggtacatggctggcagagtggctac
cagcacaagcggatgccactgctggatgtcaagacgtga

>gi568815592r:34188756_34492362|GENSCAN_predicted_peptide_4|94_aa
MAKIKARDLRGKKEELLKQLDDLKVELSQLRVAKTQKENLRKFYKGRNYKPLDLRPKKTR
AMRRRLNKHEENPKTKKQQRKERLYPLGKYAVKA

>gi568815592r:34188756_34492362|GENSCAN_predicted_CDS_4|285_bp
atggccaagatcaaggctcgagatcttcgcgggaagaaggaggagctgctgaaacagctg
gacgacctgaaggtggagctgtcccagctgcgcgtcgccaaaactcagaaagaaaacctc
aggaaattctacaagggcaggaactacaagcccctggacctgcggcctaagaagacacgc
gccatgcgccgccggctcaacaagcatgaggagaacccgaagaccaagaagcagcagcgg
aaggagcggctgtacccgctggggaagtacgcggtcaaggcctga

>gi568815592r:34188756_34492362|GENSCAN_predicted_peptide_5|113_aa
MPEPPTHSMGSCAARASPTSTTPCSTAPSPIDHPRAEECERTAPDWQAAPPAAPPPAGFN
CLCSEVWMEVPASQGRCEGLGPGPWAYADEDADSNKGIKKVLDENEKYVKDNM

>gi568815592r:34188756_34492362|GENSCAN_predicted_CDS_5|342_bp
atgcctgagcctcccacccactccatgggctcctgtgccgcccgagcctccccaacgagc
accaccccctgctccacggcgcccagtcccatcgaccacccaagggctgaggagtgcgag
cgcacggcgccggactggcaggcagctccacctgcagccccgccccctgcaggcttcaac
tgcctctgcagtgaggtgtggatggaagtacctgcctctcagggtagatgtgaaggtctg
ggtcctgggccctgggcctatgcagatgaggatgcggactccaacaaaggcattaagaaa
gtactagatgaaaatgagaaatatgtgaaggataacatgtga

>gi568815592r:34188756_34492362|GENSCAN_predicted_peptide_6|388_aa
MLLPFQPRYRTLQPQRIKNLITGPRPRYSEVLLTIVSFLQMLMPKKNRIAIYELLFKEGV
MVAKKDVHMPKHPELADKNVPNLHVMKAMQSLKSRGYVKEQFAWRHFYWYLTNEGIQYLR
DYLHLPPEIVPATLRRSRPETGRPRPKGRCSLTPLTHRAQDGGAVVVGGGGGRGGGGPGG
SGDGSRDGGSGGAGPGPGGMRRPRRASGGAVAGSPPLDSCDPHRTPTRARRMMKLKSNQT
RTYDGDGYKKRAACLCFRSESEEEVLLVSSSRHPDRWIVPGGGMEPEEEPSVAAVREVCE
ENQERKHRTYVYVLIVTEVLEDWEDSVNIGRKREWFKIEDAIKVLQYHKPVQASYFETLR
QGYSANNGTPVVATTYSVSAQSSMSGIR

>gi568815592r:34188756_34492362|GENSCAN_predicted_CDS_6|1167_bp
atgctccttcctttccagccccggtaccggaccctgcagccgcagaggatcaagaacttg
attaccgggccccggcctcgttactcagaagtgctcttgacaatcgtctccttcttgcag
atgttgatgcctaagaagaaccggattgccatttatgaactcctttttaaggagggagtc
atggtggccaagaaggatgtccacatgcctaagcacccggagctggcagacaagaatgtg
cccaaccttcatgtcatgaaggccatgcagtctctcaagtcccgaggctacgtgaaggaa
cagtttgcctggagacatttctactggtaccttaccaatgagggtatccagtatctccgt
gattaccttcatctgcccccggagattgtgcctgccaccctacgccgtagccgtccagag
actggcaggcctcggcctaaaggccggtgcagccttacgccgctgacgcatcgcgcccaa
gatggcggcgcggtcgtcgtcgggggtggcggcggcagagggggcggcggccctggcggc
agcggagacggcagccgtgacggtggcagcggcggcgcgggacctgggcctgggggaatg
aggcggccgcggcgggccagcggcggagccgtagcgggctccccacccctcgactcctgc
gacccgcaccgcacccccacccgggcccggaggatgatgaagctcaagtcgaaccagacc
cgcacctacgacggcgacggctacaagaagcgggccgcatgcctgtgtttccgcagcgag
agcgaggaggaggtgctactcgtgagcagtagtcgccatccagacagatggattgtccct
ggaggaggcatggagcccgaggaggagccaagtgtggcagcagttcgtgaagtctgtgag
gagaaccaggagaggaagcacaggacgtatgtctatgtgctcattgtcactgaagtgctg
gaagactgggaagattcagttaacattggaaggaagagggaatggtttaaaatagaagac
gccataaaagtgctgcagtatcacaaacccgtgcaggcatcatattttgaaacattgagg
caaggctactcagccaacaatggcaccccagtcgtggccaccacatactcggtttctgct
cagagctcgatgtcaggcatcagatga

>gi568815592r:34188756_34492362|GENSCAN_predicted_peptide_7|167_aa
MIHIAVEVVYTGGHHVAVNVYAQGSIPLNEYTKIVYAFFLDGHLSCYQFLAITNKAARSI
RSLDKALILSTNCQSEKSLNPPNERAVTLTTKIRGFIFEVSETKNPPEGANSGHTDLLSQ
GWQGIRPYRWSTGKADAIWRFHFLSHGTGSLQIGEQALSLTPAVFCA

>gi568815592r:34188756_34492362|GENSCAN_predicted_CDS_7|504_bp
atgatccacattgctgtggaggtggtctatactggaggccatcacgttgctgtcaatgtc
tatgctcaaggaagtattccactgaatgaatataccaaaatagtttatgcattcttcctc
gatggacatttgagttgttatcagtttttggctatcaccaataaagctgctagaagcatc
aggtctttagacaaagctttaattctttcaaccaactgccaatcagaaaaatctttgaat
ccacctaatgaaagagctgtaacactcaccaccaagatccgtggcttcatttttgaagtc
agcgagaccaagaacccaccagaaggagccaattctggacacactgacctgctctcccag
ggctggcagggcatacggccctatcggtggagcactggcaaggctgatgccatatggcgg
ttccacttcctcagccatggaactggatctttgcaaattggagagcaggcgctgagcctc
actcctgctgtcttctgtgcctga

>gi568815592r:34188756_34492362|GENSCAN_predicted_peptide_8|162_aa
MPTPARSQSDAVAAAAPAPSSPGKSAGLEAGPQTSRFAPSRRRWSPELLPPGLWCLKPGG
MGKGVRAQLSAEREGGGGVVKYLSPMWFSQPWGEVGAIIPALKMRKQEGGQTLTIALSHF
PSIPGKGAPRDVPGNYLAAKAIILPCPLPYKQCEACTEGPSG

>gi568815592r:34188756_34492362|GENSCAN_predicted_CDS_8|489_bp
atgcccaccccggcgagatcccagagcgacgcggtggcggcggcagcgccagccccctcc
tcccccgggaagtcggccgggcttgaggccgggccccagacgtcccgcttcgccccgagt
cgccgccgatggtccccggagctcctgcccccaggcctgtggtgcctgaaacctggcggg
atgggaaagggggtaagagcacagctgtcagctgagagggagggaggaggaggtgttgta
aaatatttgtctccaatgtggttctcacagccatggggtgaagtaggtgccatcatcccc
gctttgaagatgagaaaacaggaagggggccagaccctgaccattgccctttcccatttc
ccatcaataccaggcaagggagcaccaagagatgtcccagggaattacctggctgccaag
gctattattcttccctgccccttgccatacaaacagtgtgaggcttgtacagaaggccca
agtggatga