GENSCAN 1.0    Date run: 7-Nov-116    Time: 20:45:51

Sequence gi568815583r:60397458_60631853 : 234396 bp : 39.75% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   6977   7336  360  0  0   58   96   400 0.281  32.07
 1.02 Term +  30996  31147  152  1  2   40   39   126 0.010  -0.01
 1.03 PlyA +  31679  31684    6                                1.05

 2.05 PlyA -  31779  31774    6                                1.05
 2.04 Term -  35608  35475  134  2  2   83   39    76 0.552  -0.53
 2.03 Intr -  45088  44959  130  1  1   32  111    69 0.689   2.95
 2.02 Intr -  50688  50513  176  1  2   11   89    99 0.660   1.04
 2.01 Init -  52384  51391  994  1  1   43  105   813 0.801  72.03
 2.00 Prom -  54870  54831   40                               -7.05

 3.22 PlyA -  55098  55093    6                                1.05
 3.21 Term -  57705  57538  168  0  0  100   28   131 0.615   5.20
 3.20 Intr -  57985  57869  117  1  0   80   92    60 0.916   5.34
 3.19 Intr -  59337  59200  138  0  0   77   87    16 0.431   0.14
 3.18 Intr -  69256  69137  120  1  0   28   83    94 0.844   2.67
 3.17 Intr -  70865  70604  262  1  1   63   76   207 0.870  13.57
 3.16 Intr -  78710  78606  105  1  0   70   90    53 0.773   2.11
 3.15 Intr -  78955  78814  142  2  1   77   78    35 0.147  -0.11
 3.14 Intr -  84887  84838   50  1  2   57   85    81 0.015   2.11
 3.13 Intr - 100162 100002  161  1  2  110   34   133 0.086   7.96
 3.12 Intr - 102547 102435  113  2  2   28  113    84 0.990   4.08
 3.11 Intr - 103612 103502  111  2  0   94  110    54 0.995   7.63
 3.10 Intr - 105410 105303  108  0  0   79   82   117 0.992   9.54
 3.09 Intr - 106210 106078  133  1  1   76   91    87 0.867   7.10
 3.08 Intr - 108172 108051  122  2  2   98   86    91 0.997   9.19
 3.07 Intr - 114164 113769  396  0  0   68   63   613 0.946  50.43
 3.06 Intr - 117300 117159  142  0  1   59   89    91 0.790   5.31
 3.05 Intr - 134394 134309   86  1  2   99  121    76 0.980  10.62
 3.04 Intr - 136329 136280   50  2  2   92   86    18 0.688  -0.59
 3.03 Intr - 139838 139726  113  2  2   82   67    75 0.782   3.06
 3.02 Intr - 141043 140905  139  0  1   57   94    32 0.402   0.25
 3.01 Init - 143332 143214  119  1  2   59  107    42 0.517   2.92
 3.00 Prom - 146824 146785   40                               -3.85

 4.03 PlyA - 150530 150525    6                                1.05
 4.02 Term - 185922 185747  176  1  2   31   33   207 0.211   6.44
 4.01 Init - 195089 195005   85  2  1   74  -13   233 0.276  10.83
 4.00 Prom - 210290 210251   40                               -5.55

 5.04 PlyA - 210873 210868    6                                1.05
 5.03 Term - 217879 217548  332  2  2   61   53   212 0.866   8.83
 5.02 Intr - 219432 219299  134  2  2   32   80    54 0.397  -1.63
 5.01 Init - 230884 230691  194  1  2   69   98   199 0.813  17.49

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 100162  99998  165  1  0  110   36   133 0.910   7.23
S.002 Term - 198525 198380  146  1  2  101   42    84 0.934   2.19
S.003 Intr - 198897 198835   63  1  0  103   72    56 0.855   3.27

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:60397458_60631853|GENSCAN_predicted_peptide_1|170_aa
XPAPDATAAEMLMPKKNRIAIYELLFKEGAMVAKKDVYTPKQPELADKNVPNLHVMKAMQ
SLKSRGYMKEQFAWRHFYWYLTNEGIHHLRDYLHLPPEIVPATLCHSRPETGRPRPKGLE
VISVEKYKVFFHDGYDMVITGWDLMGPRELRDWKVHQEGRNVFYTNCMLS

>gi568815583r:60397458_60631853|GENSCAN_predicted_CDS_1|513_bp
nccccggccccagacgctacagccgccgaaatgttgatgcctaagaagaaccggattgcc
atttatgaactcctttttaaggagggagccatggtggccaagaaggatgtctacacgcct
aagcagccagagttggcagacaagaatgtgcccaaccttcatgtcatgaaggccatgcag
tctctcaagtcccgaggctacatgaaggaacagtttgcctggagacatttctactggtac
cttaccaatgagggtatccaccatctccgtgattaccttcatctgcccccggagattgtg
cctgccaccctatgccacagccgtccagagactggcaggcctcggcctaaaggtctggag
gtgatttcggtggaaaagtacaaggtattcttccatgatggatatgatatggtaataaca
ggctgggatctaatgggacccagggaactgagagactggaaggtacaccaggaaggccgc
aatgtgttttatacaaattgtatgctgtcctag

>gi568815583r:60397458_60631853|GENSCAN_predicted_peptide_2|477_aa
MSCEVNECRKIESLENLYLDFDDDVTELETFGVTTTKVSKSPSPASTSTVPNMTDAPTAP
KAGTTTVAPSAPDISANSRSLSQILMEQLQKEKQLVTGMDGGPEECKNKDDQGFESCEKV
SNSDKPLIQDSDLKTSDALQLENSQEIETSNKNDMTIDILHADGERPNVLENLDNSKEKT
VGSEAAKTEDTVLCSSDTDEECLIIDTECKNNSDGKTAVVGSNLSSRPASPNSSSGQASV
GNQTNTACSPEESCVLKKPIKRVYKKFDPVGEILKMQDELLKPISRKVPELPLMNLENSK
QPSVSEQLSGPSDSSSWPKSGWPSAFQKPKGRLPYELQDYVEDTSEYLAPQEGNFVYKLF
SLQDLLLLVRCSVQRIETRPRSKKRKKIRRQFPVYVLPKVEYQACYGVEALTESELCRLW
TESLLHSNSSFYVAILSRVSGFSVITGLTYLSIFLEAHSPPSLKDKLFGILALLKTS

>gi568815583r:60397458_60631853|GENSCAN_predicted_CDS_2|1434_bp
atgtcctgtgaagtcaacgagtgccgaaaaattgagagtcttgaaaacttgtatttggat
tttgatgatgatgtcacagaacttgaaacttttggagtaaccaccaccaaagtatcaaaa
tcaccaagtccagcaagtacttccacagtacctaacatgacagatgctcctacagccccc
aaagcaggaactacaactgtggcaccaagtgcaccagacatttctgctaattctagaagt
ttatctcagattctgatggaacaattgcaaaaggagaaacagctggtcactggtatggat
ggtggccctgaggaatgcaaaaataaagatgatcagggatttgaatcatgtgaaaaggta
tcaaattctgacaagcctttgatacaagatagtgacttgaaaacatctgatgccttacag
ttagaaaattctcaggaaattgaaacttctaataaaaatgatatgactatagatatatta
catgctgatggtgaaagacctaatgttctagaaaacctagacaactcaaaggaaaagact
gttggatcagaagcagcaaaaactgaagatacagttctctgcagcagtgatacagatgag
gagtgtttaatcattgatacagaatgtaaaaataatagtgatggaaagacagctgttgtg
ggttctaacttaagttccagaccagctagtccaaattcttcctcaggacaggcttctgta
ggaaaccagactaatactgcttgtagtcctgaagagtcatgtgttttaaaaaaacctatc
aaacgagtatataaaaaatttgatccagttggagagattttaaaaatgcaggatgagctc
ttaaagccaatttccagaaaagtaccagaattgcccttaatgaatttagaaaattctaaa
cagccttctgtttctgagcaattgtctggtccttcagactcctctagttggccgaaatct
ggatggccttctgcatttcagaagccaaaaggacgattgccatatgaacttcaggactat
gttgaagatacatcggaatacctagctcctcaggaaggaaattttgtttataagttattt
agcctgcaagacctgttgttactcgtacgctgcagtgtccagaggatagagacaagacca
cgttctaaaaaacggaagaaaatcagaagacaatttccagtttatgtactaccaaaagta
gagtatcaagcttgttatggagttgaagctctgactgaaagtgaactttgtcgcttatgg
actgaaagtttattgcattccaacagctcattttatgttgctatcctgtcccgagtgagt
ggcttctctgtcatcacaggtttaacgtacctatcaatttttttggaagctcattcacca
ccatctctgaaagacaaactcttcggcatattagctttactaaagacctcttag

>gi568815583r:60397458_60631853|GENSCAN_predicted_peptide_3|964_aa
MNPVYLVGEYFLSDIYSMSSSRSSECHRSSWKEMMGSFACWIWVIGHSIAIFAMNDVASF
LKLPIYIGFSSSKSFVSTHLPPPAWELGRTGQSTTCMTIFRSPETRGPQNSRSLVLLKSL
QGIRWFKDFMVPNYYIPRNQAQIEIIPCKICGDKSSGIHYGVITCEGCKGFFRRSQQSNA
TYSCPRQKNCLIDRTSRNRCQHCRLQKCLAVGMSRDAVKFGRMSKKQRDSLYAEVQKHRM
QQQQRDHQQQPGEAEPLTPTYNISANGLTELHDDLSNYIDGHTPEGSKADSAVSSFYLDI
QPSPDQSGLDINGIKPEPICDYTPASGFFPYCSFTNGETSPTVSMAELEHLAQNISKSHL
ETCQYLREELQQITWQTFLQEEIENYQNKQREVMWQLCAIKITEAIQYVVEFAKRIDGFM
ELCQNDQIVLLKAGSLEVVFIRMCRAFDSQNNTVYFDGKYASPDVFKSLGCEDFISFVFE
FGKSLCSMHLTEDEIALFSAFVLMSADRSWLQEKVKIEKLQQKIQLALQHVLQKNHREDG
ILTKLICKVSTLRALCGRHTEKLMAFKAIYPDIVRLHFPPLYKELFTSEFEPAMQIDGSG
GLEVQIQLPANLVPENVELIIGAGEFHNGPGFKAIQEVTLLSPQIWYMKLNIVRRDCLGM
GGDISPKNGLKTFFSRENYKDHSMAPSLKELRVLSNRRIGENLNASASSVENEPAVSSAT
QAKEKVKTTIGMVLLPKPRVPYPRFSRFSQREQRSYVDLLVKYAKIPANSKAVGINKNDY
LQYLDMKKHVNEEVTEFLKFLQNSAKKCAQDYNMLSDDARLFTEKILRACIEQVKKYSEF
YTLHEVTSLMGFFPFRVEMGLKLEKTLLALGSVKYVKTVFPSMPIKLQLSKDDIATIETS
EQTAEAMHYDISKDPNAEKLVSRYHPQIALTSQSLFTLLNNHGPTYKEQWEIPVCIQVIP
VAGL

>gi568815583r:60397458_60631853|GENSCAN_predicted_CDS_3|2895_bp
atgaatccagtttatttagttggggaatatttcttgagtgacatctattcaatgagcagt
agcagatcttcagaatgccatcggagctcatggaaagaaatgatgggttcctttgcctgc
tggatatgggtaataggacacagcatcgcaatctttgccatgaatgatgtggcaagcttc
ctgaagcttcccatttatataggtttttcatcttcaaagagctttgtgagcactcattta
ccaccacctgcctgggagctaggccgcactgggcagagcacaacctgcatgaccatcttt
agaagccctgaaacaagggggccacaaaattctagaagcctggtgctcctgaagtccctc
cagggaataaggtggttcaaggacttcatggtgcctaattattacattcctaggaatcag
gctcaaattgaaattattccatgcaagatctgtggagacaaatcatcaggaatccattat
ggtgtcattacatgtgaaggctgcaagggctttttcaggagaagtcagcaaagcaatgcc
acctactcctgtcctcgtcagaagaactgtttgattgatcgaaccagtagaaaccgctgc
caacactgtcgattacagaaatgccttgccgtagggatgtctcgagatgctgtaaaattt
ggccgaatgtcaaaaaagcagagagacagcttgtatgcagaagtacagaaacaccggatg
cagcagcagcagcgcgaccaccagcagcagcctggagaggctgagccgctgacgcccacc
tacaacatctcggccaacgggctgacggaacttcacgacgacctcagtaactacattgac
gggcacacccctgaggggagtaaggcagactccgccgtcagcagcttctacctggacata
cagccttccccagaccagtcaggtcttgatatcaatggaatcaaaccagaaccaatatgt
gactacacaccagcatcaggcttctttccctactgttcgttcaccaacggcgagacttcc
ccaactgtgtccatggcagaattagaacaccttgcacagaatatatctaaatcgcatctg
gaaacctgccaatacttgagagaagagctccagcagataacgtggcagacctttttacag
gaagaaattgagaactatcaaaacaagcagcgggaggtgatgtggcaattgtgtgccatc
aaaattacagaagctatacagtatgtggtggagtttgccaaacgcattgatggatttatg
gaactgtgtcaaaatgatcaaattgtgcttctaaaagcaggttctctagaggtggtgttt
atcagaatgtgccgtgcctttgactctcagaacaacaccgtgtactttgatgggaagtat
gccagccccgacgtcttcaaatccttaggttgtgaagactttattagctttgtgtttgaa
tttggaaagagtttatgttctatgcacctgactgaagatgaaattgcattattttctgca
tttgtactgatgtcagcagatcgctcatggctgcaagaaaaggtaaaaattgaaaaactg
caacagaaaattcagctagctcttcaacacgtcctacagaagaatcaccgagaagatgga
atactaacaaagttaatatgcaaggtgtctaccttaagagccttatgtggacgacataca
gaaaagctaatggcatttaaagcaatatacccagacattgtgcgacttcattttcctcca
ttatacaaggagttgttcacttcagaatttgagccagcaatgcaaattgatggttctgga
ggcttggaagtccagatccagctgccagccaatttggttcctgaaaatgtagagctgatt
attggagctggagaattccataatgggcctgggtttaaagctatccaagaagtaacactt
ctaagcccacagatctggtatatgaaattaaacatagtaagaagagattgtctggggatg
gggggggatatttcccccaaaaatggccttaagacatttttctctcgagaaaattataaa
gatcattccatggctccaagtttaaaagaactacgtgttttatccaacagacgtatagga
gaaaatttgaatgcctcagcaagttctgtagaaaatgagccggcagttagttcagcaact
caagcaaaggaaaaagttaaaaccacaattggaatggttcttcttccaaaaccaagagtt
ccttatcctcgtttctctcgtttctcacagagagagcagaggagttatgtggacttgttg
gttaaatacgcaaagattcctgcaaattccaaagctgttggaataaataaaaatgactac
ttgcagtacttggatatgaaaaaacatgtgaacgaagaagttactgagttcctaaagttt
ttgcagaattctgcaaagaaatgtgcgcaggattataatatgctttctgatgatgcccgt
ctcttcacagagaaaattttaagagcttgcattgaacaagtgaaaaagtattcagaattc
tatactctccacgaggtcaccagcttaatgggattcttcccattcagagtagagatggga
ttaaagttagaaaaaactcttctcgcattgggcagtgtaaaatatgtgaaaacagtattt
ccctcaatgcctataaagttgcagctgtcaaaggacgatatagctaccattgaaacgtca
gaacaaacagctgaagctatgcattatgatattagtaaagatccaaatgcagagaagctt
gtttccagatatcaccctcagatagctctaactagtcagtcattatttaccttattaaat
aatcatggaccaacgtacaaggaacagtgggaaattccagtgtgtattcaagtaatacct
gttgcaggtttgtaa

>gi568815583r:60397458_60631853|GENSCAN_predicted_peptide_4|86_aa
MALGGRAAAAAAAVARAAAARRRQRGLRGCFIQRQSKGFSALEADQCFLFSTWPFSGTEH
LVCICQCPGSGGDIQTGETKCCLRAA

>gi568815583r:60397458_60631853|GENSCAN_predicted_CDS_4|261_bp
atggctctcggcgggcgggcagctgcggcagctgctgctgtggctcgggcggcggcggcg
cggcggcggcagagggggctccggggttgctttattcagcgacaaagtaaaggtttcagt
gccctggaagctgatcagtgcttcctgtttagcacatggcctttctcaggcactgagcat
ctggtgtgtatctgccagtgtccaggctctggaggtgacatccaaacaggggagaccaaa
tgctgccttcgtgcagcttag

>gi568815583r:60397458_60631853|GENSCAN_predicted_peptide_5|219_aa
MRKTSHKPIKGYSTTLLTSTPQNWQDHQKQGKFEKLSQTRGGKRDMTAKHNVRSDGILQQ
EEDTRGRREAGLRTRGVLRTRGFPTAFPLTQQTLQLVELNLDVPQPQNADKIGKPSGLRS
TRSQGSVEKRSHPVPLYLYPLTPSVVSMGYYRIQNTGIRAISSAKCTSTKMRKHERALGL
GWLWIVAADPPPGSRKQKGGCVWVDAVLFSLTALQPVAP

>gi568815583r:60397458_60631853|GENSCAN_predicted_CDS_5|660_bp
atgagaaaaacatcacacaaacccattaagggatattctacaacactgctgaccagtact
cctcaaaactggcaagatcatcaaaaacaaggaaagttcgagaaactgtcacagaccaga
ggaggcaaaagagacatgacagctaaacataatgtgagatcggatgggatcctgcaacag
gaagaggacaccagaggtagacgtgaggctggtctgagaacaaggggggttttgaggacc
agggggtttcctacagcctttcccctgactcagcaaaccttgcagctagtcgagctgaac
ctggatgtacctcaaccacagaacgcagataagataggcaaaccttcaggacttagaagc
actcgatctcaaggctccgtggagaagagaagccacccagtgcccctttatctatatccc
ctgaccccaagtgtggtcagcatgggctattacaggattcaaaatacaggaatcagagcc
atcagctctgccaagtgcactagcactaagatgaggaaacacgagagagcattaggactt
gggtggctgtggatagtggcagctgatccaccaccaggttctaggaagcagaaagggggc
tgtgtgtgggttgacgctgttctgttttccctgacagctcttcaacctgtagctccctga