GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:14:42 Sequence gi568815591r:135262888_135538331 : 275444 bp : 40.71% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.12 PlyA - 10 5 6 -3.64 1.11 Term - 994 679 316 0 1 66 44 293 0.900 16.13 1.10 Intr - 3367 3279 89 1 2 38 91 101 0.851 3.25 1.09 Intr - 5834 5708 127 1 1 112 75 80 0.934 8.86 1.08 Intr - 7286 7132 155 2 2 99 74 69 0.663 4.55 1.07 Intr - 7433 7356 78 2 0 106 88 60 0.408 6.63 1.06 Intr - 7796 7684 113 0 2 -31 94 117 0.061 -0.42 1.05 Intr - 11769 11714 56 2 2 94 94 93 0.153 8.20 1.04 Intr - 14510 14335 176 2 2 88 103 161 0.051 15.42 1.03 Intr - 25510 25351 160 0 1 133 77 48 0.003 7.27 1.02 Intr - 44423 44267 157 0 1 103 96 43 0.739 4.75 1.01 Init - 48208 48055 154 1 1 64 90 68 0.571 4.79 1.00 Prom - 48467 48428 40 -8.85 2.00 Prom + 48477 48516 40 -6.25 2.01 Init + 50019 50149 131 2 2 72 105 48 0.773 4.57 2.02 Intr + 50827 51153 327 1 0 74 -45 248 0.558 3.99 2.03 Intr + 51486 51695 210 0 0 44 35 177 0.058 5.11 2.04 Intr + 55294 55412 119 1 2 28 94 52 0.039 -1.01 2.05 Term + 55846 55958 113 2 2 49 49 116 0.407 1.54 2.06 PlyA + 56197 56202 6 1.05 3.10 PlyA - 56818 56813 6 1.05 3.09 Term - 57585 57361 225 0 0 56 46 109 0.665 -0.70 3.08 Intr - 58132 57975 158 2 2 47 115 82 0.778 5.61 3.07 Intr - 59295 59200 96 1 0 13 85 122 0.276 3.46 3.06 Intr - 62950 62719 232 1 1 39 19 220 0.261 6.62 3.05 Intr - 63151 63089 63 1 0 97 73 55 0.364 2.90 3.04 Intr - 72560 72396 165 2 0 77 32 148 0.652 7.34 3.03 Intr - 79476 79335 142 2 1 97 49 36 0.121 0.03 3.02 Intr - 80986 80785 202 2 1 127 92 68 0.829 8.52 3.01 Init - 83589 83340 250 0 1 46 70 192 0.481 10.87 3.00 Prom - 86589 86550 40 -8.95 4.12 PlyA - 86621 86616 6 1.05 4.11 Term - 100299 99998 302 1 2 125 48 167 0.997 10.80 4.10 Intr - 101179 100967 213 2 0 43 93 115 0.088 5.26 4.09 Intr - 131528 131031 498 0 0 84 96 351 0.540 27.53 4.08 Intr - 132996 132747 250 0 1 91 76 130 0.994 8.19 4.07 Intr - 135339 135282 58 2 1 84 110 44 0.937 4.17 4.06 Intr - 147761 147628 134 2 2 70 102 76 0.853 5.72 4.05 Intr - 150726 150601 126 0 0 119 67 42 0.935 5.16 4.04 Intr - 151545 151444 102 0 0 49 85 98 0.969 5.05 4.03 Intr - 152375 152289 87 2 0 118 93 57 0.997 8.45 4.02 Intr - 159466 159269 198 1 0 52 107 175 0.968 14.43 4.01 Init - 175444 175271 174 1 0 65 94 205 0.996 18.19 4.00 Prom - 178090 178051 40 -4.75 5.00 Prom + 178905 178944 40 -9.05 5.01 Sngl + 181836 182045 210 2 0 67 43 213 0.883 9.67 5.02 PlyA + 182449 182454 6 1.05 6.03 PlyA - 182705 182700 6 1.05 6.02 Term - 195550 195444 107 2 2 46 49 150 0.919 4.49 6.01 Init - 197113 196969 145 1 1 44 35 147 0.625 5.33 6.00 Prom - 203852 203813 40 -5.75 7.00 Prom + 209984 210023 40 -4.25 7.01 Init + 246691 246911 221 0 2 73 33 157 0.911 6.85 7.02 Intr + 247031 247162 132 2 0 108 72 85 0.821 7.74 7.03 Term + 252453 252864 412 0 1 29 40 236 0.754 6.63 7.04 PlyA + 252948 252953 6 1.05 8.03 PlyA - 253850 253845 6 1.05 8.02 Term - 267461 267252 210 2 0 48 47 122 0.158 0.41 8.01 Init - 268965 268852 114 0 0 86 44 111 0.353 6.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 26523 26649 127 0 1 61 80 96 0.923 5.86 S.002 Init - 101150 100967 184 2 1 49 93 109 0.906 6.84 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:135262888_135538331|GENSCAN_predicted_peptide_1|526_aa MGKEAAHYDKHVVRRTLAVGNTGPCGKTTHFLILELAPSPLPQSCSSAEASALPHSPGGP HGGAIHPGQGPVPAAGPPDTELPHQHHFLCSSICMLLQVFLGVRLPIPQGGTFAFVVISL AMLSLPSWNCPEWTLSASQVNTNFPEFTEKWQKRIQEGAIMVTSCVRMLVGFSGLTGFLM GFICSLAVAPTNCLVALPLLDSAGNNAGIQWGISAMYCFVLRLRKDELWPFGSPRLRLPP SPPRDRRHVPTPVIGGMTLFGVITAVGISNLQYVEMNLSRSLFAFGFSIYCGLTIPNRGQ SQRSPRSPWMCRFCLHSRGSPAGPGCSDAADHGHVHQWISGFSSRQHHPRAPSIKPSNSL SCRVTLEMGGVTRWGGGAYDLALGYAMGAFLQHLHAVEDTRPLSSCLGDQIDRRGTAVLV LRTRTQSPPKDWRERSCSTVSVPPSTGTGWLPHMTGSVGVTRPAAQEPWARVGHWDRMSG NMNKLKHIPAQPAKLWAVTRSYLPDYKRRSVTLLGAHTLGLPKPEL >gi568815591r:135262888_135538331|GENSCAN_predicted_CDS_1|1581_bp atgggtaaggaagccgcccattatgataaacatgttgtgaggaggactttggcagttgga aacactggtccgtgtgggaaaactacccattttctgatcttggagttggccccatctcct ctcccacaaagctgctcctctgccgaggcatcggcacttcctcacagccctggggggcct catggcggtgccattcatcctggccaaggacctgtgcctgcagcaggaccccctgacaca gagctacctcatcagcaccattttctttgctccagcatctgcatgctcctgcaagtgttt ttaggggtcaggctgcccattccccagggaggtacgtttgcttttgtggtaatttctctg gccatgctctcccttccctcctggaattgccctgagtggacactcagtgccagccaggtg aacaccaactttccagaattcactgagaaatggcagaagaggatccaagagggtgctatc atggtcacttcctgtgtccggatgctggtgggcttctcaggcctgactggctttctcatg ggtttcatctgctccttggccgttgctccaactaactgcctagtggccctgcccctcttg gattctgcaggcaataatgccgggatccagtgggggatttctgccatgtattgcttcgtg ttgcgtcttcgcaaggatgagctctggccatttggttctccacggctgcgtttgccacca tccccaccccgtgatcggaggcatgtccccacccccgtgatcggaggcatgaccctgttt ggggtcatcactgccgtggggatctccaatctgcagtacgtggagatgaacttgtccagg agcctcttcgcctttggcttctccatctactgtgggctcaccattcccaaccgggggcag agccagcgcagcccccggtccccatggatgtgtcgcttttgtctccactccaggggttct ccagccggcccaggttgttcagatgctgctgaccatgggcatgttcatcagtggatttct gggttttcttctagacaacaccatccccgagctccttcaataaaaccttcaaatagcctt tcatgccgagtgaccttggaaatgggaggggtcaccaggtggggaggaggggcttacgac ctggctttgggatatgctatgggagcatttctccagcatctccatgctgtggaggatacc cgcccccttagtagctgtcttggtgatcagattgaccgtcgaggcactgcagtgcttgtc ctcagaacaagaacccagagcccaccaaaggactggcgtgaaaggagctgtagcactgta tccgtcccaccctccactggcaccggatggctgccccacatgacaggaagtgttggtgtg accagaccagcagcccaggagccatgggctagagtggggcattgggaccgaatgagcggt aacatgaacaagctgaaacacattcctgcccagcctgccaagctgtgggcagtgacacgc tcttatttgccagactacaagagaagatctgtgacccttctgggggcccataccttgggg ctccccaagccagagctgtga >gi568815591r:135262888_135538331|GENSCAN_predicted_peptide_2|299_aa MPPDIVKYPLGVKLPPIKNHYCESIGPLKEALTLGRCWVERMGRLLGKKGCIEAAWNTGC TWKKSAFPKVVKAEPKVDWRELWILSAYAEISVNVCRNPQGTVRKFTIEERGGGEGDGVI DRWILNRQVTEKQQQLGQCDSVTPQLSNHKHLTSVHISVDHRKVVETTSASALKQFSGKW ILPEGPHEKKLEHRTGRHLAFPENSWSFPCEESVTKKPTSQNCWNSVTPEMPNEMINREE DYALSWPQLLRLMGRAKASQGSDSCKDKNAAYLTRNLPTRKLLVDLKIFTLKQLCGISP >gi568815591r:135262888_135538331|GENSCAN_predicted_CDS_2|900_bp atgcctccagacattgtcaaatatcccctgggggtcaaactgccccccattaagaaccac tattgtgagagcattggtcccctcaaggaagccctaactttgggacggtgctgggtggaa aggatggggagacttctagggaaaaaaggatgtattgaggctgcttggaacacgggatgc acgtggaagaaatctgcttttcctaaagtggtcaaagccgagccaaaggtagactggagg gagctctggatcctgagtgcttatgctgagatttcagtgaatgtgtgtaggaatccacaa ggaacagtgagaaaattcacgattgaggaaagaggagggggagagggagatggagtgatt gaccgatggattctgaacaggcaagtcacagagaagcagcagcagttgggccaatgtgat tctgtcaccccccagctcagcaaccacaagcacctgacgtctgtccatatcagtgtggac catcggaaggtcgtggaaaccacttctgcatctgctttaaaacagttttctgggaaatgg atcctacctgagggtccacatgaaaaaaaattagaacatagaaccgggagacaccttgcc ttccctgagaacagctggtccttcccgtgtgaagagagtgtgaccaagaagcccaccagc cagaactgctggaacagcgtgacccctgagatgcctaatgagatgattaatagggaagaa gattacgccttgtcctggccacagctgctaaggcttatgggtagggcaaaggcctcacaa ggctcagatagctgcaaagataaaaacgctgcatacctcactcgcaatttgcccacgagg aaactccttgtggacctcaagatctttaccctgaaacagctctgtggaatttcaccctag >gi568815591r:135262888_135538331|GENSCAN_predicted_peptide_3|510_aa MELRSLREEKRSHKWDLGEHELVRLRRNRGKGGTGKKNGCMETLERNVMVNYVKRTGGHM GQRLKAIHRFGNLEVTDHLFQREGEDGFLDQPSGLKRYQWLGYARLTCDTLDVRKPGPHF CGTSTTCVTTEYLCCFALWVSFLIYKTRDLQLPLSVLAKCGRVWASHGQGVTLGCSLHLT LSWAFLGRLQTFLGWETQAQNRPVRAQCAWPLGLSMRPGHVHRTSSCKAAPWHLLLEEAQ GEEDGASWEDERENQDSGQEGGHRILAMQGDAEKADPETSSRDAVVAAPITQPIPMGPAA AFLAAMPTASTPHYEEVDLKEGATAPSEYDEGAGIVSPSKIWQGTQFGQGKIYSSRTDLA DQPEIGNSTPVGAVSWRMDSTKPDNFSGWVKAYPTKSKRATEAAKTLVRKIIPKFGLPCT TQSDSSPSLISEVIQKAFPRQPVDVGPAPSPGLATRPAQGIAVILFSGELHSGRSSCLPD TQQFPSYFWGERGALRMTVEDPHFFHGHTC >gi568815591r:135262888_135538331|GENSCAN_predicted_CDS_3|1533_bp atggaactgagatcacttagggaggaaaaaaggagccataagtgggatcttggagaacat gaactggtgagactgagaaggaacagagggaaaggagggaccgggaaaaaaaatggttgc atggaaactctcgagagaaatgtcatggtcaactatgtcaagcgcactggaggccacatg ggacaacgactgaaagcaattcatagatttggaaatttggaggtcactgatcaccttttc cagagggaaggtgaggatggttttcttgaccagccgtcaggacttaaacgttaccagtgg ttgggatacgcacggttgacatgtgacacactcgatgttaggaaacctggaccccacttc tgtgggacctctaccacctgtgtgaccactgaatacctatgttgctttgctttgtgggtc agtttcctcatctataaaacgagggatttgcagctgcctttgtcagtactggcgaagtgt ggaagggtgtgggcaagccacggccagggagtgaccctgggctgtagcctccatcttaca ctgagctgggcctttctgggaagactccagacctttttgggttgggaaacacaggctcag aacaggccagtgcgtgctcagtgtgcctggcccttggggctttccatgaggccagggcat gtacacagaacgagcagctgcaaagctgccccgtggcacctgctcctggaggaagcacag ggagaagaagatggagccagctgggaggatgagagagagaatcaggactcaggacaggaa gggggccataggatactagccatgcagggagatgcagagaaggcagacccagagacttct tccagggatgctgtggtagctgcacccataacacagcccattcccatggggccagcagct gctttcttagctgccatgccaactgcttccaccccacattatgaagaagtagatttgaaa gagggggccacagccccttctgagtatgatgagggagctggaatagtttccccctcaaag atctggcagggcacccagtttggacagggcaaaatctattccagcagaacagacctggca gaccagccagagatcgggaattctacaccagtgggagcagtttcatggaggatggacagc accaagccagacaacttttcaggatgggtaaaagcataccccaccaagtccaaaagagcc acagaagcagctaagacattagtgaggaaaatcattcccaagttcggactcccatgcact acacaaagtgacagcagtccttctctcatttcagaagttatccaaaaggccttcccacgg caaccagtggatgttggccctgctcccagtcctggcctggcaactcgcccagcccagggg attgcagtcatattgttttcaggggaactccatagcggcaggagctcctgccttcctgac acccagcagttcccatcttacttctggggtgaacgtggggccctaagaatgacggtggag gatccacactttttccatggccacacttgttga >gi568815591r:135262888_135538331|GENSCAN_predicted_peptide_4|713_aa MSRSPDAKEDPVECPLCMEPLEIDDINFFPCTCGYQICRFCWHRIRTDENGLCPACRKPY PEDPAVYKPLSQEELQRIKNEKKQKQNERKQKISENRKHLASVRVVQKNLVFVVGLSQRL ADPEVLKRPEYFGKFGKIHKVVINNSTSYAGSQGPSASAYVTYIRSEDALRAIQCVNNVV VDGRTLKASLGTTKYCSYFLKNMQCPKPDCMYLHELGDEAASFTKEEMQAGKHQEYEQKL LQELYKLNPNFLQLSTGSVDKNKNKVTPLQRYDTPIDKPSDSLSIGNGDNSQQISNSDTP SPPPGLSKSNPVIPISSSNHSARSPFEGAVTESQSLFSDNFRHPNPIPSGLPPFPSSPQT SSDWPTAPEPQSLFTSETIPVSSSTDWQAAFGFGSSKQPEDDLGFDPFDVTRKALADLIE KELSVQDQPSLSPTSLQNSSSHTTTAKGPGSGFLHPAAATNANSLNSTFSVLPQRFPQFQ QHRAVYNSFSFPGQAARYPWMAFPRNSIMHLNHTANPTSNSNFLDLNLPPQHNTGLGGIP VADNSSSVESLNMKEWQDGLRALLPNININFGGLPNSSSPSNANHSAPTSNTATTDSLSW DSPGSWTDPAIITGIPASSGNSLDSLQDDNPPHWLKSLQALTEMDGPSAAPSQTHHSAPF STQIPLHRASWNPYPPPSNPSSFHSPPPGFQTAFRPPSKTPTDLLQSSTLDRH >gi568815591r:135262888_135538331|GENSCAN_predicted_CDS_4|2142_bp atgtctcgcagtcctgatgcgaaggaagaccctgtggagtgccctctttgcatggagccc ttggagatagatgatatcaactttttcccttgcacctgtggctaccagatttgccgattt tgttggcatcgaattcgcactgatgaaaatgggctttgtcctgcatgtagaaagccatat ccagaagacccagcagtttataaaccactctcccaggaagagctgcaaaggataaagaat gagaaaaaacagaaacaaaatgagagaaaacagaaaatatcagaaaatcgcaaacatttg gctagtgtacgtgtcgtacaaaaaaacctcgtctttgttgtaggtttatctcagcgccta gcagacccagaggttttaaaacgaccagaatattttgggaagtttggtaaaatacataaa gttgtcatcaataatagcacatcatatgcaggctcacagggtccaagtgccagtgcttat gtaacctatatccggtcagaagacgctctcagagccatacagtgtgtcaacaatgtggta gtagatggcagaacacttaaggcatctctaggtacaacaaaatactgcagttacttctta aagaatatgcagtgtccaaaacctgactgcatgtatcttcatgaattgggggatgaggcg gccagcttcacaaaagaggaaatgcaggcgggtaaacaccaagaatatgaacagaagcta cttcaagaattatataaattaaatcccaattttcttcagctatctacgggttcagttgat aaaaataagaacaaagtgacaccactgcagaggtacgatacccccattgacaaaccttca gattctctcagtatagggaacggtgataattcccagcagatatctaacagtgatacgcct tcaccaccacctggtttgtcaaaatccaatccagtcatccccatcagttcatccaatcac agtgcacggtccccttttgaaggggcagtaacagagtcacagtcgttattctcagacaat tttcgccatcccaaccctatcccaagtgggcttcctcctttccccagctccccacagaca tccagtgactggcctacagcaccagaaccacagagcctcttcacatcagaaacaatccca gtatcatcctctacagactggcaagcagcttttggctttggttcttctaaacaaccagag gatgacttgggttttgatcccttcgatgtcactcgaaaagccttagcagacctgattgag aaggaactgtccgttcaagaccaaccttccctttcgcccacatctcttcagaactcctct tcacacactacaaccgccaaaggtccaggctctggattcctgcatcctgctgcagctaca aatgccaattctctcaatagtaccttttcagtcttgccccagaggttccctcaatttcag cagcaccgagcggtttataattcattcagttttccaggccaggcagcccgctatccttgg atggcctttccacgcaatagcatcatgcacttgaaccacacagcaaaccccacctcaaat agtaatttcttggacttgaatctcccgccacagcacaacacaggtctgggagggatccct gtagcagacaacagcagttctgtagagagtttaaatatgaaggaatggcaggacgggcta agggcacttctacccaacattaacatcaactttggtggactgcccaattcttcttccccc tccaacgccaaccacagtgcaccaacgtccaacactgccaccaccgacagcctgagttgg gacagccctggcagctggacagacccagccatcatcacaggtattccagcgtcttcagga aacagtttagactctcttcaagatgacaatcctccacactggctaaaatcccttcaggcc ctcacagagatggacggccccagcgctgctccatcacagacccaccacagcgcccccttc agcacacagatcccgctgcacagagccagttggaatccctaccctcctccttcaaaccct tccagcttccactccccacccccaggctttcagacagccttcagaccccccagcaaaacc cccacagatttactacagagttcaacactggaccgccattag >gi568815591r:135262888_135538331|GENSCAN_predicted_peptide_5|69_aa MGYSLAATLTLHGHWGLGQVVTDYVHGDASQKAAKAGLLALSALTFAGLCYFNYHDVGIC KAVAMLWKL >gi568815591r:135262888_135538331|GENSCAN_predicted_CDS_5|210_bp atgggctattccctggctgcgaccctcactcttcatggtcactggggccttggacaagtt gttactgactatgttcatggggatgcctcgcagaaagctgccaaggcagggcttttggca ctttcagctttaacctttgctgggctttgctatttcaactatcacgatgtgggcatctgc aaagctgttgccatgctgtggaagctctga >gi568815591r:135262888_135538331|GENSCAN_predicted_peptide_6|83_aa MFKEGSHFHNIKVLGETPSADVETVVSCPEDLAKIMDEDGYTKQIFNVEIATATPSFGND HPDQSAAINIKARPSTSKKTVTV >gi568815591r:135262888_135538331|GENSCAN_predicted_CDS_6|252_bp atgtttaaggaaggaagccatttccataacataaaagtgctaggtgaaacaccaagtgct gatgtagaaactgtagtcagttgtccagaagatctagctaagatcatggatgaagatggc tacactaaacagattttcaatgtagaaattgctacagctactccatcttttggcaacgac caccctgatcaatcagcagccatcaacatcaaggcaagaccctccaccagcaaaaagact gtgactgtctga >gi568815591r:135262888_135538331|GENSCAN_predicted_peptide_7|254_aa MQGEKLETKSRLGSPIKEILEGESLLAAALPSDPLPFPYGNCCGVKGRGRKRRRRKKTKE EEGEDEAGGGEKGGRLHGLPRPHKKPPNERRGKNSSEPGLIASASPTFTAERETLSFRSL SFVENGLEKGKNENGTTNEENISVIQMVVLNVVVAVKIGGKKGDEFKIYFFGVEIIGLGD VLHMKFSSMTLRFSGFNGLVDSGSFSKRRMANIEIASVESRATLSEKQQDFREIMASHLR LVNLKIIVIMSSPT >gi568815591r:135262888_135538331|GENSCAN_predicted_CDS_7|765_bp atgcagggggaaaagctggagaccaaaagccgcctcggctccccaattaaagaaattctc gaaggagagtcgctgcttgccgccgcgctgccttctgacccgctcccttttccttatggt aactgctgtggcgtaaaggggagagggaggaagaggaggaggaggaagaagaccaaggag gaggagggtgaagatgaggcggggggtggggagaaagggggccgactccacgggctgccg cgtcctcacaagaaaccaccgaacgagcgtcggggaaaaaactcttcagaaccgggcctg attgcttcagcgagtccgacctttacggctgagagagagactctcagctttcgatcactc agctttgtggagaatggattggagaaaggcaaaaatgaaaatggaacaactaatgaagag aatatttcagtcatccagatggtagttctgaatgtggtggtagcagtaaagataggggga aaaaaaggtgatgaattcaagatatatttttttggagttgaaataataggacttggtgat gtactgcacatgaagttctcatcaatgactctcagattttcgggcttcaatggccttgta gatagtggctcctttagtaaaaggagaatggcgaatattgaaatagcatctgtagaaagc agagcaactttatcagaaaaacagcaggatttcagggaaataatggcttctcatttgaga ctcgtgaatttaaaaatcatagtgattatgtcttccccaacttag >gi568815591r:135262888_135538331|GENSCAN_predicted_peptide_8|107_aa MAKRKVEKASEKELRISRDKKRNDTILKDGYIMMDLKRQMGAGWVQERGRAESQEETNEG IQTTLLSSNPTVQRELGGGVTKGLYLGNGRIPMVEGRGVILCDNSTV >gi568815591r:135262888_135538331|GENSCAN_predicted_CDS_8|324_bp atggcaaagagaaaggtggagaaagccagcgagaaggaactaaggataagtagggataaa aaaagaaatgacactattctgaaagatggctatatcatgatggacctaaaaaggcaaatg ggagctggttgggtgcaggaaaggggaagggcagagagccaagaggaaactaatgaaggt attcagacaacgctactgagttcaaatccaactgtccaaagggagcttggtggaggggtg acaaagggtctatatcttggaaatggtagaataccaatggtagaaggcagaggagtaatt ctttgtgacaatagcacagtgtga