GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:55:47 Sequence gi568815589f:122934567_123135613 : 201047 bp : 39.50% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 109 104 6 1.05 1.03 Term - 6369 6136 234 0 0 99 52 187 0.645 11.54 1.02 Intr - 7347 7136 212 1 2 45 69 140 0.300 5.51 1.01 Init - 16782 16728 55 0 1 68 100 4 0.177 1.10 1.00 Prom - 19563 19524 40 -3.85 2.00 Prom + 21253 21292 40 -8.05 2.01 Init + 22494 22643 150 2 0 44 93 143 0.981 10.49 2.02 Intr + 49919 50153 235 1 1 66 50 262 0.900 16.54 2.03 Intr + 51649 51853 205 2 1 94 78 166 0.998 13.54 2.04 Intr + 54731 54905 175 2 1 106 87 168 0.999 17.52 2.05 Intr + 55490 55647 158 1 2 85 63 131 0.628 8.19 2.06 Intr + 61475 61585 111 2 0 46 121 114 0.631 9.08 2.07 Intr + 61973 62039 67 2 1 83 91 68 0.971 4.59 2.08 Intr + 62693 62795 103 1 1 101 54 67 0.999 3.43 2.09 Intr + 64031 64200 170 0 2 103 84 113 0.988 11.14 2.10 Intr + 75788 75962 175 1 1 67 92 125 0.984 9.39 2.11 Intr + 80977 81070 94 2 1 87 115 74 0.999 8.10 2.12 Intr + 85743 85893 151 0 1 74 116 189 0.999 19.54 2.13 Intr + 99078 99176 99 2 0 100 92 59 0.777 6.89 2.14 Term + 100310 101050 741 1 0 4 45 443 0.088 24.07 2.15 PlyA + 101341 101346 6 1.05 3.00 Prom + 112853 112892 40 -6.65 3.01 Init + 120270 120297 28 2 1 69 88 43 0.302 2.21 3.02 Intr + 121962 122002 41 1 2 97 82 24 0.289 -0.28 3.03 Intr + 130782 130895 114 2 0 130 63 89 0.996 10.32 3.04 Intr + 135784 135858 75 0 0 99 84 66 0.985 6.09 3.05 Intr + 138986 139111 126 1 0 89 98 105 0.999 11.56 3.06 Intr + 142068 142196 129 2 0 61 93 113 0.989 9.07 3.07 Intr + 146474 146538 65 1 2 108 -7 93 0.111 -1.60 3.08 Intr + 146681 146802 122 2 2 34 100 82 0.080 3.22 3.09 Intr + 152036 152112 77 0 2 36 89 61 0.705 -0.68 3.10 Intr + 155192 155284 93 1 0 101 68 149 0.990 13.44 3.11 Intr + 155709 155819 111 2 0 91 111 195 0.997 21.76 3.12 Intr + 157318 157419 102 0 0 96 85 5 0.543 0.45 3.13 Intr + 163175 163279 105 1 0 50 110 219 0.997 19.79 3.14 Intr + 164149 164232 84 0 0 70 96 96 0.995 7.70 3.15 Intr + 164912 164983 72 1 0 49 94 102 0.893 5.58 3.16 Intr + 167000 167197 198 1 0 61 92 289 0.998 25.23 3.17 Term + 168525 168647 123 2 0 106 54 130 0.997 8.80 3.18 PlyA + 169852 169857 6 -0.45 4.00 Prom + 171704 171743 40 -2.05 4.01 Init + 172958 173112 155 1 2 90 -8 100 0.733 0.11 4.02 Intr + 173803 173889 87 1 0 109 7 132 0.904 5.37 4.03 Intr + 174256 174564 309 1 0 -9 64 267 0.182 9.10 4.04 Intr + 177832 178135 304 1 1 51 20 290 0.714 14.17 4.05 Intr + 178976 179165 190 1 1 65 110 45 0.609 2.64 4.06 Intr + 179770 180121 352 2 1 85 21 167 0.572 3.16 4.07 Intr + 180603 180730 128 0 2 98 63 60 0.463 3.90 4.08 Intr + 185014 185149 136 2 1 16 95 138 0.072 6.01 4.09 Intr + 186407 186664 258 2 0 33 87 119 0.527 1.96 4.10 Intr + 188507 188746 240 2 0 21 87 132 0.526 2.14 4.11 Intr + 193154 193223 70 2 1 106 55 36 0.304 0.37 4.12 Term + 193692 193814 123 2 0 49 42 152 0.445 4.10 4.13 PlyA + 196678 196683 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 23686 23742 57 0 0 83 54 77 0.915 0.51 S.002 Intr + 184991 185149 159 2 0 82 95 137 0.851 11.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:122934567_123135613|GENSCAN_predicted_peptide_1|166_aa MQWHGFANVQLAEAKPPPRHLFPKKNSRVTKGMQMRETQEQMINSLDAEEQRPKYYQAMD SELLEVLRLSFGSTRKTLRVRIIKRHSDEVCDHDERPYWPNTGKRGGASSSPSPTWAFSP PQDCAPARSALAVSWTWAVLCIWSWTANYVTAAGTGDDGSGLGSTC >gi568815589f:122934567_123135613|GENSCAN_predicted_CDS_1|501_bp atgcagtggcatggatttgctaatgtgcaattagcagaggcaaaaccacctcccagacat ttgtttccaaaaaagaattctcgagtaaccaaaggtatgcagatgcgtgaaacgcaggag cagatgataaacagtctggatgcagaagagcagagaccaaagtactaccaggcaatggac agcgaactcctggaagtcttgaggttatcctttggctcaacacgtaaaaccctaagggtg cggataatcaaaagacacagcgatgaggtctgcgaccatgacgagcgaccctattggccc aataccgggaagcgaggcggagcttcctcttctccgagccccacctgggctttcagcccg cctcaggactgtgcgcctgcgcgaagtgcactggctgtgagctggacctgggctgtcttg tgtatttggagttggacagccaactacgtgactgcagccgggactggggacgatggaagc ggactgggtagtacctgttga >gi568815589f:122934567_123135613|GENSCAN_predicted_peptide_2|877_aa MDDKASVGKISVSSDSVSTLNSEDFVLVSRQGDETPSTNNGSDDEKTGLKIVGNGSEQQL QKELADVLMDPPMDDQPGEKELVKRSQLDGEGDGPLSNQLSASSTINPVPLVGLQKPEMS LPVKPGQGDSEASSPFTPVADEDSVVFSKLTYLGCASVNAPRSEVEALRMMSILRSQCQI SLDVTLSVPNVSEGIVRLLDPQTNTEIANYPIYKILFCVRGHDGTPESDCFAFTESHYNA ELFRIHVFRCEIQEAVSRILYSFATAFRRSAKQTPLSATAAPQTPDSDIFTFSVSLEIKE DDGKGYFSAVPKDKDRQCFKLRQGIDKKIVIYVQQTTNKELAIERCFGLLLSPGKDVRNS DMHLLDLESMGKSSDGKSYVITGSWNPKSPHFQVVNEETPKDKVLFMTTAVDLVITEVQE PVRFLLETKVRVCSPNERLFWPFSKRSTTENFFLKLKQIKQRERKNNTDTLYEVVCLESE SERERRKTTASPSVRLPQSGSQSSVIPSPPEDDEEEDNDEPLLSGSGDVSKECAEKILET WGELLSKWHLNLNVRPKQLSSLVRNGVPEALRGEVWQLLAGCHNNDHLVEKYRILITKFC STRVEAHSCFARLSAKDVLWLLFGLQHAELQIFGFVVSVLKSVSMASLACISIDRYIAIT KPLTYNTLVTPWRLRLCIFLIWLYSTLVFLPSFFHWGKPGYHGDVFQWCAESWHTDSYFT LFIVMMLYAPAALIVCFTYFNIFRICQQHTKDISERQARFSSQSGETGEVQACPDKRYAM VLFRITSVFYILWLPYIIYFLLESSTGHSNRFASFLTTWLAISNSFCNCVIYSLSNSVFQ RGLKRLSGAMCTSCASQTTANDPYTVRSKGPLNGCHI >gi568815589f:122934567_123135613|GENSCAN_predicted_CDS_2|2634_bp atggatgacaaggcttctgttggaaaaatcagtgtctcttcagactcagtatctactctt aatagtgaagattttgtcttggtttccaggcaaggagatgagacaccatctacaaataat ggaagtgatgatgagaaaacaggactcaagattgtagggaatggaagtgaacagcagctg caaaaagagctagcagatgtactgatggatcctccaatggacgaccagccaggggaaaag gagcttgtgaaaaggtcacaactggatggtgaaggagatgggcctctttctaatcagctc tccgcttcatccaccattaaccctgtgccattagtagggctccaaaaaccagagatgagc ctaccagtgaaacctggacaaggagattctgaagcttcaagtcctttcacaccagtggcc gatgaggacagcgtagttttcagtaaactgacttacttaggctgtgcctcggtaaatgct cccaggagtgaagtggaagccttaaggatgatgtccatcttaagaagccagtgtcagatt tcactagatgttaccctttcagtgccgaatgtgtctgaaggaattgtgagactcttagat cctcagacaaacactgaaatagcaaactaccctatctacaaaatcctcttctgtgtcaga gggcatgatggaactcctgagagtgactgttttgctttcactgaaagtcattacaatgca gagctcttcagaatacacgtcttccggtgtgaaatacaagaagctgtaagccggatactt tacagttttgccactgccttccgccgttctgccaagcagaccccactttcagccactgct gcaccccagactcctgacagtgacatctttaccttctctgtgtctttagaaataaaagaa gatgatggtaaaggttattttagtgcagttcccaaagataaggacagacagtgctttaaa ctacgccaaggaattgataagaagattgtcatctatgtgcagcaaacaactaataaagaa cttgccattgaaaggtgttttggtcttctccttagtccaggaaaagatgtacgaaatagt gacatgcacttattagatttggaatctatgggcaaaagttcagatggaaagtcgtatgtt attacggggagctggaatccaaaatccccacattttcaagttgtaaatgaagaaactcct aaagataaagtcctgtttatgaccacagctgtagatttggtaataacagaagtacaggag cctgttcgatttctcctggagacaaaagtccgcgtttgctcacctaatgaaagattattc tggcccttcagcaaacgtagtactactgaaaatttctttttgaaactaaaacagataaag caaagggagagaaagaataatactgacactttatatgaagttgtatgcttggaaagtgaa tcagaaagagagaggaggaaaactacagccagtccttcagttcgcctgccacagtctgga tcgcaaagttcagtgataccttctcctccagaagatgatgaagaggaagataatgatgaa cctctcctgagtggatctggtgatgtatccaaagaatgtgcagaaaaaattcttgaaaca tggggagaactgttgtcaaaatggcatctcaacttgaatgtgagaccgaagcagttgtca tccttagtaagaaacggtgtccctgaagctcttcgaggagaagtctggcagctgctagca ggctgtcataacaatgaccacctggtagagaaataccgcattcttatcacaaagttctgc agtacgcgggttgaagcacattcctgctttgcaaggctttctgctaaggatgtattgtgg cttttgtttggattgcagcatgcagaattacagatatttggttttgtagtatcagttctg aagagcgtctccatggcttctctggcctgtatcagcattgatagatacattgccattact aaacctttaacctataatactctggttacaccctggagactacgcctgtgtattttcctg atttggctatactcgaccctggtcttcctgccttcctttttccactggggcaaacctgga tatcatggagatgtgtttcagtggtgtgcggagtcctggcacaccgactcctacttcacc ctgttcatcgtgatgatgttatatgccccagcagcccttattgtctgcttcacctatttc aacatcttccgcatctgccaacagcacacaaaggatatcagcgaaaggcaagcccgcttc agcagccagagtggggagactggggaagtgcaggcctgtcctgataagcgctatgccatg gtcctgtttcgaatcactagtgtattttacatcctctggttgccatatatcatctacttc ttgttggaaagctccactggccacagcaaccgcttcgcatccttcttgaccacctggctt gctattagtaacagtttctgcaactgtgtaatttatagtctctccaacagtgtattccaa agaggactaaagcgcctctcaggggctatgtgtacttcttgtgcaagtcagactacagcc aacgacccttacacagttagaagcaaaggccctcttaatggatgtcatatctga >gi568815589f:122934567_123135613|GENSCAN_predicted_peptide_3|554_aa MNFEEEEHREWDLAGSWDNSGGKESPQDSAITRDINRTFPAHDYFKDTGGDGQDSLYKIC KAYSVYDEEIGYCQGQSFLAAVLLLHMPEEQAFSVLVKIMFDYGLRELFKQNFEDLHCKF YQLERLMQTSKDDLLLTDFEGALKFFRVQLPKRYRSEENAKKLMELACNMKPTHHEDNQD EDLYDDPLPLSEYKPLVSKFWGSQKLHVDFQVRRASVPLTRVVQGLTVLPDNQATHAAKV SISREPMRSYGVFIDFVFVISQKKLKKYEKEYHTMREQQAQQEDPIERFERENRRLQEAN MRLEQENDDLAHELVTSKIALRKDLDNKHYMGHSLPVRVFLPSEWREAGVHGAGTTEAGP EAEEKADALNKELLMTKQKLIDAEEEKRRLEEESAQLKEMCRRELDKAESEIKKNSSIIG DYKQICSQLSERLEKQQTANKVEIEKIRQKVDDCERCREFFNKEGRVKGISSTKEVLDED TDEEKETLKNQLREMELELAQTKLQLVEAECKIQDLEHHLGLALNEVQAAKKTWFNRTLS SIKTATGVQGKETC >gi568815589f:122934567_123135613|GENSCAN_predicted_CDS_3|1665_bp atgaattttgaggaggaagaacacagagagtgggatttggcagggtcatgggacaatagt ggagggaaggagtctccccaggacagtgctatcacccgggatattaaccgaacattccca gcccatgactactttaaggacacaggaggagatggacaagattccttatataaaatatgc aaggcttattctgtgtatgatgaagagattggttattgccagggccagtcatttcttgct gctgtgctccttctccatatgcctgaagaacaggcattcagtgttctggtcaagatcatg tttgactatgggctcagggaacttttcaagcaaaacttcgaagatttgcattgcaaattt taccagttggagcgcctcatgcagacttcgaaagatgacctgctgttgacagactttgaa ggtgccttgaagttctttagggttcagcttcctaagagataccgctcagaagaaaatgca aaaaaactaatggaattagcctgcaacatgaagcctacccatcatgaggacaaccaggat gaagacctttatgatgatccacttccacttagtgaatataagccattagtaagtaaattc tgggggagtcagaagttacacgtggattttcaggtgcgcagggcgtcagtgcccctaact cgtgttgttcaagggttaactgtacttcctgataatcaagccacccatgctgcaaaagtc agcatcagccgtgaacccatgagaagctatggtgtttttattgattttgtgtttgtgatt agtcagaagaagttgaaaaaatacgagaaagaatatcacaccatgagggaacagcaggcc cagcaagaagaccccatcgagcgatttgagcgggagaataggcgtctacaagaagctaac atgaggttggaacaggaaaacgatgacttagcccatgagctggtgaccagcaagattgca ctacggaaggacctggataacaaacactatatgggtcattctctccctgttcgggtattt ctgcctagtgagtggagggaggcaggtgtacatggagcagggactactgaagcagggcca gaggctgaggaaaaggcagatgctctgaataaggagctgctgatgaccaaacagaagttg attgatgcagaagaagagaaaagacggctggaagaagagtctgctcagttaaaagaaatg tgccgtcgggaactcgacaaggcagaatctgagattaaaaaaaacagttctatcattggt gactataagcagatttgttctcagttgagtgaaagattggagaagcagcagacagccaat aaggtggaaattgagaaaattcggcaaaaagtggatgactgtgagcggtgccgggaattt ttcaacaaagaagggcgtgtaaaaggcataagctcaaccaaggaggttttagatgaggac acggatgaagagaaagagacgctcaagaaccagctgagagaaatggagctagaactggca cagaccaaactccagctggtggaggccgagtgtaagatacaggacttggaacaccattta gggcttgccctcaatgaggtgcaggcagccaagaagacgtggtttaaccgaacactgagc tccataaagacagcaaccggggttcaagggaaagagacttgctga >gi568815589f:122934567_123135613|GENSCAN_predicted_peptide_4|783_aa MQHRLSGPSLAGPAANQADFQGLDNSQWGGSGSDRMAGAISPKVSAPLLNSNHQNTPEFS DVIPKFESEGEKSACDHTIGEQQQQQQPLARFPTRTLPWATGPSSSSSFQVDWDYIVVLS SQLLGQLNCRSLAEKQAAGRRALALGSNCCRPLEARHHWAGSFNPEPRFPICIVCLGITD PANSPSRPFAADTVLLSTHGSASYSHLLFTSDWQQLTSLAPLRCSERQLPSTCLETPLRG KTMLSRKRYKHPNYTEHSSMQPPRLIHLSPLRLTSGPAQSSPPPGPPALLFQCPTCHSKP KFHMGRNYIVLCIEPVLTRRRCSGSLCGEVDGSRQWMGRKFPVMIKPKELGKECPRLPMG YRLCSTDAESNKLQPLLRLKMNTNGLTFCQAKGLSMNNHPVPVTPGWRAQPGGNSERVKM LIGHKPLPTANGDARVLQHNGVWPTKAPFPSPVEMALAPFYLLCTLKALHTPQVGTLSLC MHARPASDSPPGDPVWLSGKPIALGKLLTWQQLTPISTKEPTQLHPEQAMHLPLQHLLAE PCPPHVHPHLGQPWIFVIGPIPTTRRLSELKCFHWSQPLFTCTQKRTWYLIGTGLFQQDW IALSTSQLYLIQPFIDAKATLNVHTDSAWKKTRVVPATLKPRHRNCHFKPDNLMVLEVRE EILRAQVNNKWCDAQRLAIGAVNEKTTYSGEKSREAKLHVNLRQFGEAKSFIYISLLFQL ANDTHLLQPQSKVMEKNEGYKFEPSAKKKKADQSKTQGNHHWPNSQAPPAQHPGPGGLLP DVG >gi568815589f:122934567_123135613|GENSCAN_predicted_CDS_4|2352_bp atgcagcacaggctgagtggaccttctttggcaggccctgctgcaaaccaagctgatttc cagggccttgataattctcagtggggaggcagcggctcagaccgaatggcaggagctatc agccccaaagtgtctgcacctcttctgaattccaatcatcagaacacccctgagttcagt gatgttattcccaaatttgaaagtgaaggtgaaaagtctgcctgtgaccacaccatcggg gagcagcagcagcagcagcagcccctggcccgattccccaccaggacacttccctgggct accggaccttcttcatcctcgtccttccaagtggactgggactacattgtggtcttgagc tctcagttactagggcagctcaactgcagaagccttgcagaaaagcaggcagcaggaaga agagccttagctttgggttccaattgctgccgccctctagaagctcgacaccattgggca gggtcctttaaccctgagcctcggtttccgatctgtattgtctgtctgggtattactgac cccgcgaacagccctagccgccccttcgcagcggacactgtgctcctatcaacgcacggc agcgccagctattcccaccttctgttcacgtcagactggcagcagctcacgtccctggca cctctccgatgctctgaacggcagctcccttcaacgtgcctagaaacccctctgagagga aagaccatgctcagcagaaagcgctacaaacaccccaactacactgagcattcctcaatg cagccaccacgtctgattcacctcagtcccctacgcctgacgtcgggcccggcccagagc agtccacccccagggccacctgctctcctcttccaatgtcccacgtgtcactcaaaacct aagttccatatgggcaggaactacatcgtcctgtgcattgagccagtgctcacacggagg agatgctcaggaagcctttgtggggaggtggatggatccaggcagtggatgggaaggaag tttccagtcatgattaagccaaaggaacttgggaaagaatgtccacgtcttcccatgggc tacagactgtgttccacagatgctgaatccaacaagctccaacccttgcttaggttaaaa atgaacacaaatggattaactttttgccaagcaaaaggcctcagcatgaacaaccacccg gtacctgtgactcctggctggagagcccagcccggcggaaactcagaaagggtgaagatg ctgataggtcacaagcccctccccactgccaatggagatgccagagtgctccagcacaat ggagtgtggcccaccaaagccccatttcctagcccggtggagatggccctcgctccattc tacctcctctgtaccctcaaggcccttcacactccccaagtgggcacactctctctgtgc atgcatgccaggcccgcctctgactcccctcctggggatcccgtctggctcagtgggaag cctatcgctcttggtaaattactcacctggcagcagctgacgcctatcagcaccaaggaa ccgacacagctccaccctgagcaggccatgcacctgcctctccaacacctgttggcagag ccttgccctcctcacgttcatcctcacctcggccaaccatggatatttgtaataggccca attccaacaacaaggagactgtcagaattaaaatgcttccactggtcacagcctctgttc acttgtacccagaagaggacctggtacctaataggcactgggctgtttcagcaggactgg atcgcactcagtactagtcagctatacctcatacaaccttttatagatgccaaagccaca ttaaatgtacatacagattctgcatggaaaaaaacaagagttgttccagcaactttaaaa ccaagacatagaaattgccatttcaagccagacaacctgatggttctagaagtcagagaa gaaatcttgagggcccaagtgaataataaatggtgcgacgctcagaggctggcaataggg gctgtcaatgaaaaaaccacctacagtggagaaaagagcagagaagcaaaacttcatgtt aatctcaggcaatttggtgaagccaagagcttcatttatatttctctgctctttcagctg gccaatgatacccatttacttcaaccccaaagcaaagttatggaaaagaatgaaggctac aaatttgaacctagtgcaaagaagaagaaggcagatcagtccaagacccagggcaatcat cactggcccaattcacaagcaccacctgctcagcaccctgggcccggaggacttcttcca gatgttggatga