GENSCAN 1.0	Date run: 4-Nov-116	Time: 16:57:05

Sequence gi568815591f:154970900_155184484 : 213585 bp : 44.17% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    510    689  180  2  0   81   74    96 0.702   6.73
 1.02 Term +   1983   2078   96  2  0   71   54    77 0.593   0.57
 1.03 PlyA +   2815   2820    6                               1.05

 2.06 PlyA -   4404   4399    6                               1.05
 2.05 Term -   4701   4693    9  0  0   77   32     0 0.427  -8.51
 2.04 Intr -   5432   4797  636  2  0   97  103   453 0.807  39.99
 2.03 Intr -  12433  12320  114  1  0   73   86    44 0.848   3.34
 2.02 Intr -  27885  27751  135  0  0  118   86    58 0.955   9.36
 2.01 Init -  32030  31950   81  2  0   85   86   136 0.983  14.07
 2.00 Prom -  35631  35592   40                              -5.76

 3.04 PlyA -  36094  36089    6                               1.05
 3.03 Term -  36731  36563  169  1  1   56   54   116 0.751   2.35
 3.02 Intr -  42401  42261  141  1  0   49  113    79 0.974   5.97
 3.01 Init -  45543  45509   35  0  2   87   81    58 0.961   4.34
 3.00 Prom -  50775  50736   40                              -2.86

 4.03 PlyA -  50965  50960    6                               1.05
 4.02 Term -  65163  65051  113  1  2  -63   37   386 0.717  17.12
 4.01 Init -  73638  73554   85  0  1   82   47    55 0.308   1.78
 4.00 Prom -  79403  79364   40                              -1.66

 5.00 Prom +  79429  79468   40                              -4.76
 5.01 Init +  86338  86415   78  0  0   55   37   124 0.396   5.06
 5.02 Intr +  90374  90407   34  1  1  107   97    44 0.904   5.00
 5.03 Intr +  98907  98941   35  1  2   98   86    10 0.018  -0.26
 5.04 Intr + 100058 100741  684  1  0   37   90  1104 0.021  97.26
 5.05 Term + 104299 104415  117  0  0   40   41   137 0.846   2.74
 5.06 PlyA + 104951 104956    6                              -0.45

 6.00 Prom + 105162 105201   40                              -6.46
 6.01 Init + 106283 106335   53  1  2   78   91     1 0.029   0.04
 6.02 Intr + 111110 111635  526  2  1  -25   18   731 0.017  47.75
 6.03 Term + 113256 113588  333  2  0  127   55   478 0.532  43.01
 6.04 PlyA + 114776 114781    6                               1.05

 7.05 PlyA - 114971 114966    6                               1.05
 7.04 Term - 119762 119745   18  2  0  109   42    17 0.614  -2.48
 7.03 Intr - 120384 120213  172  2  1  101   87    70 0.930   8.15
 7.02 Intr - 123136 123006  131  1  2    5  116    50 0.380  -1.01
 7.01 Init - 124376 124320   57  2  0   86  105    23 0.426   5.11
 7.00 Prom - 126818 126779   40                              -7.16

 8.00 Prom + 127169 127208   40                              -6.56
 8.01 Init + 127897 128025  129  0  0   79   69    69 0.778   4.25
 8.02 Intr + 128095 128204  110  0  2   23   81    45 0.794  -3.62
 8.03 Intr + 128462 128618  157  2  1  -23   29   565 0.749  39.51
 8.04 Term + 129667 129882  216  0  0   79   54   129 0.931   5.74
 8.05 PlyA + 133416 133421    6                               1.05

 9.00 Prom + 135220 135259   40                              -4.86
 9.01 Init + 135576 135747  172  2  1   73   10   132 0.391   3.53
 9.02 Term + 139904 140118  215  0  2   64   42   102 0.128   0.59
 9.03 PlyA + 144001 144006    6                               1.05

10.00 Prom + 144997 145036   40                              -4.56
10.01 Init + 147870 148021  152  2  2   69    9   217 0.494  11.42
10.02 Term + 149030 149219  190  2  1   13   55   127 0.527  -1.18
10.03 PlyA + 151472 151477    6                               1.05

11.00 Prom + 152491 152530   40                              -6.56
11.01 Init + 154664 154727   64  1  1   82  109     5 0.501   3.31
11.02 Term + 158489 158595  107  0  2   67   48   125 0.347   4.97
11.03 PlyA + 158715 158720    6                               1.05

12.09 PlyA - 158774 158769    6                               1.05
12.08 Term - 160053 159910  144  0  0   48   47   126 0.108   2.41
12.07 Intr - 183756 183665   92  1  2  123   74    26 0.338   4.41
12.06 Intr - 183872 183837   36  0  0  113   96    20 0.536   3.53
12.05 Intr - 185613 185562   52  0  1  107   80    29 0.117   2.48
12.04 Intr - 195213 195097  117  0  0   77   30    68 0.200   0.56
12.03 Intr - 200616 200414  203  1  2   42   75   109 0.015   4.00
12.02 Intr - 201078 200907  172  0  1   60   87    44 0.012   1.02
12.01 Init - 211135 211082   54  1  0   51   65    62 0.033   1.49

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 100001 100741  741  1  0   80   90  1083 0.966 102.44
S.002 Sngl - 111545 111105  441  2  0   90   49   664 0.956  58.95
S.003 Term + 202171 202275  105  0  0  105   28    92 0.860   3.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:154970900_155184484|GENSCAN_predicted_peptide_1|91_aa
MAVGLEPSPLEEARGAGLDGEEKSARPAIEMPPKTPTTFIRSLFQPSDLPLAGNMLVLEN
PVILYGLYFLPVMTSSGSQDDCVIVTLLLCG

>gi568815591f:154970900_155184484|GENSCAN_predicted_CDS_1|276_bp
atggccgttggcctggaaccttctccacttgaggaagcgagaggtgcaggcctggatgga
gaggagaaatctgcaaggccagccatcgagatgcctcccaaaacccctaccacattcatc
cgatcactttttcaaccatctgatctgcctctggctggaaacatgttagtcctggagaac
cctgtcattctttatggcctttactttctacctgtcatgacatcctctggctctcaggac
gactgtgtgatcgtcactttgctgctctgtggctga

>gi568815591f:154970900_155184484|GENSCAN_predicted_peptide_2|324_aa
MSDQAPKVPEEMFREVKYYAVGDIDPQVIQLLKAGKAKEVSYNALASHIISEDGDNPEVG
EAREVFDLPVVKVSSEDRSALWALVTFYGGDCQLTLNKKCTHLIVPEPKGEKYECALKRA
SIKIVTPDWVLDCVSEKTKKDEAFYHPRLIIYEEEEEEEEEEEEVENEEQDSQNEGSTDE
KSSPASSQEGSPSGDQQFSPKSNTEKSKGELMFDDSSDSSPEKQERNLNWTPAEVPQLAA
AKRRLPQGKEPGLINLCANVPPVPGNILPPEVRGNLMAAGQNLQSSERSEMIATWSPAVR
TLRNITNNADIQQMNRPSNVAHTK

>gi568815591f:154970900_155184484|GENSCAN_predicted_CDS_2|975_bp
atgtcggaccaggcgcccaaagttcctgaggagatgttcagggaggtcaagtattacgcg
gtgggcgacatcgacccgcaggttattcagcttctcaaggctggaaaagcgaaggaagtt
tcctacaatgcactagcctcacacataatctcagaggatggggacaatccagaggtggga
gaagctcgggaagtctttgacttacctgttgtaaaggtgtcatctgaagacagaagtgcc
ctgtgggctttggttacgttctatgggggagattgccagctaaccctcaataagaaatgc
acgcatttgattgttccagagccaaagggggagaaatacgaatgtgctttaaagcgagca
agtattaaaattgtgactcctgactgggttctggattgcgtatcagagaaaaccaaaaag
gacgaagcattttatcatcctcgtctgattatttatgaagaggaagaagaggaagaggaa
gaggaggaggaagtagaaaatgaggaacaagattctcagaatgagggtagtacagatgag
aagtcaagccctgccagctctcaagaagggtctccttcaggtgaccagcagttttcacct
aaatccaacactgaaaaatctaaaggggaattaatgtttgatgattcttcagattcatca
ccggaaaaacaggagagaaatttaaactggaccccggccgaagtcccacagttagctgca
gcaaaacgcaggctgcctcagggaaaggagcctgggttgattaacttgtgtgccaatgtc
ccacccgtcccaggtaacattttgccccctgaggtccggggtaatttaatggctgctgga
caaaacctccaaagttctgaaagatcagaaatgatagctacctggagtccagctgtacgg
acactgaggaatattactaataatgctgacattcagcagatgaaccggccatcaaatgta
gcacataccaaataa

>gi568815591f:154970900_155184484|GENSCAN_predicted_peptide_3|114_aa
MGLVCSGYGVQRLHATDKHVVHSAGWESVSEYGFPKASAINHLPAENTFDRHLVLLTGSF
PTLNSYMTLVAAVPRRSTDGIFPAEKQCSFEFAVSPSTKLWPVCACNWSILTLS

>gi568815591f:154970900_155184484|GENSCAN_predicted_CDS_3|345_bp
atgggcctagtgtgctcgggctacggtgtacaaaggctccatgcaactgacaagcacgtg
gtgcactctgcggggtgggagtcggtgagcgaatacggattcccaaaagccagcgccata
aatcatttgccagcagagaataccttcgatagacacttggtcttgcttacaggaagtttt
ccgacactgaatagctacatgacactggtggcagctgtgccccgcaggtccactgacggg
atctttcctgcagagaagcagtgctcctttgagttcgcagtttctccttcaactaaacta
tggcctgtttgtgcctgcaactggagtattctgaccctttcatga

>gi568815591f:154970900_155184484|GENSCAN_predicted_peptide_4|65_aa
MHLTSSESIQVCSKLNPCLLGGKRPRGRKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
EEEEI

>gi568815591f:154970900_155184484|GENSCAN_predicted_CDS_4|198_bp
atgcaccttaccagcagcgaatccatacaggtctgcagcaaactcaatccttgcctcctt
ggaggaaagcggccaagaggcagaaaagaagaggaagaggaagaagaagaagaggaagag
gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gaagaagaagaaatttaa

>gi568815591f:154970900_155184484|GENSCAN_predicted_peptide_5|315_aa
MLNQLQNIHISSLICEVKIGVEKKGQVIGVQVVFGYIRKQPLREATESWTNHSLGKDDLR
PSSPLLSVFGVLILTLLGFLVAATFAWNLLVLATILRVRTFHRVPHNLVASMAVSDVLVA
ALVMPLSLVHELSGRRWQLGRRLCQLWIACDVLCCTASIWNVTAIALDRYWSITRHMEYT
LRTRKCVSNVMIALTWALSAVISLAPLLFGWGETYSEGSEECQVSREPSYAVFSTVGAFY
LPLCVVLFVYWKIYKAAKFRVGSRKTNSVSPISEAVESVCVGPHSAKAAGEISVIIQIVT
TVFGPLNDEVSSVEM

>gi568815591f:154970900_155184484|GENSCAN_predicted_CDS_5|948_bp
atgcttaatcagctgcagaatatccacatcagctcactcatctgcgaagtaaaaattggt
gtggaaaagaagggccaagttattggggtgcaggtggtgtttggttacataagaaaacaa
cccttaagggaagcaactgagagctggaccaaccacagcctcggcaaagacgacctgcgc
cccagctcgcccctgctctcggtcttcggagtgcttattctcaccttgctgggctttctg
gtggcggcgacgttcgcctggaacctgctggtgctggcgaccatcctccgtgtacgcacc
ttccaccgcgtgccccacaacctggtggcatccatggccgtctcggatgtcctggtggcc
gcgctggtcatgccgctgagcctggtgcacgagctgtccgggcgccgctggcagctaggt
cggaggctgtgccagctttggatcgcgtgcgacgtgctttgctgcacggccagcatctgg
aacgtgacggccatagccctggaccgctactggtccatcacgcgccacatggaatacacg
ctccgcacccgcaagtgcgtctccaacgtcatgatcgcgctcacctgggcactctccgct
gtcatctctctggccccgctgctttttggctggggagagacgtactctgagggcagcgag
gagtgccaggtaagccgcgagccttcctacgccgtgttctccaccgtaggcgccttctac
ctgccgctctgtgtggtgctcttcgtgtactggaagatctacaaggctgccaagttccgc
gtgggctccaggaagaccaatagcgtctcacccatatccgaagctgtggagtctgtctgt
gttggtcctcattcagccaaagctgcaggagagatctccgtgattatccagattgtcacc
actgtttttggtcctcttaatgatgaagtttcatctgtggagatgtga

>gi568815591f:154970900_155184484|GENSCAN_predicted_peptide_6|303_aa
MNKGCSTFYLEADPTYCSVIDITSVIITTVNSIFSVTSISSIISIRGVSSGTSIPIVNSI
SSVTSIHSVNDIHTVNSIHGVTSVASIHSVNGIHGVTNVTSIHSVNSIHGVTSVTSIHSV
NGIHGVIRVTSIHSVNRIFSVTSISSVTSIPSVNSISSMIRIHGVSSITSISSVTSILYV
TSVISLHSVTMSQVKDSAKQPQMVFTVRHATVTFQPEGDTWREQKEQRAALMVGILIGVF
VLCWIPFFLTELISPLCSCDIPAIWKSIFLWLGYSNSFFNPLIYTAFNKNYNSAFKNFFS
RQH

>gi568815591f:154970900_155184484|GENSCAN_predicted_CDS_6|912_bp
atgaacaagggctgcagcacattttatttggaagcagatcctacttattgcagtgtcatt
gatatcaccagtgtgatcatcaccactgtgaacagtatcttcagtgtgaccagcatctcc
agtataatcagcatccgtggtgtctccagtgggaccagcatccccattgtgaacagcatc
tccagcgtgaccagcatccacagtgtgaacgacatccacactgtgaacagcatccatggt
gtcaccagtgtggccagcatccacagtgtgaatggcatccatggggtcaccaatgtgacc
agcatccacagtgtgaacagcatccatggtgttaccagtgttaccagcatccacagtgtg
aatggcatccatggtgtcatcagggtgaccagcattcacagtgtgaacagaatcttcagc
gtgaccagcatctccagtgtgaccagcatccccagtgtgaacagcatctctagtatgatc
agaatccatggtgtttccagtataaccagcatctccagtgttaccagcatcctttatgtg
accagtgtgataagcctccacagtgtgaccatgtctcaagtgaaggactctgccaaacag
ccccagatggtgttcacggtccgccacgccaccgtcaccttccagccagaaggggacacg
tggcgggagcagaaggagcagcgggccgccctcatggtgggcatcctcattggcgtgttc
gtgctctgctggatccccttctttctcaccgagctcatcagtcccctctgctcctgtgac
atccccgccatctggaaaagcatcttcctgtggcttggctactccaactccttctttaac
cccctgatctatacggctttcaacaagaactacaacagcgccttcaagaacttcttttct
aggcaacactga

>gi568815591f:154970900_155184484|GENSCAN_predicted_peptide_7|125_aa
MTASDSQDGEGRYPTRGSPLPVIGDFIGSRALLSKAGQGGTQLGIRSVLDSIYSSASTGK
DLRPRTHCTLTVLGQDRGTLGEVTVSPSVAKQEAGGSSVFVNQTTSSFIYVETDLSHVGK
LCYYY

>gi568815591f:154970900_155184484|GENSCAN_predicted_CDS_7|378_bp
atgacagcctctgactctcaggatggagaaggaagatacccaacccgtggctccccgctg
cccgtgataggggacttcattggctctcgggctcttctaagcaaggctggccagggtggc
acccagctggggattcgttcagtcttagattccatttattcttctgcttctactggaaaa
gatttaagaccaaggacccattgcaccttaacggtgctgggacaggacagaggcaccctc
ggtgaggtgacagtgagcccgtcggtggccaaacaggaagctgggggctcatctgtcttt
gtaaatcagaccacctcctctttcatctacgtggaaactgacttgtcgcatgttgggaag
ctttgttattactactga

>gi568815591f:154970900_155184484|GENSCAN_predicted_peptide_8|203_aa
MGRCPPQSEKPKKSSPPHSPQQPHGLASSLPELQVPMTVDTSQEGELKGPVGVPFVPIRC
TTLVASRLKSMVEIEVARPRKKKKKKKKKKKKKKKKKKKKKKKKKKKKEEEEEEEEEEKE
EEEEEEEKEEEELTWSANRGCPGIAMTLEAEDLTAGKTLGMLTVQTACCLHPSQWAVSPL
LNQVLRLHLPQRGVGLLWNLDAD

>gi568815591f:154970900_155184484|GENSCAN_predicted_CDS_8|612_bp
atggggcgatgcccaccccagagtgagaaaccaaagaaaagcagccccccccactcacca
cagcagccccacgggctggcatcttccctccctgagctccaggtccccatgactgttgac
acatcccaggaaggggagctaaagggacctgtgggtgtaccttttgtgccaattcgatgt
accactcttgttgcttctagacttaaatctatggtagaaattgaggtggcaaggccaagg
aagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag
aagaagaagaagaagaagaagaaggaggaggaggaggaggaagaagaggaggagaaggag
gaagaggaggaggaggaagaaaaggaggaggaggagctgacctggtcagcaaatagaggc
tgcccagggatagccatgaccttagaggctgaggacctcacagctgggaagaccctgggg
atgctgacggtgcagactgcctgctgcctgcatccctcgcagtgggctgtcagtcctctc
ttgaatcaggttctgcggctgcatctaccacagcgaggggttgggctgctatggaacttg
gatgctgactga

>gi568815591f:154970900_155184484|GENSCAN_predicted_peptide_9|128_aa
MKAMVISAASVQSSSRANIKATNAIQQQTVVLPASSLANTKLMPKTVHLANPNLLPQGSS
LGSSSMLPKHHQAPVVAPFPQSVSFLKDVAPALSISPACSEDFTSKYVNEALSPEGFGEG
IATNSDET

>gi568815591f:154970900_155184484|GENSCAN_predicted_CDS_9|387_bp
atgaaagccatggtcatatctgctgcctctgtccagagctccagcagagccaacattaaa
gccaccaatgccatccaacagcaaactgtcgtgctgccagcatccagcctggccaacacc
aaactcatgccaaagactgtgcaccttgccaaccctaaccttttgcctcagggctcctct
ctgggatcctcctccatgcttccgaaacatcatcaagcccctgtggtagcaccatttcct
cagtccgtgagctttctcaaagacgtagctcctgccctctccatctcaccggcatgcagt
gaagacttcacctccaaatatgtgaatgaggctctgagccctgaaggatttggagaaggc
atagcaacgaattctgatgaaacctaa

>gi568815591f:154970900_155184484|GENSCAN_predicted_peptide_10|113_aa
MDLLATDSHFSLQPQVIRIHNEAEDHEYEEFSVVAGMGKSSTVSSGCSTLCETTETSCRE
GLHKELTHPRMQLPHPDDFIPRTLTNQRTHFSNSSPSMIPLKPPSQNSSGRQI

>gi568815591f:154970900_155184484|GENSCAN_predicted_CDS_10|342_bp
atggacctgctggctacagacagtcatttctcactgcagccgcaggtcattcgaattcac
aatgaagctgaagaccatgaatatgaagagttcagtgtggttgctggcatgggcaaatcc
tcaactgtgtcctcgggctgcagtaccctctgtgaaactactgagaccagctgccgtgaa
ggactccacaaggagctgactcatccaagaatgcaacttccacatcccgatgacttcatc
ccccgtaccctgaccaatcaacgaacccacttttccaactcctcgccctccatgatccct
ttaaaaccccccagccagaactcctcagggagacaaatttga

>gi568815591f:154970900_155184484|GENSCAN_predicted_peptide_11|56_aa
MALGSGRAQPIKGSTHPGKETGAAHQALRRPQAPVRIRILQEDIEAAGSEAVRGVG

>gi568815591f:154970900_155184484|GENSCAN_predicted_CDS_11|171_bp
atggcccttgggtctggaagggctcaacccatcaagggctcaacccatcctggcaaggaa
acaggagcagcccatcaggctctaagaaggccccaggctcccgtgagaattcgaattctg
caggaggacatcgaggcagcaggcagtgaagcggtgaggggtgtgggctga

>gi568815591f:154970900_155184484|GENSCAN_predicted_peptide_12|289_aa
MVVLFNGVCSFGRDVRGVVSVESSFGCGHPCGFLRFLWDTPEGRRCPVKNICSPVLLCDS
AQTARESQRLDFQFPAQLNSDDRRGVVQNHKHFLSDPLWEDKLAKAAQGSPRLQVCVVSD
KAHITGKCLRNTLASHSKSDGKNCTGKIGSHVGLENECKVLPNGGDSSQQMPGEPEGGWS
GKECTSGLTIALADTDNHAEHAQHVPAAAFAGRVTPPGQLHPSLFQIPVVHIMQRLLKNS
PLEWGMGQAATHETGRDLAGSLQLTALAAASQKDEGFSPIGGTSGECGR

>gi568815591f:154970900_155184484|GENSCAN_predicted_CDS_12|870_bp
atggtcgtcctcttcaacggagtgtgcagctttgggagggatgtacgtggagtggtgagt
gttgaatcgtcttttggttgtgggcatccttgtgggtttctgcggtttctctgggacact
cctgaagggcgccggtgcccagtaaagaacatctgctcaccagtgctcttgtgtgacagt
gcccagactgccagagaatcacagagactggatttccagttcccagcacaactgaatagt
gatgaccgccgaggagttgtgcaaaaccataaacactttctttctgaccctttatgggag
gacaagcttgctaaagctgctcagggctcccccaggctccaagtttgtgtggtgtccgac
aaagcccacattacaggaaaatgcttgcgaaacacacttgcatcccacagcaaaagtgat
ggcaaaaattgtactggaaaaattggatcacacgtgggcttggagaatgagtgtaaggtt
ttaccgaatggtggagatagctctcagcagatgcctggggagccagaagggggatggagt
gggaaggagtgcacatcagggctgaccattgcattagctgacacagataatcatgcagaa
catgcccagcacgtccctgctgcagcctttgccggccgtgtgactcccccaggtcagctg
catccttccctcttccaaatccctgtggtgcacatcatgcagagacttctaaagaactca
ccattggaatggggcatgggtcaggcagcgacccatgagaccggaagagacttagccggg
agccttcagctgacggcactggcagcggccagccagaaggatgagggctttagtcctatt
ggtgggaccagcggggagtgtggacggtga