GENSCAN 1.0    Date run: 8-Nov-116    Time: 09:30:49

Sequence gi568815575r:15727328_15952524 : 225197 bp : 40.86% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -    510    505    6                               1.05
 1.04 Term -  10680  10556  125  1  2   85   42   104 0.680   3.07
 1.03 Intr -  11420  11229  192  0  0   62   23   155 0.642   4.94
 1.02 Intr -  13313  13207  107  1  2    0   94   112 0.076   1.94
 1.01 Init -  17401  17181  221  1  2   68   26   179 0.030   7.85
 1.00 Prom -  20682  20643   40                              -4.95

 2.00 Prom +  21251  21290   40                              -9.05
 2.01 Init +  21539  21588   50  1  2   98   52    60 0.058   3.87
 2.02 Intr +  22743  22838   96  0  0   13   76   147 0.023   4.21
 2.03 Intr +  29026  29147  122  1  2    3   60   147 0.334   2.62
 2.04 Intr +  37251  37448  198  1  0   78   94   201 0.775  18.10
 2.05 Intr +  45169  45287  119  2  2   51   87   132 0.612   8.66
 2.06 Intr +  46975  47070   96  0  0   75   93    72 0.701   5.69
 2.07 Intr +  47919  47981   63  2  0  103   74    45 0.858   2.60
 2.08 Intr +  49387  49542  156  0  0   89   91   120 0.997  11.69
 2.09 Term +  55158  55337  180  2  0   94   41   192 0.990  11.73
 2.10 PlyA +  56306  56311    6                               1.05

 3.00 Prom +  60290  60329   40                              -3.75
 3.01 Init +  63169  63209   41  0  2  102  121    39 0.468   8.31
 3.02 Intr +  63607  63686   80  1  2   51   64   172 0.917   9.38
 3.03 Intr +  72545  72626   82  0  1   66   78   145 0.992   9.08
 3.04 Intr +  76361  76469  109  2  1   36   64   138 0.993   5.57
 3.05 Intr +  76784  76870   87  1  0   65   45   135 0.102   6.15
 3.06 Intr +  82874  82948   75  1  0   24   95    78 0.050   0.89
 3.07 Intr +  85754  85875  122  1  2   29   85    73 0.569  -0.53
 3.08 Intr +  88350  88563  214  0  1   73   86   178 0.601  13.90
 3.09 Term +  95404  96015  612  0  0   81   46   407 0.853  29.29
 3.10 PlyA +  96795  96800    6                               1.05

 4.08 PlyA -  96886  96881    6                               1.05
 4.07 Term - 104735 104698   38  0  2  107   38    41 0.356  -2.48
 4.06 Intr - 117921 117738  184  0  1   32   92   108 0.749   4.04
 4.05 Intr - 118189 118052  138  1  0  142   99    86 0.999  14.84
 4.04 Intr - 118684 118576  109  0  1   86   65    74 0.948   4.27
 4.03 Intr - 125197 125019  179  1  2   78   70    77 0.369   2.70
 4.02 Intr - 127850 127628  223  1  1   34   12   172 0.386   1.41
 4.01 Init - 132779 132700   80  2  2   83   75    40 0.409   2.78
 4.00 Prom - 132956 132917   40                              -3.65

 5.03 PlyA - 134781 134776    6                               1.05
 5.02 Term - 146661 146522  140  1  2   76   38   105 0.852   1.44
 5.01 Init - 148543 148420  124  1  1   86   70    70 0.698   5.38
 5.00 Prom - 149136 149097   40                              -3.45

 6.07 PlyA - 149313 149308    6                              -0.45
 6.06 Term - 150114 149895  220  2  1   55   42   131 0.511   0.73
 6.05 Intr - 152434 152151  284  1  2   47   76   162 0.025   6.19
 6.04 Intr - 153731 153664   68  0  2   96   91    85 0.013   7.31
 6.03 Intr - 164679 164522  158  2  2   99   68    43 0.036   2.13
 6.02 Intr - 173940 173748  193  1  1   78   76    51 0.007   0.53
 6.01 Init - 174458 173969  490  2  1   53    1   307 0.018  13.91
 6.00 Prom - 175314 175275   40                              -4.55

 7.00 Prom + 180369 180408   40                              -7.65
 7.01 Init + 183673 183707   35  0  2   79   40    27 0.255  -3.31
 7.02 Term + 185600 185747  148  2  1   29   50   259 0.818  12.59
 7.03 PlyA + 186220 186225    6                               1.05

 8.03 PlyA - 187309 187304    6                               1.05
 8.02 Term - 201168 200920  249  0  0  113   39   166 0.893   8.92
 8.01 Init - 206409 206377   33  0  0   44  106    23 0.375  -0.38
 8.00 Prom - 206541 206502   40                              -6.55

 9.00 Prom + 206569 206608   40                              -6.95
 9.01 Init + 214110 214142   33  2  0   54  116     3 0.433  -0.29
 9.02 Term + 220344 220607  264  2  0   91   50   210 0.749  12.02
 9.03 PlyA + 221951 221956    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  13273  13207   67  1  1   98   94    44 0.851   7.29
S.002 Sngl -  17401  17063  339  1  0   68   47   209 0.869  10.58
S.003 Term +  76784  76900  117  1  0   65   45   168 0.881   7.76
S.004 Intr - 153089 152914  176  1  2   19   55   185 0.960   6.96
S.005 Sngl - 174458 173925  534  2  0   53   54   292 0.966  18.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:15727328_15952524|GENSCAN_predicted_peptide_1|214_aa
MDRGTFRDDEEVAGDDEAPMLKRTIGVKMGRTLGLLNKQFQYQGDVGKQVEKASFLAHNR
VRMRHSNFWEDDGNSYRYIKGDSQKGAMKDAETENGQRQSRKAQSLEDPAISQKAGLSGV
IWIQPTLATLPTKELLSRAPTPFSLEQLLSRPVQRTLKDDSRLVTPIVEEEGWAWATPLS
PALAPPPPDQSSLLLLAAGRISGLKLDLGLQSLI

>gi568815575r:15727328_15952524|GENSCAN_predicted_CDS_1|645_bp
atggacaggggaaccttccgcgatgatgaggaggtggcaggggatgatgaggccccaatg
ctgaagagaacaattggagtcaaaatgggaagaactttaggtttactcaataaacaattc
cagtatcagggggatgtgggcaaacaagtggaaaaggcctcctttcttgctcacaatcga
gttagaatgagacatagtaatttttgggaagatgatggcaacagttacaggtatatcaag
ggagactcacaaaaaggagccatgaaggatgctgagactgagaacggtcaaaggcaatca
aggaaggcgcagagcttggaagatccagctatttcacagaaggctggactttctggagtt
atttggattcaaccaaccttggcaaccctacctaccaaggaactgctctcccgcgccccc
acccccttttctctggaacagctgttatcgcgtcctgtgcaaagaaccttaaaggatgac
agccgtttggtaactcccatagtggaggaagagggctgggcctgggccacgcctctgagc
cctgctctggccccgcccccgccggaccaatcctccctcctcctgctcgccgctggccgc
attagtggattgaaactagacctggggcttcagagccttatctaa

>gi568815575r:15727328_15952524|GENSCAN_predicted_peptide_2|359_aa
MVKPYLYEKYKKEPLHRQIAVEKVPDSEIHASEALQPLYLYLQNPEPSLKALETIEKVED
TDLMGLELTQALWADKANTDYLRNGWMLNVHPLWESVDLVPGGDRQSPINIRWRDSVYDP
GLKPLTISYDPATCLHVWNNGYSFLVEFEDSTDKSVIKGGPLEHNYRLKQFHFHWGAIDA
WGSEHTVDSKCFPAELHLVHWNAVRFENFEDAALEENGLAVIGVFLKLGKHHKELQKLVD
TLPSIKHKDALVEFGSFDPSCLMPTCPDYWTYSGSLTTPPLSESVTWIIKKQPVEVDHDQ
LEQFRTLLFTSEGEKEKRMVDNFRPLQPLMNRTVRSSFRHDYVLNVQAKPKPATSQATP

>gi568815575r:15727328_15952524|GENSCAN_predicted_CDS_2|1080_bp
atggtaaaaccctatctctatgaaaaatacaaaaaagagcctctgcacaggcaaattgct
gtggagaaagttccagattccgagattcatgccagcgaggccctgcagcctctatacttg
tacttacaaaacccggaaccgagccttaaagccctggagactatagaaaaagtggaggac
acagacctcatgggcttggagttgacacaggccctttgggcagataaagcaaacacagat
tacctgaggaatggttggatgctaaatgtgcatccactctgggagagcgtggacctggtt
cctgggggcgatcgccagtcacccatcaacattcggtggagggacagtgtttatgatccc
ggcttaaaaccactgaccatctcttatgacccagccacctgcctccacgtctggaataat
gggtactctttcctcgtggaatttgaagattctacagataaatcagtgatcaagggagga
cccctggaacacaactaccgattgaagcagttccattttcactggggggccatcgatgcc
tggggttctgagcacaccgtggacagcaaatgcttcccagcagagctgcacttagtgcat
tggaacgcagtcagatttgaaaactttgaggatgcagcactggaagaaaatggtttggct
gtgataggagtatttttaaagctaggcaaacatcataaggagctacagaaattagtggat
actttgccgtcaattaagcataaggacgcccttgtggaatttgggtcatttgacccttcc
tgcctgatgcctacctgcccagattactggacctactcagggtctctgactaccccaccc
ctctccgagtctgtcacctggatcattaagaagcaaccagtagaggttgatcatgatcag
cttgagcaatttcggaccctgcttttcacttccgaaggggagaaagagaaaagaatggtg
gacaacttccgcccccttcagccactgatgaatcgcactgttcgttcatccttccggcat
gattatgtgctgaatgtacaagcgaaacccaagccggccaccagccaagcaaccccctaa

>gi568815575r:15727328_15952524|GENSCAN_predicted_peptide_3|473_aa
MAAPEKMTFPEKPSHKKYRAALKKEKRKKRRQELARLRDSGLSQKEEEEDTFIEEQQLEE
EKLLERERQRLHEEWLLREQKAQEEFRIKKEKEEAAKKRQEEQERKLKEQWEEQQRKERE
EEEQKRQEKKEKEKTFMWKPSVDLFSHLASLRTPIICQDKGNAYQEHFPEKKKKRSSQLA
IPEGAFGYPHHLLAILGLRCSRKHNFPTSSPTLLIKSMFTTFGMEQCRRDDYDPDASLEY
SEEETYQQFLDFYEDVLPEFKNVGKVIQFKVYLKYNNVQEESTATFFMCSEIPTMNSGKL
IETSTCLQIGLAPPLGRTPKGGRGWATTTTTTAGCGEGETLVQTTPTKEMGNPRGKVVVT
GGRNLTNAHQRVGRGTIHEAEEEIGTAAGTAAGAGAAGAGAGAGAAGAAAAGAKVPLGPE
VVAGGGRVIETELFRVPNPNKLVLFLNDCISYLLWLLHTRIEGYERRWVRKSG

>gi568815575r:15727328_15952524|GENSCAN_predicted_CDS_3|1422_bp
atggctgcgcccgagaagatgacgtttcccgagaaaccaagccacaaaaagtacagggcc
gccctgaagaaggagaaacgaaagaaacgtcggcaggaacttgctcgactgagagactca
ggactctcacagaaggaggaagaggaggacacttttattgaagaacaacaactagaagaa
gagaagctattggaaagagagaggcaaagattacatgaggagtggttgctaagagagcag
aaggcacaagaagaattcagaataaagaaggaaaaggaagaggcggctaaaaaacggcaa
gaagaacaagagagaaagttaaaggaacaatgggaagaacagcagaggaaagagagagaa
gaggaggagcagaaacgacaggagaagaaagaaaaagagaagacgttcatgtggaaacct
agtgttgatctcttcagtcatttagcaagtcttcgaacacctattatatgccaggacaaa
ggtaatgcgtaccaggagcacttccctgagaagaagaaaaagaggagtagtcagttagct
atacccgagggggcatttggatacccacaccatttattggccatccttggcctaagatgt
tcacgtaaacataatttcccaacatccagtcctacccttcttattaagagcatgtttacg
acgtttggaatggagcagtgcaggagggatgactatgaccctgacgcaagcctggagtac
agcgaggaagaaacctaccaacagttcctagacttctatgaggatgtgttgcccgagttc
aagaacgtggggaaagtgattcagttcaaggtttatttgaaatacaacaatgtccaagag
gaaagcactgcaactttcttcatgtgttcagaaatcccaacaatgaattctgggaagcta
atagagacatctacttgtctccagatcggactggctcctcctttgggaagaactccgaaa
ggagggagaggatgggccaccacgacgactactacagcaggctgcggggaaggagaaacc
ctagtccagaccactcctacaaaagaaatggggaatccgagaggaaaagtagtcgtcaca
gggggaagaaatctcacaaacgcacatcaaagagtcgggagaggcacaattcacgaagca
gaggaagaaatagggaccgcagcagggaccgcagccggggccggggcagccggagccgga
gccggagccggagccgcaggagccgccgcagccggagccaaagttcctctaggtcccgaa
gtcgtggcaggaggaggtcgggtaatagagacagaactgttcagagtcccaaatccaaat
aaactagttttgttcttaaatgattgtatatcttatttattatggttgctacatactcgg
atagaaggctatgaacgcagatgggttcgcaagtctggctga

>gi568815575r:15727328_15952524|GENSCAN_predicted_peptide_4|316_aa
MDEAGNHDSQQTIARTKNQTLHVLTHRLRTLQNPVLLSLNKQDSRPRASRRHLQERAGAR
ALGPGKRRFRVTWPGFRFAPPLSEETSNRLPWTQRLPALPPMQFMLLFSRQGKLRLQKWY
VPLSDKEKKKITRELVQTVLARKPKMCSFLEWRDLKIVYKRYASLYFCCAIEDQDNELIT
LEIIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLLGGEVQETSKKNVLKAIEQADL
LQEFIDWNMCQSRQLSYLWMDGSVWRWSIGQASSQWPFLDPKTLESKPLEDRLFEEGKRL
AESSSISKRLKTFIAT

>gi568815575r:15727328_15952524|GENSCAN_predicted_CDS_4|951_bp
atggatgaagctggaaaccatgattctcagcagactattgcaaggacaaaaaaccaaaca
ctgcatgttctcactcacaggctaaggacgctgcagaatcccgtcctcctatcattaaac
aagcaggacagccgcccgcgcgcgtcccggaggcacctgcaagagcgcgcgggcgcgcgg
gcgctcgggcccggcaaacgccgcttccgggtcacgtggcccggcttccggttcgcacct
cccctcagcgaagaaacctccaatcggctgccttggactcagcggttacccgcgctgcct
ccgatgcagtttatgttgctttttagtcgtcagggaaagcttcgactgcaaaaatggtat
gtcccactatcagacaaagagaagaaaaagatcacaagagaacttgttcagaccgtttta
gcacggaaacctaaaatgtgcagcttccttgagtggcgagatctgaagattgtttacaaa
agatatgctagtctgtatttttgctgtgctattgaggatcaggacaatgaactaattacc
ctggaaataattcatcgttatgtggaattacttgacaagtatttcggcagtgtctgtgaa
ctagatatcatctttaattttgagaaggcttattttattttggatgagtttcttttggga
ggggaagttcaggaaacatccaagaaaaatgtccttaaagcaattgagcaggctgatcta
ctgcaggagtttattgactggaatatgtgtcagtcacgtcaactcagttatctatggatg
gatggttcagtttggagatggtctatcggtcaggcctcttcccaatggccctttttggac
cctaaaacactggagtctaaacctttagaggacagactatttgaagaagggaaaagattg
gctgaaagtagcagtataagtaaacggttaaaaaccttcattgctacttaa

>gi568815575r:15727328_15952524|GENSCAN_predicted_peptide_5|87_aa
MQSMPVHPVHKDTKSLPNALEKTAHCLYHITLINKPNIISEGDGVCGQEDKKGENKAGSS
AAMPFVQSLHYFCLQMATVLKKAAKSD

>gi568815575r:15727328_15952524|GENSCAN_predicted_CDS_5|264_bp
atgcagagcatgcctgtccatcctgtccacaaggataccaagtccttgccaaatgcgttg
gagaaaactgcacactgtttgtaccacattactttgatcaacaagcctaacatcatcagt
gaaggtgatggagtatgtggtcaagaagacaaaaagggggaaaacaaggcagggagctca
gcagctatgccgtttgtacaaagccttcattatttctgcttgcagatggccacagtactt
aagaaagctgccaagtcagattaa

>gi568815575r:15727328_15952524|GENSCAN_predicted_peptide_6|470_aa
MWKTLELPRDLLNGFAQNANSDNSDMDNKVQAEVVSDENEECVGNWSKGDSCYVLAKRLL
AFCSCPRDLWNCGLERDDLGYLVEEISKQQSIQEVTWMLLKAFSFIREAKHKSSENLQPV
NAIEKKIPFSEERFKLAAEICINNEELNVNPQDNGKMSPGHVRGLEAYEAKWFHGPSPAL
PAVCSLKTWCPASQPVQPWLKGANVELGLWLQRVEAPSLGSFHMVLSLHVKCQLPFSFCH
DYKFPETSPEAKQMPESCFLYSLRNHEPIKPPFFTNGPVSGKNAEQLLRSLESLSSPKDD
KDVPMDLVLCVPASAAVAKKGHGTAWAMASEGASPEPWQLPCGIGPVGAWKSRIEVWELP
PRFQRMYGNAWMSRQKFSARAGPSWRTSVRPVQKGNVGLPIHTVSFLISDFTVETPLKVQ
LTSAPERKLVLNAPGRRSASSSGRMDAGKPTLLAGFRQQDVPLTKRPFFL

>gi568815575r:15727328_15952524|GENSCAN_predicted_CDS_6|1413_bp
atgtggaaaactttggaacttcctagagacttgttgaatggctttgcccaaaatgctaac
agtgataacagtgatatggacaataaagtccaggctgaagtggtctcagatgaaaatgag
gaatgtgttgggaactggagcaaaggtgactcttgttatgttttagcaaagagactgctg
gcattttgctcctgccctagagatttgtggaactgtggacttgagagagatgatttaggg
tatctggtggaagaaatttccaagcagcaaagcattcaagaggtgacttggatgctgtta
aaggcattcagttttataagagaagcaaagcataaaagttcagaaaatttgcagcctgtc
aatgctatagaaaagaaaatcccattttctgaggagagattcaagctggctgcagaaatt
tgcataaataacgaggagctgaatgttaatccccaagacaatggaaaaatgtctccaggg
catgtcagaggcctggaggcctatgaggcaaagtggtttcatgggcccagcccagcactc
cctgctgtgtgtagcctaaagacttggtgccctgcatcccagccagtccagccatggctg
aaaggagctaatgtagagcttgggctgtggcttcagagggtggaagccccaagccttggc
agcttccacatggtgttgagcctccatgtgaagtgtcagctccccttctccttctgccat
gattataagtttcctgagacctccccagaagccaaacagatgccagaatcatgcttcctg
tacagcctgcggaaccatgagccaattaaacctcctttctttacaaatggcccagtctca
ggtaagaatgctgagcaactcctcagaagtttggagagcctttcaagtcctaaagatgac
aaagatgtgcctatggacttggtgctctgtgtcccagcctctgcagctgtggctaaaaag
ggccacggtacagcttgggccatggcttcagagggtgcaagccccgagccttggcagctt
ccatgtggtattgggcctgtgggtgcatggaagtcaagaattgaggtttgggaacttcca
cctagatttcagaggatgtatggaaatgcctggatgtccaggcagaagttttctgcaagg
gcagggccttcatggagaacctctgttaggccagtgcaaaagggaaatgtaggcctacct
attcatacagtgtcttttctaatctctgatttcactgtagaaaccccacttaaggtccag
ttaacttctgcccctgaaaggaagcttgtgcttaatgcccctggaagaaggtctgccagc
tcttcaggccgaatggatgctgggaagccaactctcctggcaggcttccgacagcaagat
gttcctctaaccaagaggcccttcttcctctga

>gi568815575r:15727328_15952524|GENSCAN_predicted_peptide_7|60_aa
MVPWDREAVPQWTFNYLDGDDDDNSGNFGGGDNVNGGDDADVAGDDDGQERDKEESLIYG

>gi568815575r:15727328_15952524|GENSCAN_predicted_CDS_7|183_bp
atggttccctgggaccgggaagctgtgccacagtggacgtttaactatctggatggagat
gatgatgataatagtggtaattttggaggaggtgacaatgttaatggtggtgatgatgct
gatgttgctggtgatgatgatggccaagaaagagataaagaagaaagtcttatttatggt
tga

>gi568815575r:15727328_15952524|GENSCAN_predicted_peptide_8|93_aa
MSQEESTQQRKCTSDLYIAESMAKSPLPVSKQQHLAQPINSSPCYMVFTRFPEPYPLLVF
PSHWLDLLSALWMVPLRSLESMLAVSSSLSHPE

>gi568815575r:15727328_15952524|GENSCAN_predicted_CDS_8|282_bp
atgtctcaggaagaatcaacacagcaaagaaaatgcactagtgatctttacattgctgaa
tctatggccaaatctcctcttcccgtgagcaaacagcagcatttggcacaacccataaac
tcttctccctgctacatggtcttcactcggtttccagaaccctaccctctattggttttc
ccatctcactggttggatcttctcagtgctctttggatggttcctcttcgttccctggaa
tcaatgctggctgtctcttcttctctgtcacaccctgaatga

>gi568815575r:15727328_15952524|GENSCAN_predicted_peptide_9|98_aa
MHHENLCAVRKATALHFSLVIKEDEQDHALGKAIQGNEAGMQSESRRKCSQQQGGGKEWP
GTRRGLSQQETTKADCGSKPSPLFVKRIHNKAWTPTDK

>gi568815575r:15727328_15952524|GENSCAN_predicted_CDS_9|297_bp
atgcaccatgaaaacctgtgtgctgtgagaaaggcaacagccttgcatttcagtctggta
attaaggaggatgagcaggatcatgctttgggcaaagcaatacaagggaatgaagcagga
atgcaaagtgagtcccggaggaagtgtagccagcagcagggtggagggaaggagtggcct
ggaaccaggaggggattgagtcagcaggaaacaaccaaggcagattgtgggagtaagcct
tcccctttgtttgtgaagaggattcacaacaaagcctggactcctactgataagtga