GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:52:07 Sequence gi568815595f:16075152_16327497 : 252346 bp : 43.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 488 483 6 1.05 1.02 Term - 2524 2409 116 2 2 78 49 85 0.595 2.33 1.01 Init - 7254 7131 124 0 1 59 55 136 0.367 7.73 1.00 Prom - 9965 9926 40 -4.96 2.06 PlyA - 10776 10771 6 1.05 2.05 Term - 16662 16525 138 0 0 41 52 148 0.293 4.46 2.04 Intr - 22703 22557 147 2 0 40 86 105 0.535 5.93 2.03 Intr - 34900 34874 27 1 0 77 96 37 0.034 1.71 2.02 Intr - 50329 50187 143 2 2 77 48 59 0.441 0.87 2.01 Init - 53514 53436 79 0 1 115 60 49 0.786 6.12 2.00 Prom - 54508 54469 40 -2.46 3.02 PlyA - 55908 55903 6 1.05 3.01 Sngl - 76393 76145 249 1 0 92 54 144 0.949 6.48 3.00 Prom - 84206 84167 40 -2.46 4.02 PlyA - 84970 84965 6 1.05 4.01 Sngl - 87307 86282 1026 1 0 49 43 394 0.995 28.09 4.00 Prom - 87400 87361 40 -4.96 5.00 Prom + 92675 92714 40 -5.26 5.01 Init + 100001 100539 539 1 2 82 93 433 0.841 35.54 5.02 Intr + 108382 108420 39 1 0 68 121 15 0.064 0.14 5.03 Intr + 120609 120775 167 0 2 140 63 272 0.393 29.50 5.04 Intr + 125468 125672 205 0 1 93 93 144 0.633 13.66 5.05 Intr + 133352 133519 168 2 0 57 109 137 0.951 11.76 5.06 Intr + 135973 136090 118 1 1 88 95 78 0.521 8.97 5.07 Intr + 137418 137612 195 2 0 91 116 85 0.524 11.21 5.08 Intr + 144759 144863 105 2 0 91 94 69 0.995 8.21 5.09 Intr + 147464 147607 144 1 0 75 105 124 0.955 13.28 5.10 Term + 152203 152349 147 0 0 122 42 165 0.999 13.20 5.11 PlyA + 153866 153871 6 1.05 6.00 Prom + 157484 157523 40 -2.06 6.01 Init + 168119 168224 106 1 1 79 71 85 0.574 6.28 6.02 Term + 176445 176557 113 1 2 99 54 41 0.207 0.52 6.03 PlyA + 176976 176981 6 1.05 7.04 PlyA - 177003 176998 6 1.05 7.03 Term - 177351 177218 134 1 2 112 41 65 0.287 2.45 7.02 Intr - 184715 184640 76 2 1 49 98 9 0.054 -2.91 7.01 Init - 189725 189618 108 2 0 85 88 167 0.652 16.62 7.00 Prom - 195107 195068 40 -0.46 8.08 PlyA - 196458 196453 6 1.05 8.07 Term - 208546 208436 111 1 0 100 42 55 0.081 0.66 8.06 Intr - 220948 220887 62 2 2 97 87 28 0.006 2.05 8.05 Intr - 229963 229941 23 0 2 73 119 -15 0.004 -2.71 8.04 Intr - 240507 240474 34 1 1 80 111 48 0.228 3.58 8.03 Intr - 242081 241697 385 2 1 68 60 251 0.138 14.72 8.02 Intr - 248306 248225 82 1 1 66 116 27 0.610 2.94 8.01 Intr - 251725 251622 104 1 2 101 92 177 0.943 18.37 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 103536 103761 226 0 1 67 46 148 0.852 4.75 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:16075152_16327497|GENSCAN_predicted_peptide_1|79_aa MWESLELPRDLLNGFNQNADNEVQVEVVSDEDEELVGNWSKGGLVVSLTSGMKPQTLTVS VTAHKGSADPKSEQQQDLS >gi568815595f:16075152_16327497|GENSCAN_predicted_CDS_1|240_bp atgtgggaaagtttggaacttcctagagacttgttgaatggctttaaccaaaatgctgac aatgaagtccaggttgaggtggtctcagatgaagatgaggaacttgttgggaactggagc aaaggtgggctcgtggtctcgctgacttcaggaatgaagccgcagaccctcacagtgagt gttacagctcataaaggtagcgcagacccaaagagtgagcagcaacaagatttatcatga >gi568815595f:16075152_16327497|GENSCAN_predicted_peptide_2|177_aa MDELMPEYRWGQARASFLNAIEVSDIGIYTLTPAGTFQNAADSPDHSPALGYTIGLLEFT TVLNTRALTLYPPKMTLENAVRRAPPPNIITLRVRISTYEFEGDTNSQSIVKMKKKEEKA QVEQERRKREEKPLEFQTNHQQANSHPAMGTRVSAASQRPPGESKQYRCTLPGLVLV >gi568815595f:16075152_16327497|GENSCAN_predicted_CDS_2|534_bp atggatgagttgatgcctgagtatcgatgggggcaagccagagcgtccttcctaaacgct atcgaagtctctgacataggtatatacaccctcacaccagcaggtaccttccaaaatgct gcagactccccagaccacagtcctgctcttggatacaccattggccttttggagttcacc acagttctgaatactcgggctctaactctgtaccctcccaagatgactctggagaatgct gtacggagggcaccccctccaaatatcatcacactgagggttaggatttcaacatatgaa tttgaaggggatacaaacagtcagtccattgtgaaaatgaagaagaaagaagaaaaagca caagtggaacaagagaggaggaagagggaagagaagccactagaattccagaccaaccat cagcaggccaactcccatccagccatgggaaccagggtgtcagcagcttcccaacgacca ccaggtgagtccaagcagtatcgctgcacactccctggcctggtgttggtgtga >gi568815595f:16075152_16327497|GENSCAN_predicted_peptide_3|82_aa MDTEWRDQEQVGIHRYKVEPHKDELKPVSFLVASDVGGLGVLEKLGPSVIELNIDGWPRK RKLKEDPGEDKALAGPAVASCQ >gi568815595f:16075152_16327497|GENSCAN_predicted_CDS_3|249_bp atggacacagagtggagagatcaagaacaagttggaatccataggtacaaggtggagccc cacaaagatgaattaaaacccgtgtcttttcttgttgcctctgatgtaggtggtttgggt gtcttggagaagctggggccttctgtcatagagctaaacatagatggctggcccagaaag cggaaactgaaggaggatccaggagaagataaagcacttgcaggcccagcagtggcttcc tgccaatga >gi568815595f:16075152_16327497|GENSCAN_predicted_peptide_4|341_aa MGDFNTPLSTLERSTRQEVNKDIQDLNSALQQADLTDIYRTLQPKSTEYTFFSAPHRTYS KIDHIVGSKALLSKRKRTEIITNCLSDHSAIKLEFKIKKLTQNRLTIWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDTFKARCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETPKTLQKINESRSWFLEKINKIDRLLARLIK KKREKNQIDAIKNDKGDVTTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN QEEAESLNRPITGSEIEAIIKSLPIKKSPRSDGFTAEFYQR >gi568815595f:16075152_16327497|GENSCAN_predicted_CDS_4|1026_bp atgggagactttaacaccccactgtcaacattagagagatccacgagacaggaagttaac aaggatatccaggacttgaactcagctctgcaacaagcagacctaacagacatctacaga actctccaacccaaatcaacagaatatacattcttctcagcaccacatcgcacttattcc aaaattgaccacatagttggaagtaaagcactcctcagcaaacgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaattcaagattaagaaactc actcaaaaccgcttaactatatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca acataccagaatctctgggacacatttaaggcaaggtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaa gaactagagaagcaagagcaaacacattcaaaagccagcagaaggcaagaaataactaag atcagagcagaactgaaggagatagagacaccaaaaacccttcaaaaaatcaatgaatcc aggagctggtttttagaaaagattaacaaaattgatagactgctagcaagactaataaag aagaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggatgtcaccacc gatcccacagagatacaaactaccatcagagaatactataaacacctctatgcaaataaa ctagaaaatctagaagaaatggataaattcctggacacatacaccctcccaagactaaac caggaagaagctgagtccctgaatagaccaataacaggctctgaaattgaggcaataatt aagagcctaccaatcaaaaaaagtccaagatcagacggattcacagctgaattctaccag aggtag >gi568815595f:16075152_16327497|GENSCAN_predicted_peptide_5|608_aa MLLRKRYRHRPCRLQFLLLLLMLGCVLMMVAMLHPPHHTLHQTVTAQASKHSPEARYRLD FGESQDWVLEAEDEGEEYSPLEGLPPFISLREDQLLVAVALPQARRNQSQGRRGGSYRLI KQPRRQDKEAPKRDWGADEDGEVSEEEELTPFSLDPRGLQEALSARIPLQRALPEVRHPL EQIAIEQLSEVRRCLQQHPQDSLPTASVILCFHDEAWSTLLRTVHSILDTVPRAFLKEII LVDDLSQQGQLKSALSEYVARLEGVKLLRSNKRLGAIRARMLGATRATGDVLVFMDAHCE CHPGWLEPLLSRIAGDRSRVVSPVIDVIDWKTFQYYPSKDLQRGVLDWKLDFHWEPLPEH VRKALQSPISPIRSPVVPGEVVAMDRHYFQNTGAYDSLMSLRGGENLELSFKAWLCGGSV EILPCSRVGHIYQNQDSHSPLDQEATLRNRVRIAETWLGSFKETFYKHSPEAFSLSKLHN TGLGLCADCQAEGDILGCPMVLAPCSDSRQQQYLQHTSRKEIHFGSPQHLCFAVRQEQVI LQNCTEEGLAIHQQHWDFQENGMIVHILSGKCMEAVVQENNKDLYLRPCDGKARQQWRFD QINAVDER >gi568815595f:16075152_16327497|GENSCAN_predicted_CDS_5|1827_bp atgctcctaaggaagcgatacaggcacagaccatgcagactccagttcctcctgctgctc ctgatgctgggatgcgtcctgatgatggtggcgatgttgcaccctccccaccacaccctg caccagactgtcacagcccaagccagcaagcacagccctgaagccaggtaccgcctggac tttggggaatcccaggattgggtactggaagctgaggatgagggtgaagagtacagccct ctggagggcctgccaccctttatctcactgcgggaggatcagctgctggtggccgtggcc ttaccccaggccagaaggaaccagagccagggcaggagaggtgggagctaccgcctcatc aagcagccaaggaggcaggataaggaagccccaaagagggactggggggctgatgaggac ggggaggtgtctgaagaagaggagttgaccccgttcagcctggacccacgtggcctccag gaggcactcagtgcccgcatccccctccagagggctctgcccgaggtgcggcacccactt gaacaaatagctattgaacaactatcagaagtaagaaggtgtctgcagcagcaccctcag gacagcctgcccacagccagcgtcatcctctgtttccatgatgaggcctggtccactctc ctgcggactgtacacagcatcctcgacacagtgcccagggccttcctgaaggagatcatc ctcgtggacgacctcagccagcaaggacaactcaagtctgctctcagcgaatatgtggcc aggctggagggggtgaagttactcaggagcaacaagaggctgggtgccatcagggcccgg atgctgggggccaccagagccaccggggatgtgctcgtcttcatggatgcccactgcgag tgccacccaggctggctggagcccctcctcagcagaatagctggtgacaggagccgagtg gtatctccggtgatagatgtgattgactggaagactttccagtattacccctcaaaggac ctgcagcgtggggtgttggactggaagctggatttccactgggaacctttgccagagcat gtgaggaaggccctccagtcccccataagccccatcaggagccctgtggtgcccggagag gtggtggccatggacagacattacttccaaaacactggagcgtatgactctcttatgtcg ctgcgaggtggtgaaaacctcgaactgtctttcaaggcctggctctgtggtggctctgtt gaaatccttccctgctctcgggtaggacacatctaccaaaatcaggattcccattccccc ctcgaccaggaggccaccctgaggaacagggttcgcattgctgagacctggctggggtca ttcaaagaaaccttctacaagcatagcccagaggccttctccttgagcaagctccacaac actggacttgggctctgtgcagactgccaggcagaaggggacatcctgggctgtcccatg gtgttggctccttgcagtgacagccggcagcaacagtacctgcagcacaccagcaggaag gagattcactttggcagcccacagcacctgtgctttgctgtcaggcaggagcaggtgatt cttcagaactgcacggaggaaggcctggccatccaccagcagcactgggacttccaggag aatgggatgattgtccacattctttctgggaaatgcatggaagctgtggtgcaagaaaac aataaagatttgtacctgcgtccgtgtgatggaaaagcccgccagcagtggcgttttgac cagatcaatgctgtggatgaacgatga >gi568815595f:16075152_16327497|GENSCAN_predicted_peptide_6|72_aa MEVRPELGLISKEAQGCDTGHRNICNKSQPTKVPSAQKGSSPLEFAPSDKPGQLHPSIEI VTCFLKVTSEQS >gi568815595f:16075152_16327497|GENSCAN_predicted_CDS_6|219_bp atggaagttcggccagagctggggctgatatccaaggaggcccaaggatgcgacactgga caccgaaacatctgcaataagagccaaccaacaaaggttccctcagcccagaagggcagc agtcctctcgagtttgccccttcagacaaaccaggacagcttcatccaagtatagaaata gtcacctgctttctcaaagtcacatcagaacagagctga >gi568815595f:16075152_16327497|GENSCAN_predicted_peptide_7|105_aa MAVFHDEVEIEDFQYDEDSETYFYPCPCGDNFSITKVGVNVKFPWLVAVLLAYHSVVSLL QASSLIHFSGAQQFVMQTVSNQDSSSHFKSDMIRVTIAETLNFGP >gi568815595f:16075152_16327497|GENSCAN_predicted_CDS_7|318_bp atggcagtgtttcatgacgaggtggaaatcgaggacttccaatatgacgaggactcggag acgtatttctatccctgcccatgtggagataacttctccatcaccaaggtaggggttaac gtcaaatttccatggctggtagctgtgcttttggcatatcacagtgttgtgtcactacta caagcgagttccctgatacatttcagtggtgcccaacagtttgttatgcagactgtttca aatcaggacagcagctctcacttcaagtctgatatgatccgcgtgaccatagctgaaacc ttgaactttggaccttaa >gi568815595f:16075152_16327497|GENSCAN_predicted_peptide_8|266_aa GVEVQTDYVPLLNSLAAYGWQLTCVLPTPVVKTTSEGSVSTKQIVFLQRPCLPQKIKKKE SKFQWRFSREEMHNRQMRKSKGKLSARDKQQAEENEKNLEDQSSKAGDMGNCVSGQQQEG GVSEEMKGPVQEDKGEQLSPGGLLCGVGVEGEAVQNGPASHSRALVGICTGHSNPGEDAR DGDAEEVRELERDGKWNSDATSLGTLGPTGGPSTATKRNCVSIPSLQARAVEALKDSQTT TLLPPSADNSSLLFCQSQTDINTPHK >gi568815595f:16075152_16327497|GENSCAN_predicted_CDS_8|801_bp ggtgtcgaagtgcagacagactacgtgcccctgctgaactcgctggcggcctatggctgg cagctcacctgtgtgctaccaactcccgtcgtcaagactaccagcgaggggagtgtatcc accaagcagattgtctttcttcagagaccttgtctacctcagaaaatcaagaagaaggaa tcgaagtttcagtggcgattctccagagaagaaatgcacaacaggcagatgaggaaatca aaaggtaaactcagtgccagagacaaacaacaagcagaagaaaatgagaagaacttagaa gaccagtcttccaaagctggagacatgggaaactgtgtttcaggacagcagcaggagggt ggagtctccgaggagatgaagggccctgtccaagaggacaagggagaacagctgtcccct ggtggcctgctgtgtggggtgggtgtggagggtgaggctgtgcagaatggtcctgccagc cacagcagggccctggtggggatttgcactgggcactccaatcctggagaggatgccagg gacggggatgctgaggaagtcagagagcttgaaagagatggcaagtggaactcagatgcc accagcctgggcactttggggccaactggtgggccttctacagccacaaaacgcaactgt gtttccataccctctttgcaggcaagagctgttgaggccctcaaggactcacagactact actctcctcccaccttctgctgacaactcatcattacttttctgccaatcacaaactgac atcaacactccccataagtaa