GENSCAN 1.0 Date run: 5-Nov-116 Time: 14:56:05 Sequence gi568815596f:173258711_173467286 : 208576 bp : 42.30% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 16 105 90 0 0 68 97 61 0.566 5.54 1.02 Intr + 2353 2427 75 0 0 88 99 36 0.911 3.49 1.03 Intr + 5035 5185 151 0 1 75 108 75 0.995 7.01 1.04 Term + 7340 8040 701 0 2 121 45 514 0.999 43.11 1.05 PlyA + 8787 8792 6 1.05 2.00 Prom + 11408 11447 40 -8.95 2.01 Init + 12810 12994 185 2 2 73 72 105 0.547 6.04 2.02 Intr + 14271 14512 242 0 2 67 49 174 0.724 7.67 2.03 Intr + 23797 24006 210 2 0 39 36 132 0.007 1.06 2.04 Intr + 38885 39126 242 0 2 41 13 257 0.031 9.85 2.05 Term + 39233 39706 474 1 0 92 41 309 0.966 20.50 2.06 PlyA + 39822 39827 6 1.05 3.03 PlyA - 40928 40923 6 -0.45 3.02 Term - 41588 41533 56 0 2 63 54 69 0.394 -2.26 3.01 Init - 42510 42285 226 0 1 94 109 136 0.632 14.98 3.00 Prom - 45930 45891 40 -4.55 4.03 PlyA - 46224 46219 6 1.05 4.02 Term - 47660 47429 232 1 1 79 39 139 0.176 3.06 4.01 Init - 50305 50298 8 1 2 54 131 0 0.197 1.40 4.00 Prom - 52162 52123 40 -1.95 5.00 Prom + 69076 69115 40 -0.45 5.01 Init + 96254 96274 21 1 0 94 93 70 0.524 5.92 5.02 Intr + 100002 100127 126 2 0 57 127 81 0.864 8.86 5.03 Intr + 100545 100781 237 2 0 68 101 203 0.990 16.49 5.04 Intr + 104516 104752 237 1 0 18 78 174 0.967 6.29 5.05 Intr + 105108 105185 78 2 0 89 89 27 0.744 1.73 5.06 Intr + 106085 106279 195 1 0 26 82 194 0.824 11.29 5.07 Intr + 106742 106882 141 1 0 68 85 114 0.745 8.73 5.08 Intr + 107573 107722 150 1 0 62 94 123 0.951 9.74 5.09 Intr + 108440 108576 137 1 2 67 67 84 0.557 2.65 5.10 Intr + 122156 122214 59 2 2 69 107 5 0.039 -1.89 5.11 Intr + 127872 127994 123 1 0 46 105 61 0.161 3.24 5.12 Term + 128574 128716 143 1 2 80 40 102 0.388 1.71 5.13 PlyA + 131835 131840 6 1.05 6.03 PlyA - 132224 132219 6 1.05 6.02 Term - 135781 135548 234 1 0 64 45 226 0.703 11.24 6.01 Init - 136578 136453 126 0 0 59 53 163 0.954 10.11 6.00 Prom - 140363 140324 40 -8.15 7.09 PlyA - 140447 140442 6 1.05 7.08 Term - 143994 143795 200 1 2 38 43 312 0.756 18.28 7.07 Intr - 144363 144241 123 1 0 65 63 99 0.399 4.74 7.06 Intr - 155078 154947 132 0 0 46 41 108 0.126 1.60 7.05 Intr - 164246 164024 223 2 1 19 98 54 0.010 -3.72 7.04 Intr - 164478 164289 190 2 1 6 78 178 0.033 7.27 7.03 Intr - 167709 167657 53 0 2 131 53 12 0.038 -1.21 7.02 Intr - 170677 170529 149 2 2 32 96 93 0.012 3.53 7.01 Init - 173102 172904 199 2 1 92 50 143 0.010 10.01 7.00 Prom - 178334 178295 40 -5.45 8.00 Prom + 187613 187652 40 -5.95 8.01 Init + 192484 192620 137 0 2 67 75 159 0.717 12.16 8.02 Intr + 194531 194681 151 2 1 -36 45 168 0.238 -0.66 8.03 Intr + 196147 196273 127 0 1 64 72 40 0.133 -0.67 8.04 Term + 200748 200863 116 1 2 69 32 133 0.626 3.55 8.05 PlyA + 201852 201857 6 1.05 9.02 PlyA - 202137 202132 6 1.05 9.01 Term - 206111 205990 122 0 2 83 48 127 0.922 5.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:173258711_173467286|GENSCAN_predicted_peptide_1|338_aa MDGDEIAITYIKDVTFNTNLPDAEILKMTKPPFVMEKWIVGIAKSQTVECTVTYESDVRT PKSTKHVHSIQWSRTKPQDEVKAVQLAIQTLFTNSDGNPGSRSDSSADCQWLDTLRMRQI ASNTSLQRSQSNPILGSPFFSHFDGQDSYAAAVRRPQVPIKYQQITPVNQSRSSSPTQYG LTKNFSSLHLNSRDSGFSSGNTDTSSERGRYSDRSRNKYGRGSISLNSSPRGRYSGKSQH STPSRGRYPGKFYRVSQSALNPHQSPDFKRSPRDLHQPNTIPGMPLHPETDSRASEEDSK VSEGGWTKVEYRKKPHRPSPAKTNKERARGDHRGWRNF >gi568815596f:173258711_173467286|GENSCAN_predicted_CDS_1|1017_bp atggatggggatgaaattgcaataacctacataaaagatgtgacattcaacactaaccta cctgatgcggagattttaaagatgacaaagccaccatttgtaatggagaagtggattgta ggaatagcaaaaagtcagactgtggagtgcactgtcacatatgagagtgatgttagaact ccaaaaagcactaaacatgtccattcgattcagtggagtagaacaaaacctcaggatgaa gtgaaagcagtccaacttgccattcagacattattcaccaattcagatggcaaccctgga agcaggtccgactcaagtgctgattgccagtggttagatactctgaggatgcggcagatt gcatccaacacttctttacagcgttcccagagcaatcctattctggggtcaccgttcttc tcacactttgatggccaggattcctacgctgctgctgtgagacggccccaggtgcccatt aagtatcaacagattacacctgtgaaccagtccagaagctcgtctcctactcagtatgga ctgaccaaaaacttctcttccctacatctcaactctagggacagtggcttttccagtggc aatactgacacctcttcagagaggggtcgatactcagacagaagcaggaacaaatatgga cgtggtagtatatcactcaattcttctcctagaggaagatacagtggaaagagtcagcat tccactccttcaagaggaagataccctggaaagttctacagggtttctcagtcagcactc aatcctcaccagtcgcctgacttcaagagaagccccagggacctccaccaacccaacacc ataccagggatgcctttgcaccctgagactgactcaagagccagtgaagaggacagcaaa gtcagcgaagggggctggacaaaagtggaataccggaaaaagccccacaggccatctccc gccaaaaccaataaagagagagccagaggggaccaccgtggatggagaaacttttga >gi568815596f:173258711_173467286|GENSCAN_predicted_peptide_2|450_aa METCCRYQAIQREHGSFRGCGELEQPKLVVNGSLEERNPQRKRLAILGKQDFHAGHDLPN LSALYSPAVTASKSTQSFTESLTLAVFLSFLLPKDTSTLISCQMMPLLAETGGRFVLVRG QFKICQLVVADYQDVQTGSFAMYSGVQLASPSGSRTGATGGAAHSPARAPALHSPWAVDG TRGCGAGGSARLGGSGRVGAHGSREAQAWWAAEKREGTSPRKHQMADDAGAGVGGRRPRG PRMGNRGGFHRGFGSGIRHQGCGCGQGQGQGRGVRGGKAEYKEWMTVTKLGHLDEVLKIM LVQKQTHTSQCTRFKAFVATGDYNGHISLGIKCSKEVATAICGAIILAKLSIVPACRGYW GNKISKPHTIPCKVAGCCGSVLVCLISVPRGTGIVLAPVPKKLLMMTGINDCYTSARGCT ATLGNFAKATFDAISKTYSYLTPDPGRRLY >gi568815596f:173258711_173467286|GENSCAN_predicted_CDS_2|1353_bp atggaaacgtgctgcaggtaccaagcaatccaaagagagcatgggtctttcagaggctgt ggtgagttggaacagccaaagctggtcgtcaatggaagcctagaagaacgcaatccacaa aggaaaaggttagccatactgggcaagcaagactttcatgcagggcacgacctgcccaat ctcagtgccctctacagtcctgctgtgactgcttctaaaagtacacaaagctttactgaa agtctgaccctggcagttttcctttcctttctcttacccaaagacaccagcaccctcatt agttgccagatgatgcctcttctggctgagactggaggacgctttgtgttggtgcgggga caattcaagatttgtcagctggtagtggctgattatcaagatgtccaaactggctccttt gctatgtactcaggagtccagctggcttcacctagtggatctcgcacaggggccacaggt ggagctgcccacagtccggcccgtgcacccgcactccacagcccttgggcagtcgatggg accaggggctgtggagcagggggcagtgctcgtctgggaggctcgggccgcgtaggagcc cacggcagcagggaggctcaggcatggtgggctgcagaaaagagagaaggaacttctcca agaaaacaccaaatggcagatgatgctggtgcaggagtagggggccggaggccccggggg cctaggatggggaaccgtggtggcttccacagaggctttggtagtggcatccggcaccag ggttgtggctgtggacagggccagggccaaggccgtggagttcgtggaggcaaggccgag tataaggagtggatgaccgtcaccaagctgggccacctggacgaggttttgaaaattatg ctggtgcagaagcagacccataccagccagtgtaccaggttcaaggcatttgttgccact ggggactacaatggccacatcagtctgggtattaagtgctccaaggaggtggccactgcc atctgtggggctatcatcctggccaagctctccattgtccctgcgtgcagaggctactgg gggaacaaaatcagcaagccccacaccatcccttgcaaggtggcaggctgctgtggctct gtgctggtgtgcctcatttctgtgcccaggggcactggcattgtcttggcccctgtgccc aagaagctactcatgatgactggtatcaatgactgctacacctcagccaggggctgcact gccaccctgggcaactttgccaaggccacctttgatgccatctctaagacctacagctac ctgacccccgaccctggaaggagactgtattaa >gi568815596f:173258711_173467286|GENSCAN_predicted_peptide_3|93_aa MKVHGFIPEVSKTKNPPEGLNSGHNNTSILGAGRWWNGKQLQMNSLASSEETFTNRAESP LQEFTTLCIFPSSGTGSLIFLVVKQRNAGDTRQ >gi568815596f:173258711_173467286|GENSCAN_predicted_CDS_3|282_bp atgaaggtccacggcttcatccctgaagtcagcaagaccaagaacccaccagaaggacta aattccggacacaataacaccagcattttaggagctggaagatggtggaatggcaagcaa cttcagatgaattccctagccagcagtgaggaaacattcactaacagagcagaatccccc ctgcaggaattcacaacattgtgtatctttccgagtagtggcacaggctccttaattttc ctggttgtgaaacaaagaaatgcaggtgatacccggcaatga >gi568815596f:173258711_173467286|GENSCAN_predicted_peptide_4|79_aa MHRQKPSEAPTASAFNSTSLCIDLIHQPADYQGFVIILKNFRARANDPSNSLPSDFLNVI LKILKMAKFLSHLRTAPFA >gi568815596f:173258711_173467286|GENSCAN_predicted_CDS_4|240_bp atgcaccggcaaaagccttctgaagctcccactgcctctgccttcaattccacatccctt tgcattgatctcattcatcaaccagcagattaccagggatttgtcattatcctcaagaat tttagagcccgtgcaaatgacccatccaacagcttgccttcagacttccttaatgtgatt ctaaagattctaaagatggccaaattcctgtcacacctgagaacagctccatttgcctaa >gi568815596f:173258711_173467286|GENSCAN_predicted_peptide_5|548_aa MDARRVPQKDLRVKKNLKKFRYVKLISMETSSSSDDSCDSFASDNFANTKPKFRSDISEE LANVFYEDSDNESFCGFSESEVQDVLDHCGFLQKPRPDVTNELAGIFHADSDDESFCGFS ESEIQDGMRLQSVREGCRTRSQCRHSGPLRVAMKFPARSTRGATNKKAESRQPSENSVTD SNSDSEDESGMNFLEKRALNIKQNKAMLAKLMSELESFPGSFRGRHPLPGSDSQSRRPRR RTFPGVASRRNPERRARPLTRSRSRILGSLDALPMEEEEEEDKYMLVRKRKTVDGYMNED DLPRSRRSRSSVTLPHIIRPVEEITEEELENVCSNSREKIYNRSLGSTCHQCRQKTIDTK TNCRNPDCWGVRGQFCGPCLRNRYGEEVRDALLDPNWHCPPCRGICNCSFCRQRDGRCAT GVLVYLAKYHGFGNVHAYLKSFHSDCTGVHPQAIKSARLRGKDQCTQTTKANKKKRLEDT QPGGFLQPSEQDQDYSILFPKDIEPWPADEGLAKPGYSLPPHGQLESGNQPKHPLKPQHS PSLKSVSQ >gi568815596f:173258711_173467286|GENSCAN_predicted_CDS_5|1647_bp atggacgctcgccgcgtgccgcagaaagatctcagagtaaagaagaacttaaagaaattc agatatgtgaagttgatttccatggaaacctcgtcatcctctgatgacagttgtgacagc tttgcttctgataattttgcaaacacgaaacctaaattcaggtcagatatcagtgaagaa ctggcaaatgttttttatgaggactctgataatgaatctttctgcggcttttcagaaagt gaggtgcaagatgtattagaccattgtggatttttacagaaaccaaggccagatgtcact aacgaactggccggtatttttcatgccgactctgacgatgaatcattttgcggtttctca gagagtgagatacaagatggaatgaggctgcagtcagttcgggaaggctgtaggacccgc agccagtgcaggcactctggacctctcagggtggcgatgaagtttccagcgcggagtacc aggggagcaaccaacaaaaaagcagagtcccgccagccctcagagaattctgtgactgat tccaactccgattcagaagatgaaagtggaatgaattttttggagaaaagggctttaaat ataaagcaaaacaaagcaatgcttgcaaaactcatgtctgaattagaaagcttccctggc tcgttccgtggaagacatcccctcccaggctccgactcacaatcaaggagaccgcgaagg cgtacattcccgggtgttgcttccaggagaaaccctgaacggagagctcgtcctcttacc aggtcaaggtcccggatcctcgggtcccttgacgctctacccatggaggaggaggaggaa gaggataagtacatgttggtgagaaagaggaagaccgtggatggctacatgaatgaagat gacctgcccagaagccgtcgctccagatcatccgtgacccttccgcatataattcgccca gtggaagaaattacagaggaggagttggagaacgtctgcagcaattctcgagagaagata tataaccgttcactgggctctacttgtcatcaatgccgtcagaagactattgataccaaa acaaactgcagaaacccagactgctggggcgttcgaggccagttctgtggcccctgcctt cgaaaccgttatggtgaagaggtcagggatgctctgctggatccgaactggcattgcccg ccttgtcgaggaatctgcaactgcagtttctgccggcagcgagatggacggtgtgcgact ggggtccttgtgtatttagccaaatatcatggctttgggaatgtgcatgcctacttgaaa agtttccattctgattgtaccggggtacatcctcaggcaattaaatcggccaggttaaga gggaaggatcagtgtacccagaccacaaaagcaaacaagaaaaaaagacttgaggacaca caaccagggggattcctgcagccgtcggagcaggatcaagattacagcattcttttcccc aaagacatagagccatggcctgctgatgagggactagctaaaccaggttacagcctccca ccccatgggcagctggaatctgggaaccagccgaaacaccctctgaagcctcagcatagc ccttccctgaaaagtgtctctcaatag >gi568815596f:173258711_173467286|GENSCAN_predicted_peptide_6|119_aa MPDPEPTLNFMEEKDVPKVCSTSRRVFFRMLSVVVLKQLEEREGEKERRASNALSLDQEH LVRIRNAGFGVNLVLSYIISHKLPATGNLPIDNSPHFPDAEQAREPVGENQEYRARNIQ >gi568815596f:173258711_173467286|GENSCAN_predicted_CDS_6|360_bp atgccagacccagagcccactctgaatttcatggaggagaaagatgtacctaaggtttgc tccacatcacggcgtgtcttctttcgcatgctctcggtggtcgtgttaaagcaactagag gaaagggaaggggagaaagaacgtagagcctcaaatgccctaagtttggatcaagagcac cttgttagaattaggaatgctggatttggtgtgaacttggtgctgagttacatcatcagc cataagcttccagccactggaaatcttcctatagacaacagtcctcacttcccagatgca gagcaagccagagaacctgtaggagaaaaccaggaatatagagccagaaacatacaatga >gi568815596f:173258711_173467286|GENSCAN_predicted_peptide_7|422_aa MTTTTTFKGVNPNSRNSSRVLRLLGGGSNFSLGFDEPTEQPVRKNKMSSSIFGTPEENPP SWATLAENTNKNKNQFLPSWWGQMVKPASTLRITSTMREAEPKRYVGTKEGRRRSQDQPL MGAASREECGLGQGRGFDSTVSSWDGEKFSVGTESFSRPEGLERKRIREAKGLLVAQKGS SMAGAKDDLWETVGGNEIREDGIHPVIATNNTGQGHEAPQLSSPVLSHIITTMGQPERRE LRDSSFVTGLPPSCLPCVIAHTHNPSTLGGQEQAVGEFPEVEGGKEKETGGMSSVYEHVQ GMGAREGFTSPGKAQGTQSESSRAQQCPELTDGIRKSSSSQPQAVFKIGVVAPPAKATLP TDLVTNLGDQRTLISKGLDPVKQRSVATDRFPESELPLLESKARRNPVWTYFIPQDRAAL IE >gi568815596f:173258711_173467286|GENSCAN_predicted_CDS_7|1269_bp atgaccaccaccactaccttcaagggtgttaaccccaacagcaggaatagctcccgggtt ttgcggctgctgggtggtggatccaatttttcattgggttttgatgaaccaacagaacaa cctgtgagaaagaacaaaatgtcctctagtatctttgggacacctgaagaaaatccacct tcttgggccacattggcagagaatacgaataagaataagaaccaatttctaccttcttgg tggggacaaatggttaaaccagcaagtacactaaggattactagtaccatgagagaagca gaaccgaagcgctatgtgggaacgaaggaaggaaggaggaggagtcaggaccagccattg atgggggctgcctccagagaggagtgtggccttggccaaggaagagggtttgacagcact gtgagctcatgggatggggagaagttttctgttggaactgaaagtttttctaggcctgag ggcctagagaggaagaggatacgtgaggcgaaaggtctgctcgtggcacagaagggtagc agcatggctggagccaaggatgatttgtgggagacagttggaggaaatgagatcagagaa gatggaatacaccctgtaatagctaccaacaatactggacaaggacatgaggctccacag ctttcatctcctgtgctatctcacattatcacaacaatgggccaaccagagaggagggaa ttgagggattcgagctttgtaactgggctccctccttcctgtctcccttgtgtgatagct cacacccataatcccagcactttaggaggccaagagcaggctgtaggagagttccctgag gtagaaggaggaaaagaaaaggagactgggggaatgagctctgtgtatgagcatgttcag gggatgggagcaagagaggggtttacatctcctgggaaggcccagggcacccaaagtgag agcagcagggcgcagcagtgcccagagctgactgatggcatcaggaaatctagctcatcc caaccacaggcagtttttaaaattggggttgtggctccccctgctaaagcgacattaccc acggaccttgttaccaaccttggcgaccagagaacgctgatctccaaaggtctagacccg gtgaaacagcgttcggtggccactgaccggtttccagagtcagagctccccctgctggaa tccaaagcacgtcgtaaccctgtctggacgtattttattccacaagaccgtgcagccctg attgagtga >gi568815596f:173258711_173467286|GENSCAN_predicted_peptide_8|176_aa MGVWLAKELAYKVKAAQESQVSPQASCVHSPRKCSLEATDSLCRKREGARAGELGQRVAA ELPAGGVMGKGGQGDGALIEPVAMAVELTRDKWKEWNLLIKQSIRQIGCCIPLQGCHNKV PQTGQLQQQRPNCLTVLEALPFSKQGEIQYRSRATVIQWCHITMAIAAQGRNAEQL >gi568815596f:173258711_173467286|GENSCAN_predicted_CDS_8|531_bp atgggggtctggcttgcaaaggagctcgcatataaagtgaaagctgctcaggagagccag gtctccccacaggcgtcctgtgttcacagtcctcgtaaatgttctctggaggccacagat tcattatgtcggaaaagggagggagccagggctggggagctggggcagcgtgtggcagca gaacttcctgctgggggtgtgatggggaaagggggccaaggagatggggctctaattgag ccagttgcaatggcggtggagctcaccagggataagtggaaagaatggaaccttctgatt aaacaatctattagacaaataggctgctgtattcctctgcaaggctgccataacaaagta ccacaaactgggcagcttcaacagcagagacctaattgtctcacggttctggaagcattg cccttttcaaaacaaggggagattcagtatcgttcccgtgccacagtgatccagtggtgt cacatcactatggcaattgctgcccagggcaggaatgcagagcagctgtaa >gi568815596f:173258711_173467286|GENSCAN_predicted_peptide_9|40_aa XPKNRLLPNGINRCEKTAGDNAVLLSLSQCRRESSNRIFN >gi568815596f:173258711_173467286|GENSCAN_predicted_CDS_9|123_bp nggccaaaaaaccgcctcctacctaacggcataaataggtgtgagaaaactgcaggggac aacgctgttctgttatccctgtcacagtgccgcagggaaagcagcaaccggatctttaat tga