GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:33:03 Sequence gi568815590r:21593281_21888159 : 294879 bp : 49.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5553 5681 129 2 0 78 119 59 0.769 8.15 1.02 Term + 8073 8162 90 2 0 112 48 99 0.994 6.02 1.03 PlyA + 9803 9808 6 1.05 2.05 PlyA - 10444 10439 6 1.05 2.04 Term - 13141 12978 164 2 2 45 44 109 0.268 0.30 2.03 Intr - 21182 21115 68 1 2 86 115 17 0.051 2.75 2.02 Intr - 31994 31943 52 0 1 62 110 63 0.079 3.87 2.01 Init - 36955 36883 73 1 1 84 24 145 0.213 6.94 2.00 Prom - 41441 41402 40 -6.76 3.04 PlyA - 44661 44656 6 1.05 3.03 Term - 46972 46872 101 2 2 83 39 89 0.749 1.79 3.02 Intr - 53464 53390 75 2 0 113 95 50 0.845 7.69 3.01 Init - 54269 54137 133 2 1 56 32 128 0.342 4.30 3.00 Prom - 72311 72272 40 -4.06 4.00 Prom + 80306 80345 40 -4.26 4.01 Init + 82578 82695 118 2 1 91 33 64 0.143 1.56 4.02 Intr + 91611 91689 79 1 1 85 84 32 0.813 1.11 4.03 Intr + 92050 92180 131 1 2 78 36 107 0.505 4.84 4.04 Intr + 94314 94414 101 1 2 68 31 57 0.104 -2.17 4.05 Intr + 99176 99389 214 1 1 100 66 100 0.741 7.29 4.06 Term + 101803 101996 194 2 2 61 39 167 0.791 6.68 4.07 PlyA + 104052 104057 6 1.05 5.08 PlyA - 104713 104708 6 1.05 5.07 Term - 105030 104883 148 2 1 111 47 62 0.583 1.77 5.06 Intr - 106445 106411 35 2 2 106 52 26 0.435 -2.18 5.05 Intr - 109697 109525 173 0 2 87 94 227 0.817 22.86 5.04 Intr - 111845 111705 141 0 0 116 94 218 0.999 25.52 5.03 Intr - 112761 112652 110 2 2 71 99 169 0.999 16.23 5.02 Intr - 116660 116544 117 1 0 80 77 35 0.182 1.18 5.01 Init - 127159 127071 89 1 2 78 113 -5 0.129 1.12 5.00 Prom - 132218 132179 40 -2.36 6.00 Prom + 134605 134644 40 -6.76 6.01 Init + 135182 135230 49 1 1 63 89 35 0.372 0.21 6.02 Intr + 136495 136633 139 2 1 68 56 77 0.300 2.02 6.03 Intr + 137315 137369 55 2 1 104 79 45 0.328 4.18 6.04 Intr + 147522 147579 58 2 1 65 117 117 0.056 10.76 6.05 Term + 152446 152624 179 2 2 62 47 97 0.842 0.85 6.06 PlyA + 153690 153695 6 1.05 7.10 PlyA - 154188 154183 6 -0.45 7.09 Term - 157662 157304 359 1 2 149 48 692 0.679 65.97 7.08 Intr - 174903 174745 159 1 0 115 43 68 0.457 4.96 7.07 Intr - 175298 175224 75 0 0 96 89 23 0.643 2.69 7.06 Intr - 181775 181692 84 0 0 105 111 83 0.853 12.09 7.05 Intr - 189578 189305 274 2 1 110 98 527 0.935 53.01 7.04 Intr - 193587 193557 31 2 1 105 75 21 0.051 0.63 7.03 Intr - 206828 206753 76 0 1 61 100 37 0.008 0.77 7.02 Intr - 216560 216401 160 2 1 66 91 56 0.158 3.26 7.01 Init - 219037 218951 87 1 0 107 91 73 0.824 10.14 7.00 Prom - 219581 219542 40 -7.96 8.00 Prom + 224377 224416 40 -4.36 8.01 Sngl + 233343 233669 327 2 0 73 44 335 0.828 23.61 8.02 PlyA + 233753 233758 6 1.05 9.00 Prom + 234754 234793 40 -5.36 9.01 Sngl + 234847 236262 1416 0 0 49 42 522 0.980 39.64 9.02 PlyA + 236271 236276 6 1.05 10.00 Prom + 237978 238017 40 -2.46 10.01 Init + 253692 253749 58 2 1 77 37 123 0.643 5.58 10.02 Intr + 254349 254527 179 1 2 79 97 49 0.342 4.54 10.03 Intr + 254603 254686 84 1 0 60 82 37 0.123 0.32 10.04 Intr + 262719 262782 64 2 1 109 75 59 0.101 4.99 10.05 Intr + 264903 265009 107 1 2 -1 81 70 0.096 -2.67 10.06 Term + 265489 265629 141 0 0 71 54 120 0.875 4.83 10.07 PlyA + 265981 265986 6 1.05 11.00 Prom + 267118 267157 40 -3.26 11.01 Init + 268342 268406 65 0 2 94 23 45 0.068 -0.78 11.02 Intr + 281175 281375 201 0 0 75 44 86 0.154 1.30 11.03 Term + 289586 289808 223 2 1 61 36 195 0.529 7.99 11.04 PlyA + 291653 291658 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 151840 151936 97 0 1 84 94 73 0.899 7.97 S.002 Init + 194537 194831 295 1 1 75 78 182 0.924 11.25 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:21593281_21888159|GENSCAN_predicted_peptide_1|72_aa MECYAAIKKNEFMSFAGTWMKLETIIFSKLTQEQKTKHCIFSLRHEYTHRVYKTDLKMQI GYIQFHPTVLEI >gi568815590r:21593281_21888159|GENSCAN_predicted_CDS_1|219_bp atggaatgctatgcagccataaaaaagaatgagttcatgtcctttgcagggacatggatg aagctagaaaccatcatcttcagcaaactaacacaggaacagaaaaccaaacactgcata ttctcactgagacatgagtacacgcaccgtgtgtacaaaaccgatttgaagatgcaaatt ggctacatccaatttcatccaactgtcctggaaatttaa >gi568815590r:21593281_21888159|GENSCAN_predicted_peptide_2|118_aa MASAFPVLASASCLIQVLACVAVAASRIFLVTDDLDCFEEYRKWQETCGVGTRQAICVLV TGTTVLESGKSKIKVLAADEDFLAASLHGRRQKGKKGTNAVFPNGKRSEEKIYSCKPF >gi568815590r:21593281_21888159|GENSCAN_predicted_CDS_2|357_bp atggcctctgctttccccgtgctggcttctgcctcctgcctgattcaggtgcttgcctgc gtggccgtggctgcttctcggattttcctcgttactgatgaccttgactgttttgaggag tacagaaagtggcaggagacatgcggtgtgggcacccggcaagccatttgtgttcttgtt actggtaccacagttctggagtctgggaagtccaagatcaaggtactggcagctgatgag gattttcttgctgcatccttacatggcagaaggcagaagggcaaaaaagggacaaatgct gtgttcccaaatggtaaaagatcagaagagaaaatctactcctgcaagcccttttaa >gi568815590r:21593281_21888159|GENSCAN_predicted_peptide_3|102_aa MPNTELPGLAMLRVLDEEELEKETERAKPAKWENSEKTVVFWRAVRKHILWEVFHDRLTS SICLRGIPLGEEIDVEKIFGNMLKATQLAAAERGFGPLSIWL >gi568815590r:21593281_21888159|GENSCAN_predicted_CDS_3|309_bp atgccaaacactgagctcccaggcttggccatgttaagagttctggatgaagaagaactg gaaaaggagactgagagagccaaaccagcaaaatgggagaacagcgagaagactgtggtg ttctggagggctgtcaggaaacacatcctctgggaagtcttccatgaccgtttaacatct agcatctgcctccgtggcattcctctaggtgaagaaatagatgttgagaagatatttggt aacatgctgaaagccacacagctagcagcagcagaacggggatttggacccctgtctatt tggctgtaa >gi568815590r:21593281_21888159|GENSCAN_predicted_peptide_4|278_aa MPRQSGTKVDSAKGQALSVSMGYTGWVQALTFFTANFRDGYLPSLLAPLDRMQDGSSQLS PPPTPIESVPIPDLEQTFQGSEKFSDLADVTSCWMAKPAFEPGSPDTPSGTAQKLHFTLK EECGKYVWLIDTLHGLDCVNNIFGGLPKDPRESKEANKGPFLSDFKISATEKEAGAQVLM PRHKGSPRQAELSSPPGAGPQPQHSVLQSGGAQHGGYLATAPAHFCPTRMCTPGNTVGKV RDEKGCLGNDSKIVYPGVATWMQGQRAHMMNGPLLSSL >gi568815590r:21593281_21888159|GENSCAN_predicted_CDS_4|837_bp atgccacgtcagtcagggacaaaggtggactcagcgaaaggccaagccctctcagtttcc atgggctacactggttgggttcaggctctgaccttcttcacagcaaacttcagagacggc tacctgccttctctccttgcacctctggaccgcatgcaggacggttcctcccagctttct cctccaccaactcccatagaaagtgtgcccatccctgacctagagcaaactttccaaggc tcagagaagttcagtgacctagctgatgtcaccagctgctggatggcaaagccagcattt gaacctggatcccctgatactccttctggtacagcacaaaaactccacttcacactaaaa gaggaatgtggcaaatatgtctggttaattgacactctgcatggccttgattgtgtaaac aacatctttggaggcctgccaaaggacccacgagaaagcaaagaagccaacaaaggaccg tttctctctgacttcaaaatttctgccaccgagaaagaagctggtgcccaagttttaatg cctcgccacaagggctctcccaggcaggctgagctctcaagccctcctggagctggcccc cagccccaacactccgtgctacagtcagggggagcccagcacggtggatatctggcaact gcaccagcccatttctgtcctacacggatgtgcacacctgggaacactgtgggcaaggtg agggacgagaagggctgccttggtaatgattccaagattgtctaccctggtgtggccacc tggatgcagggccagagggcgcacatgatgaacggccctctgttatcatcactgtaa >gi568815590r:21593281_21888159|GENSCAN_predicted_peptide_5|270_aa MVPGTSWESRLLFPETPCPLLWMRSSVTEREIFGISQRAKEQEAMYSPFMLLFLLQRVGL EAFEKLHGWSRLADFHANCRASYQTVTSCPADNYQACLGSYAGMIGFDMTPNYVDSSPTG IVVSPWCSCRGSGNMEEECEKFLRDFTENPCLRNAIQAFGNGTDVNVSPKGPSFQATQAP RVEKTPSLPDDLSDSTSLGTSVITTCTSVQVPSVRRTGKLQSAICKGQFFDASGAPTAQP SGVSSSHSPCAASMVEDLLLAALCPAILAA >gi568815590r:21593281_21888159|GENSCAN_predicted_CDS_5|813_bp atggtccctggtacttcttgggaatctcggttactcttcccagaaactccttgccccctt ttatggatgaggagctcagtcacagaaagggaaatatttgggatttcccaaagggcaaag gaacaggaagccatgtactccccattcatgctgctgttccttctccagcgtgtggggctg gaggcatttgagaagctgcatggatggtcccggctggccgacttccatgccaattgtcga gcctcctaccagacggtcaccagctgccctgcggacaattaccaggcgtgtctgggctct tatgctggcatgattgggtttgacatgacacctaactatgtggactccagccccactggc atcgtggtgtccccctggtgcagctgtcgtggcagcgggaacatggaggaggagtgtgag aagttcctcagggacttcaccgagaacccatgcctccggaacgccatccaggcctttggc aacggcacggacgtgaacgtgtccccaaaaggcccctcgttccaggccacccaggcccct cgggtggagaagacgccttctttgccagatgacctcagtgacagtaccagcttggggacc agtgtcatcaccacctgcacgtctgtccaggtgccctctgtgcgcaggactgggaagctg cagagtgccatctgcaagggccagttctttgatgcttctggggcacccacagcacagccc tcaggggtcagctcttcccacagtccctgtgctgcctccatggtggaggatttgctgtta gcagccttgtgtcctgccatccttgcagcctag >gi568815590r:21593281_21888159|GENSCAN_predicted_peptide_6|159_aa MWFHHVGQAGLQLLTSGLLRVTRERLSSPLFQTMRIKGPHPALTCLELLFLGLRSPNTKP NSWCLSSTIALCISHTYKRTQAFAISECYTITTKLLKPEAGQLSTFVDSGDSDTWVGKSG LQECRKKPLLSFQARPREEPGEKGPSNTCSHTIHPDEAG >gi568815590r:21593281_21888159|GENSCAN_predicted_CDS_6|480_bp atgtggtttcaccatgttggccaggctggtctccaactcctgacctcagggcttctcaga gtcaccagagaacgactctcctctccactcttccagaccatgagaattaagggaccacat cccgccctgacctgcctggagctcctgtttctgggcctaagaagcccaaatactaaaccg aacagctggtgcctctccagcaccatcgccctctgtatctctcacacatacaaacgcaca caggccttcgccatctcagaatgttacaccatcaccaccaagctgctcaagccagaagcc ggacaactgagcacttttgtggattctggggacagtgacacttgggttggcaagtcaggc ctgcaagagtgtagaaagaagccactcctctccttccaggcccggcccagggaagagcct ggtgagaagggcccttcaaatacctgctcccacaccatccaccctgatgaggctggctga >gi568815590r:21593281_21888159|GENSCAN_predicted_peptide_7|434_aa MDLELERDNAPSGRQKLADEGRVAAHRCQLKNSPIPGAIGLPKEVTARLFFKDQTEAANS RAIGSQTATQKCGQHSAMMQGPGSVYKTVFQGGSWLKGNGRADSFGGSRGSGKATSGEGP ELHGWRPPVDCVRANELCAAESNCSSRYRTLRQCLAGRDRNTMLANKECQAALEVLQESP LYDCRCKRGMKKELQCLQIYWSIHLGLTEGEEFYEASPYEPVTSRLSDIFRLASIFSGPE AGDPAFVQAPPALLGLLSKRIDGRIQLMAKEQVWGAREWGLWRRGEVDCQEVERGDPVTL TALGAARRKAASLSAGTGADPVVSAKSNHCLDAAKACNLNDNCKKLRSSYISICNREISP TERCNRRKCHKALRQFFDRVPSEYTYRMLFCSCQDQACAERRRQTILPSCSYEDKEKPNC LDLRGVCRTDHLCR >gi568815590r:21593281_21888159|GENSCAN_predicted_CDS_7|1305_bp atggatctggagttggaacgggacaatgcgccatcaggccggcagaagctggcagatgag ggccgagttgcggcacacaggtgccagctaaagaactctcccatccctggtgccattggg cttcccaaggaagttactgcaaggctgtttttcaaagaccaaacagaggctgcaaactca agagccattggcagccagacagcaacacaaaagtgtgggcaacacagtgcgatgatgcag gggccaggatccgtttacaagactgtcttccaaggtggcagctggctgaaaggaaatggg agagcagactcatttggtggaagtaggggatctgggaaggccacgagtggggagggcccc gagctccacggctggcgccccccagtggactgtgtccgggccaatgagctgtgtgccgcc gaatccaactgcagctctcgctaccgcactctgcggcagtgcctggcaggccgcgaccgc aacaccatgctggccaacaaggagtgccaggcggccttggaggtcttgcaggagagcccg ctgtacgactgccgctgcaagcggggcatgaagaaggagctgcagtgtctgcagatctac tggagcatccacctggggctgaccgagggtgaggagttctacgaagcctccccctatgag ccggtgacctcccgcctctcggacatcttcaggcttgcttcaatcttctcaggcccggag gcaggagacccagcctttgtgcaggcaccccctgctctccttgggctgctgagcaaaaga atagacgggagaattcagctgatggccaaggagcaggtgtggggtgcaagagagtggggg ctatggagacgaggagaggtggactgtcaggaggtggagagaggtgacccagtgactctc acggcgcttggggccgccaggaggaaggcagcttccctctccgcagggacaggggcagac ccggtggtcagcgccaagagcaaccattgcctggatgctgccaaggcctgcaacctgaat gacaactgcaagaagctgcgctcctcctacatctccatctgcaaccgcgagatctcgccc accgagcgctgcaaccgccgcaagtgccacaaggccctgcgccagttcttcgaccgggtg cccagcgagtacacctaccgcatgctcttctgctcctgccaagaccaggcgtgcgctgag cgccgccggcaaaccatcctgcccagctgctcctatgaggacaaggagaagcccaactgc ctggacctgcgtggcgtgtgccggactgaccacctgtgtcggtga >gi568815590r:21593281_21888159|GENSCAN_predicted_peptide_8|108_aa MGKKQSRKTGNSKKQSTCPPPKERSSSPATEQSWTENDFDELREGFRRSNYSELREDIQT KGKEVENFEKHLEECITRITNTEKCLKELMELKTKARELREECRSLRS >gi568815590r:21593281_21888159|GENSCAN_predicted_CDS_8|327_bp atgggaaaaaaacagagcagaaaaactggaaactcgaaaaagcagagcacctgtcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggacggaaaatgactttgac gagctgcgagaaggcttcagacgatcaaattactccgagctacgggaggacattcaaacc aaaggcaaagaagttgaaaactttgaaaaacatttagaagaatgtataactagaataacc aatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaactacgt gaagaatgcagaagcctcaggagctga >gi568815590r:21593281_21888159|GENSCAN_predicted_peptide_9|471_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQVDLIDIYRTLHPKSTEYTFLSAPHHTYS KIDHILGSKALLSKCKGTEIITNYLSDHSAIKLELRIKKLTQNHSTTWKLNNLLLNDYWV HNKMKAEIKMFFETNKNKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK EQEKQEQTHSKASRRQEITKVRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITRSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKE GILPNSFYEASIILIPKSGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHD QVGFIPGMQGWFNIHKSINVIQHINRTKDKNDMIISIDAEKAFDKIQQPSC >gi568815590r:21593281_21888159|GENSCAN_predicted_CDS_9|1416_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagtggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttctttcagcaccacaccacacctattcc aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaggaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc actcaaaaccactcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacaaaatgaaggcagaaataaagatgttctttgaaaccaacaagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaacaagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa gtcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccactagcaagactaataaag aaaaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccacc gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaatggataaattcctcgacacatacaccctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaagatctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagtcgggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaaggctggttcaatatacacaaatcaataaatgta atccagcatataaacagaaccaaagacaaaaacgacatgattatctcaatagatgcagaa aaggcctttgacaaaattcaacaaccttcatgctaa >gi568815590r:21593281_21888159|GENSCAN_predicted_peptide_10|210_aa MPAAQWGEQLQVPARAPALSISREGSSSLQLVLQSSPLLWLNWPQRRIVYADLFMGSHGR AQEKAPRVPHPVHRTGSLKAACIKEHLQASIEPPSLRREIPPHLISKVIPLLIPGCGQSP ITIARDLRVGKQVFVIDTGAESFLSGSEILPWEKNWRLQALVEKGHPMRCGMFSGIPGLY PLANSKPPDVTPKMSPDIAECPLGGKIFPS >gi568815590r:21593281_21888159|GENSCAN_predicted_CDS_10|633_bp atgccggctgctcagtggggcgagcagctccaggtgccggcacgggcgccggctctctct atcagcagagagggtagctcctctctgcagctggtcctccagtcatctcccctgctctgg ctgaactggccccagaggaggatagtgtatgccgatttgttcatgggcagccatgggcgg gcacaggaaaaagcaccacgagttccccatccagtccacaggactggcagcctgaaggct gcttgtatcaaggagcatctgcaggccagcattgagccgccctcactaagacgtgagatc cctcctcatttaatctcaaaggtgatccctctgctcatccctggatgtggccagtcccct attacaattgcccgggacctcagagtgggtaaacaggtctttgtcattgacacaggagca gagtcctttctgtctggctctgaaatcctgccatgggagaaaaactggaggcttcaggca cttgtggagaagggccatcccatgcgctgtgggatgtttagtggcatccctggcctctac ccacttgctaatagcaagcccccagatgtgacaccaaaaatgtctccagatattgctgaa tgtcccctgggaggcaaaatcttccccagttga >gi568815590r:21593281_21888159|GENSCAN_predicted_peptide_11|162_aa MHIPDCSAQGAPVASSNLRPAGWNWHAPKLENHVPKVSGLLQQLRNCQVMQGQGEEEDPE AASHHDMRKERQSPQVTSRELGTVTPGTALDPEAFSISVNGYSILAAQAKNPGVSLKPSW QLMQSVNPTGSTFRVDLELDHFSPFATFTTATCTSAAITLSG >gi568815590r:21593281_21888159|GENSCAN_predicted_CDS_11|489_bp atgcacattccagactgcagtgcccaaggtgctcccgttgccagcagcaacctcagacct gcagggtggaactggcatgctccaaaacttgagaaccatgtcccaaaggtgtctggtctg ctgcagcaactcagaaactgccaagtgatgcagggacagggagaggaagaagacccagag gcagcatctcaccatgacatgcgcaaggagaggcagtccccacaagtaaccagcagagag ctaggaacagtcacacctggaacagcgcttgaccctgaagccttctccatctcagtcaat ggctactccattcttgcagcccaggccaaaaaccctggagtcagcctgaaaccttcctgg cagctcatgcagtcagtaaaccccactggatctaccttccgagtagatctggaattggat catttctcaccattcgccaccttcaccactgccacctgcacttcagctgccatcaccttg tctggatag