GENSCAN 1.0	Date run: 5-Nov-116	Time: 19:23:15

Sequence gi568815593f:69404245_69653668 : 249424 bp : 43.11% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  15142  16287 1146  0  0   18  107   620 0.942  50.82
 1.02 Intr +  20357  20392   36  1  0   97   95    13 0.669   1.36
 1.03 Intr +  28283  28431  149  1  2   68   69   122 0.791   7.33
 1.04 Intr +  28678  28849  172  1  1   86   92   195 0.985  19.65
 1.05 Intr +  55278  55311   34  2  1   99   94    87 0.709   8.20
 1.06 Term +  66677  66891  215  0  2    3   34   160 0.249  -0.51
 1.07 PlyA +  69857  69862    6                               1.05

 2.00 Prom +  73153  73192   40                              -2.06
 2.01 Sngl +  89591  89935  345  1  0  111   51   191 0.644  11.74
 2.02 PlyA +  92228  92233    6                               1.05

 3.00 Prom +  97472  97511   40                              -4.36
 3.01 Init + 104943 105575  633  2  0   74   64   469 0.563  38.45
 3.02 Intr + 109704 109865  162  2  0   95  106   147 0.955  17.37
 3.03 Intr + 130450 130595  146  0  2   67   97   125 0.778  10.38
 3.04 Intr + 140660 140875  216  2  0   60   86   141 0.766   8.72
 3.05 Intr + 143686 143857  172  1  1   79   54   216 0.884  17.25
 3.06 Intr + 147300 147341   42  2  0   85   83    40 0.667   1.64
 3.07 Intr + 155660 155998  339  1  0   36  117   184 0.024  11.77
 3.08 Intr + 163483 163533   51  0  0   66   98    31 0.144   1.00
 3.09 Intr + 178160 178185   26  1  2   74   88     9 0.010  -3.68
 3.10 Intr + 181350 181453  104  0  2  115   83    41 0.101   6.12
 3.11 Intr + 216096 216215  120  1  0   50  109   135 0.980  12.27
 3.12 Intr + 227519 231619 4101  0  0   11   92  1084 0.366  88.52
 3.13 Term + 232041 232084   44  1  2   19   55    65 0.296  -6.38
 3.14 PlyA + 232220 232225    6                               1.05

 4.06 PlyA - 232427 232422    6                               1.05
 4.05 Term - 233652 233531  122  1  2   54   48    -2 0.025  -8.96
 4.04 Intr - 236122 235948  175  1  1   83  115   159 0.607  17.61
 4.03 Intr - 238045 237961   85  0  1  108   72    86 0.801   8.82
 4.02 Intr - 240729 240581  149  0  2   34   31   154 0.439   3.33
 4.01 Intr - 241825 241638  188  2  2  133   80   220 0.642  25.11

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  73431  73337   95  1  2   85   37   108 0.835   3.39
S.002 Term + 149326 149427  102  0  0  100   43    49 0.821  -0.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:69404245_69653668|GENSCAN_predicted_peptide_1|583_aa
MSNDGRSRNRDRRYDEVPSDLPYQDTTIRTHPTLHDSERAVSADPLPPPPLPLQPPFGPD
FYSSDTEEPAIAPDLKPVRRFVPDSWKNFFRGKKKDPEWDKPVSDIRYISDGVECSPPAS
PARPNHRSPLNSCKDPYGGSEGTFSSRKEADAVFPRDPYGSLDRHTQTVRTYSEKVEEYN
LRYSYMKSWAGLLRILGVVELLLGAGVFACVTAYIHKDSEWYNLFGYSQPYGMGGVGGLG
SMYGGYYYTGPKTPFVLVVAGLAWITTIIILVLGMSMYYRTILLDSNWWPLTEFGINVAL
FILYMAAAIVYVNDTNRGGLCYYPLFNTPVNAVFCRVEGGQIAAMIFLFVTMIVYLISAL
VCLKLWRHEAARRHREYMEQQEINEPSLSSKRKMCEMATSGDRQRDSEVNFKELRTAKMK
PELLSGHIPPGHIPKPIVMPDYVAKYPVIQTDDERERYKAVFQDQFSEYKELSAEVQAVL
RKFDELDAVMSRLPHHSESRQVLNDGKMLIHTEENLGNTIQDIGMGKEFMTKTPKTIATK
AKIEKWDLIKLKRFCTAKETIIRVNRQSTEWEKMFAIYPSDKV

>gi568815593f:69404245_69653668|GENSCAN_predicted_CDS_1|1752_bp
atgtcaaatgatggaagatccaggaatcgggacaggcgctacgatgaggtcccaagcgac
ctgccctatcaagataccaccataagaacccacccaactcttcatgacagtgagcgggca
gtgagcgctgatcccttgccaccaccccctctcccattacagccaccattcggcccagac
ttctactcaagtgacacagaagaaccagctatagcgccagatctcaaaccagtaaggcgc
tttgtccctgactcctggaagaactttttcagagggaagaaaaaggaccccgaatgggat
aagccggtgtctgatatcaggtacatctccgatggagtggagtgttcaccaccagcctct
ccagcaagaccaaaccaccgttcgcccctcaactcctgcaaagatccctacggagggtca
gaaggaacctttagttcccggaaagaggctgacgcagtgtttccccgggatccctatgga
tctctagaccgacacacacaaacagttcgaacatacagtgagaaggtggaggagtataac
ctgagatactcctacatgaagtcgtgggcaggcctgctgagaatactgggtgtggtggag
ctgcttttgggggccggtgtctttgcttgtgtcacagcttacattcacaaggacagtgag
tggtacaacttgtttggatattcacaaccgtatggcatgggaggcgttggtggattgggc
agtatgtatgggggctattactacactggccctaagaccccttttgtactcgtggttgct
ggattagcttggatcaccaccattattattctggttcttggcatgtccatgtattaccgg
accattcttctggactctaattggtggcccctaactgaatttggaattaacgttgccttg
tttattttgtatatggccgcagccatagtctatgtgaatgataccaaccgaggtggcctc
tgctactatccgttatttaatacaccagtgaatgcagtgttctgccgggtagaaggagga
cagatagctgcaatgatcttcctgtttgtcaccatgatagtttatctcattagtgctttg
gtttgcctaaagttatggaggcatgaggcagctcggagacatagagaatatatggaacaa
caggagataaatgagccatcattgtcatcgaaaaggaaaatgtgtgaaatggccaccagt
ggtgacagacaaagagactcagaagttaatttcaaggaactgagaacagcaaaaatgaaa
cctgaactactgagtggacacatccccccaggccacattcctaaacctatcgtgatgccc
gactatgtggcaaaataccctgtgattcagacagatgatgagcgagaacgctataaagct
gtgttccaagaccagttttcagagtacaaagagctgtctgcagaagttcaggctgtcctg
aggaagtttgatgagctggatgcagtgatgagcagattgccacatcattcggaaagccga
caggtcctcaacgacgggaagatgctcattcacacagaagaaaacctaggcaataccatt
caggacataggcatgggcaaagagttcatgactaaaacaccaaaaacaattgcaacaaaa
gccaaaattgagaaatgggatctaatcaaactaaagcgcttctgcacagcaaaagaaact
atcatcagagtgaacaggcaatctacagaatgggagaaaatgtttgcaatctatccatct
gacaaggtctaa

>gi568815593f:69404245_69653668|GENSCAN_predicted_peptide_2|114_aa
MASAPRGWERRRAPRAPREATGPEGKAAASLGRTRGSLATLPAWGITLLEEEAAAFQASQ
CVALRQQVYYWDTWKRETFPMKALMVFFGDSWCDRKLGVPKCNNSNSSSFALYI

>gi568815593f:69404245_69653668|GENSCAN_predicted_CDS_2|345_bp
atggcctctgcgcctaggggttgggagcggcgccgagcccctcgcgctcctcgggaagcc
accgggcccgagggaaaagccgcggcatccttaggccggacccggggctcgctggccaca
ctgcccgcttggggaattaccctgctcgaggaggaggcggcggctttccaggccagtcag
tgtgtggcccttaggcaacaggtgtattattgggatacctggaaaagagaaacgtttccc
atgaaggcacttatggtgttttttggtgattcttggtgtgatagaaaactaggcgttccc
aagtgtaacaatagtaacagtagttcttttgcactgtacatatga

>gi568815593f:69404245_69653668|GENSCAN_predicted_peptide_3|2051_aa
MHVRPMLSQPAYSFYPEDEILHFYKWTSPPGVIRILSMLIIVMCIAIFACVASTLAWDRG
YGTSLLGGSVGYPYGGSGFGSYGSGYGYGYGYGYGYGGYTDPRAAKGFMLAMAAFCFIAA
LVIFVTSVIRSEMSRTRRYYLSVIIVSAILGIMVFIATIVYIMGVNPTAQSSGSLYGSQI
YALCNQFYTPAATGLYVDQYLYHYCVVDPQEAIAIVLGFMIIVAFALIIFFAVKTRRKMD
RYDKSNILWDKEHIYDEQPPNVEEWVKNVSAGTQDVPSPPSDYVERVDSPMAYSSNGKVN
DKRFYPESSYKSTPVPEVVQELPLTSPVDDFRQPRYSSGGNFETPSKRAPAKGRAGRSKR
TEQDHYETDYTTGGESCDELEEDWIREYPPITSDQQRQLYKRNFDTGLQEYKSLQSELDE
INKELSRLDKELDDYREESEEYMAAADEYNRLKQVKGGPSSQPFRSRKTRSSSSEPRSMA
AEPSTFRERMAITLSGSLRIPAGTPSLSAPQLSGGHALHLSAPLRQLHSAASGTFRRGRR
ARRNDFRGAPKAAERSVGLSFRLRVLLAAPLLEYFVEEYFDQNPISQDAKPSFSMAHLDG
NTEPGLTLGGYFCPQCRAKYCELPVECKICAYCNDSIFAYEELRLDSFKDWPRESAVGVA
ALAKAGLFYTGHPTNTRTLDISGVKTTLRRFSVILQDLCSILVILSSTSLHRYHSKPVLC
ALSSRASPSSATTLSTRTLWGPGAGSHPFGVHNTRLSPDLCPGKIVLRALKESGAGMPEQ
HKDPRVQENPDDQRTVPEVTGDARSAFWPLRDNGGPSPFVPRPGPLQTDLHAQSSEIRYN
HTSQTSWTSSSTKRNAISSSYSSTGGLPGLKQRRGPASSRCQLTLSYSKTVSEDRPQAVS
SGHTRCEKGADTAPGQTIAPTGGSPRSQDSRPRRRKIPLLPRRRGEPLMLPPPLELGYRV
TAEDLHLEKEKAFQRINSALHVEDKAISDCRPSRPSHTLSSLATGASGGPPVSKAPTMDA
QQDRPKSQDCLGLVAPLASAAEVPATAPVSGKKHRPPGPLFSSSDPLPANSSHSRDSAQV
TSMIPVPLTAASRDAGMRRTRSAPAAAAAAPPPSTLNPTSGSLLNAVDGGPSHFLASATA
AARVQRSEVRYNQRSQTSRTRSCLKRNASSSSHSSTEGLQEVKRRRGPASSHCQLAHSSS
NTVSEDGPQAVSSGHRCENKAGTAPGQTLAPRGGSPRSQASRPHINTALHVEDKAISDCR
PSRPSHTLSSLATEASGGPPVSKAPTMDAQQDRPKSQDCLGLVAPLASAAEVPSTAPVSG
KKHRPPGPLFSSSDPLPATSSHSRDSAQVTSLIPATFTAASRDAGMRRTRSAPAAAAAAP
PPSTLNNTSGSLLNAVDGGPSHFLASATAAARAQRSEVRYNQRSQTSRTRSCLKRNASSS
SSSHSSTEGLQELKRRRGPASSHCQLAHSSPNTVSEDGPQAVSSGHRCENKAGTAPGQTL
APRGGSPRSQASRPHINSALHVEDKAISDCRPSRPSHTLSSLATGASGGPPVSKAPTMDA
QQDRPKSQDCLGLVAPLASAEEVPSTAPVSGKKHRPPGPLFSSSDPLPATSSHSRDSAQV
TSLIPATFTAASRDAGMRRTRPGTSAPAAAAAALPPSTLNPTSGSLLNAVDGGPSHFLAS
ATAAARAQRSEVRYNQRSQTSRTRSCLKRNASSSSHSSTEGLQEVKRRRGPASSHCQLAH
SSSNTVSEDGPQAVSSGHRCENKAGTAPGQTLAPRGGSPRSQASRPRINSALHVEDKAIS
DCRPSRPSHTLSSLATGASGGPPVSKAPTMDAQQHRPKSQDCLGLLAPLASAAEVPSTAP
VSGKKHRPPGPLFSSSDPLPATSSHSRDSAQVTSLIPAPFTAASSDAGMRRTRPGTSAPA
AAAAAPPPSTLNPTSGSLLNAVDGGPSHFLASATAAARVQRSEVRYNQRSQTSRTRSCLQ
GNASSSSHSSTEGLPQLKRRRGPASSYCQLAHSSSNTVSEDGPQAVSSGHTRCEKKAGMP
ASEECFVFEIV

>gi568815593f:69404245_69653668|GENSCAN_predicted_CDS_3|6156_bp
atgcatgttcgaccaatgctctctcagccagcctactctttttacccagaagatgaaatt
cttcacttctacaaatggacctctcctccaggagtgattcggatcctgtctatgctcatt
attgtgatgtgcattgccatctttgcctgtgtggcctccacgcttgcctgggacagaggc
tatggaacttcccttttaggaggtagtgtaggctacccttatggaggaagtggctttggt
agctacggaagtggctatggctatggctatggttatggctatggctacggaggctataca
gacccaagagcagcaaagggcttcatgttggccatggctgccttttgtttcattgccgcg
ttggtgatctttgttaccagtgttataagatctgaaatgtccagaacaagaagatactac
ttaagtgtgataatagtgagtgctatcctgggcatcatggtgtttattgccacaattgtc
tatataatgggagtgaacccaactgctcagtcttctggatctctatatggttcacaaata
tatgccctctgcaaccaattttatacacctgcagctactggactctacgtggatcagtat
ttgtatcactactgtgttgtggatccccaggaggccattgccattgtactggggttcatg
attattgtggcttttgctttaataattttctttgctgtgaaaactcgaagaaagatggac
aggtatgacaagtccaatattttgtgggacaaggaacacatttatgatgagcagcccccc
aatgtcgaggagtgggttaaaaatgtgtctgcaggcacacaggacgtgccttcaccccca
tctgactatgtggaaagagttgacagtcccatggcatactcttccaatggcaaagtgaat
gacaagcggttttatccagagtcttcctataaatccacgccggttcctgaagtggttcag
gagcttccattaacttcgcctgtggatgacttcaggcagcctcgttacagcagcggtggt
aactttgagacaccttcaaaaagagcacctgcaaagggaagagcaggaaggtcaaagaga
acagagcaagatcactatgagacagactacacaactggcggcgagtcctgtgatgagctg
gaggaggactggatcagggaatatccacctatcacttcagatcaacaaagacaactgtac
aagaggaattttgacactggcctacaggaatacaagagcttacaatcagaacttgatgag
atcaataaagaactctcccgtttggataaagaattggatgactatagagaagaaagtgaa
gagtacatggctgctgctgatgaatacaatagactgaagcaagtgaagggaggtccttca
tcccaaccttttagaagtagaaagacaagatcgagctcctcagaacccaggtcgatggct
gcagagccttcgaccttccgagagcgaatggcgatcactctttccggttctctgcgaatt
ccagctggaacaccgtccctttccgcgccccaactcagcggaggccatgccctgcacctg
agcgccccgctccggcagctgcactctgcagcatccggaacgtttcggcgtggccgcagg
gcgcggcggaatgacttccggggcgcccctaaagcggcggagaggagtgtcgggctgagt
ttccggctgagagtccttctagcggcgccgttgttggaatactttgtagaggaatatttt
gatcaaaatcctattagtcaggatgcaaaaccctctttcagcatggcgcatttggatggc
aatactgagccagggcttacattaggaggctatttctgcccacagtgtcgggcaaagtac
tgtgagctacctgttgaatgtaaaatctgtgcttattgcaatgacagcatctttgcttac
gaagaactacggctggactcttttaaggactggccccgggaatcagctgtgggagttgca
gcactggccaaagcaggtcttttctacacaggccatcctacaaacacccgcacactcgac
atcagtggtgtcaagacaactctaagaaggttttccgtgatcctgcaagacctgtgttcc
atcctggtgattctgtcttcaacttcactgcacaggtaccacagtaagccagtgctgtgt
gctctgagttccagggcatcccccagctcagccactacactgagcacaaggactctgtgg
ggcccaggagcaggtagtcacccctttggggtccacaacacccggctgtccccagacttg
tgtccagggaagatagtgttgagggccctcaaggagagcggggcagggatgcctgagcag
cacaaggaccccagagtccaagaaaatcctgatgatcagagaacggtccccgaggtcacc
ggggatgcacggtctgcattttggcccctgcgggacaatggaggcccctctccctttgtg
cccaggcccgggcctctgcagacagacctccacgcccagagctcagaaatcagatataac
cacacatcccagacatcctggacgagctcgagcaccaaacgaaatgccatctccagctcc
tacagctccacgggaggcttgccggggctaaagcagaggagggggccagcctcatcccgc
tgccagctgaccctcagttactcaaagacagtgagtgaggacaggcctcaggctgtctct
tcgggtcacacacggtgtgaaaagggggcagatacagcaccagggcagacaatcgcccca
acgggtggctcccccagatcccaggactctaggccccgtagacgcaagattcccctgctg
ccacgcaggcgaggggagcctttgatgctgccacctcccttagagctggggtaccgggtc
acggctgaagacctgcacctggaaaaagagaaggcattccagcgcatcaacagtgcactg
cacgttgaggacaaggccatctcggactgcagaccctcacggccttcccacactttgtcc
tcacttgcaacaggggcttcgggtgggcctcccgtttctaaagcacccactatggatgca
cagcaggacagacccaagtcccaagactgcctgggcctagtggcccccctagcatctgca
gcagaggtccccgctacagctcccgtgtctgggaagaagcacagaccaccaggacccctg
ttctcctcctcagatccccttccggccaactcttcccactcccgggactcagcccaggtc
acctcgatgattcctgtccccttgacagctgcaagcagggatgccggcatgagaagaaca
aggtcggctcctgcagctgccgcagcagcccctcccccctccacattgaaccccacgtcg
gggtcactactcaatgcagtggatggaggcccctcacatttcttggcctcagccacagct
gcagcacgtgtccagaggtcagaagtgagatataaccagagatcccagacctcccggacc
agatcgtgcctcaaacgaaatgccagctccagctcccacagctctacggaaggcctccag
gaagtaaagcggaggagggggccagcctcatcccactgccagctggcccacagttcctca
aacacagtgagtgaggatggacctcaggctgtctcttcgggtcaccgctgtgaaaacaag
gcaggtacagcaccagggcagacacttgcccccaggggtggctcccccagatcccaggcc
tctaggccccacatcaacactgcactgcacgttgaggacaaggccatctcggactgcaga
ccctcacgaccttcccacactttgtcctcacttgcaacagaggcttcgggtgggcctccc
gtttctaaagcacccactatggacgcacagcaggacagacccaagtcccaagactgcctg
ggcctagtggcccccctagcatctgctgcagaggtcccctctacagctcccgtgtctggg
aagaagcacagaccaccaggacccctgttctcctcctcagatccccttcctgccacctct
tcccactcccgggactcagcccaggtcacctcgctgattcctgccaccttcacagctgca
agcagggatgccggcatgagaagaacaaggtcggctcctgcagctgccgcagcagcccct
cccccctccacattgaacaacacgtcggggtcactactcaatgcagtggatggaggcccc
tcacatttcttggcctcagccacagctgcagcacgtgcccagaggtcagaagtgagatat
aaccagagatcccagacctcccggaccagatcctgcctcaaacgaaatgccagctccagc
tccagctcccacagctctacggaaggcctccaggaactaaagcggaggagggggccagcc
tcatcccactgccagctggcccacagttccccaaacacagtgagtgaggacggacctcag
gctgtctcttcgggtcaccgctgtgaaaacaaggcaggtacagcaccagggcagacactc
gcccccaggggaggctcccccagatcccaggcctctaggccccacatcaacagtgcactg
cacgttgaggacaaggccatctcggactgcagaccctcacggccttcccacactttgtcc
tcacttgcaacaggggcttcgggtgggcctcccgtttctaaagcacccactatggacgca
cagcaggacagacccaagtcccaagactgcctgggcctagtggcccccctagcatctgct
gaagaggtcccctctacagctcccgtgtctgggaagaagcacagaccaccaggacccctg
ttctcctcctcagatccccttcctgccacctcttcccactcccgggactcagcccaggtc
acctcgctgattcctgccaccttcacagctgcaagcagggatgccggcatgagaagaaca
aggcctggcacctcggctcctgcagctgccgcagcagcccttcccccctccacattgaac
cccacgtcggggtcgctactcaatgcagtggatggaggcccctcacatttcttggcctca
gccacagctgcagcacgtgcccagaggtcagaagtgagatataaccagagatcccagacc
tcccggaccagatcctgcctcaaacgaaatgccagctccagctcccacagctctacggaa
ggcctccaggaagtaaagcggaggagggggccagcctcatcccactgccagctggcccac
agttcctcaaacacagtgagtgaggacggacctcaggctgtctcttcgggtcaccgctgt
gaaaacaaggcaggtacagcaccagggcagacactcgcccccaggggtggctcccccaga
tcccaggcctctaggccccgcatcaacagtgcactgcacgttgaggacaaggccatctcg
gactgcagaccctcacggccttcccacactttgtcctcacttgcaacaggggcttcgggt
gggcctcccgtttctaaagcacccactatggatgcacagcagcacagacccaagtcccaa
gactgcctgggcctactggcccccctagcatctgctgcagaagtcccctctacagctccc
gtgtctgggaagaagcacagaccaccaggacccctgttctcctcctcagatccccttcct
gccacctcttcccactcacgggactcagcccaggtcacctcgctgattcccgcgcccttc
acagctgcaagcagcgatgccggcatgagaagaacaaggcctggcacctcggctcctgca
gctgcagcagcagcccctcccccctccacattgaaccccacgtcggggtcactactcaat
gcagtggatggaggcccctcacatttcttggcctcagccacagctgcagcacgtgtccag
aggtcagaagtgagatataaccagagatcccagacctcccggaccagatcctgcctccaa
ggaaatgccagctccagctcccacagctctacggaaggcctcccgcaactaaagcggagg
agggggccagcctcatcctactgccagctggcccacagttcctcaaacacagtgagtgag
gacggacctcaggctgtctcttcgggtcacacccgctgtgagaagaaggcagggatgccg
gcatcagaagaatgtttcgtgttcgaaattgtttga

>gi568815593f:69404245_69653668|GENSCAN_predicted_peptide_4|239_aa
XLVNYQISVKCSNQFKLEVCLLNAENKVVDNQAGTQGQLKVLGANLWWPYLMHEHPAYLY
SWEGRPDGAQAVGALTPGALAVVGAADCTEVTGAFDFYTLPVGLRTVPVTESQMVIAHTK
ALDPSQPVTFVTNSTYAADKGALYVDVIRVNSYYSWYRNYGHLELIQLQLAAQFENWCKT
SQSHYSERVWSGNACRASPGRKFKIYSSGYFQIYNMLLLTILILLCNRTPELIPGFYIR

>gi568815593f:69404245_69653668|GENSCAN_predicted_CDS_4|720_bp
nggctggtgaattaccagatctccgtcaagtgcagtaaccagttcaagttggaagtgtgt
cttttgaatgcagaaaacaaagtcgtggacaaccaggctgggacccagggccagctgaag
gtgctgggtgccaacctctggtggccgtacctgatgcacgaacaccccgcctacctgtac
tcgtgggagggcaggccagatggggctcaggctgtcggggcgctcacacctggcgctttg
gctgtcgtaggtgcggctgactgcacagaagtcactggggcctttgacttctacacactc
cctgtggggctccgcactgtgcccgtcaccgagagccagatggtgattgctcacaccaaa
gccttggacccctcccagcctgtgacctttgtgaccaactccacctacgcagcagacaag
ggggctctgtatgtggatgtgatccgtgtgaacagctactactcttggtatcgcaactac
gggcacctggagttgattcagctgcagctggccgcccagtttgagaattggtgtaagaca
tcacaatcccattattcagagcgcgtatggagtggaaacgcttgtagggcttcaccaggg
agaaaattcaaaatctactcttctggctattttcaaatatataatatgttattgttaact
atactcatcctactatgcaataggacaccagaacttattcctgggttctacatccgttaa