GENSCAN 1.0	Date run:  3-Nov-116	Time: 03:15:22

Sequence gi568815576f:28681060_28886323 : 205264 bp : 45.40% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.12 PlyA -    727    722    6                               -0.45
 1.11 Term -   6927   6838   90  0  0   81   43    54 0.535  -2.08
 1.10 Intr -   8156   8076   81  2  0  105   66    59 0.680   5.23
 1.09 Intr -  15928  15842   87  1  0   44  116    17 0.444   0.27
 1.08 Intr -  18878  18779  100  1  1  118   98   102 0.977  14.31
 1.07 Intr -  22507  22446   62  1  2   94   87    68 0.964   4.83
 1.06 Intr -  29000  28947   54  2  0   74  103    48 0.942   4.08
 1.05 Intr -  30958  30850  109  0  1  105   92    74 0.999   9.79
 1.04 Intr -  38426  38336   91  0  1   97   94    26 0.959   3.15
 1.03 Intr -  44065  43918  148  1  1   55   85    43 0.742   0.51
 1.02 Intr -  44308  44184  125  2  2   88   73   124 0.752  11.20
 1.01 Init -  53662  53344  319  1  1   63   91   176 0.864  12.90
 1.00 Prom -  54065  54026   40                               -4.66

 2.00 Prom +  54520  54559   40                               -4.46
 2.01 Init +  61037  61272  236  1  2   50   72   130 0.496   3.21
 2.02 Intr +  62823  62919   97  0  1   84   70    64 0.719   4.21
 2.03 Intr +  63556  63645   90  0  0   59   61   125 0.992   7.09
 2.04 Intr +  64805  64949  145  1  1   79   62   172 0.683  13.56
 2.05 Term +  76019  76110   92  0  2  101   32    63 0.082  -0.12
 2.06 PlyA +  76135  76140    6                                1.05

 3.00 Prom +  80338  80377   40                               -5.36
 3.01 Init +  91791  91975  185  2  2   72   89   146 0.996  10.50
 3.02 Intr +  92666  92719   54  2  0   99   80    76 0.976   5.99
 3.03 Intr +  99889 100113  225  1  0   99   99   133 0.978  12.60
 3.04 Intr + 102449 102586  138  2  0   46   92   125 0.863   8.28
 3.05 Term + 102697 102730   34  1  1   90   50     8 0.386  -5.74
 3.06 PlyA + 102969 102974    6                                1.05

 4.05 PlyA - 104039 104034    6                                1.05
 4.04 Term - 114673 114116  558  1  0   88   49   167 0.909   7.15
 4.03 Intr - 116146 116018  129  1  0   76   94   112 0.941  11.49
 4.02 Intr - 118094 117998   97  1  1  123   83   123 0.909  15.31
 4.01 Init - 119465 119239  227  2  2   74   64   246 0.978  17.21
 4.00 Prom - 121748 121709   40                               -5.26

 5.00 Prom + 122073 122112   40                               -4.36
 5.01 Sngl + 145048 145371  324  0  0   87   32   136 0.654   3.90
 5.02 PlyA + 149434 149439    6                                1.05

 6.00 Prom + 150661 150700   40                               -4.86
 6.01 Init + 152954 153197  244  1  1  103   74    97 0.530   7.70
 6.02 Term + 155950 156083  134  2  2  100   38    75 0.529   1.95
 6.03 PlyA + 157689 157694    6                                1.05

 7.00 Prom + 161500 161539   40                               -6.36
 7.01 Init + 163334 163467  134  1  2   56   56   466 0.407  39.81
 7.02 Intr + 202704 203007  304  0  1  107   70   496 0.571  46.29

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:28681060_28886323|GENSCAN_predicted_peptide_1|421_aa
MSRESDVEAQQSHGSSACSQPHGSVTQSQGSSSQSQGISSSSTSTMPNSSQSSHSSSGTL
SSLETVSTQELYSIPEDQEPEDQEPEEPTPAPWARLWALQDGFANLECVNDNYWFGRDKS
CEYCFDEPLLKRTDKYRTYSKKHFRIFREVGPKNSYIAYIEDHSGNGTFVNTELVGKGKR
RPLNNNSEIALSLSRNKVFVFFDLTVDDQSVYPKALRDEYIMSKTLGSGACGEVKLAFER
KTCKKVAIKIISKRKFAIGSAREADPALNVETEIEILKKLNHPCIIKIKNFFDAEDYYIV
LELMEGGELFDKVVGNKRLKEATCKLYFYQMLLAVQYLHENGIIHRDLKPENVLLSSQEE
DCLIKDEDMKRKFQDLLSEENESTALPQVLAQPSTSRKRPREGEAEGAETTKRPAVCAAV
L

>gi568815576f:28681060_28886323|GENSCAN_predicted_CDS_1|1266_bp
atgtctcgggagtcggatgttgaggctcagcagtctcatggcagcagtgcctgttcacag
ccccatggcagcgttacccagtcccaaggctcctcctcacagtcccagggcatatccagc
tcctctaccagcacgatgccaaactccagccagtcctctcactccagctctgggacactg
agctccttagagacagtgtccactcaggaactctattctattcctgaggaccaagaacct
gaggaccaagaacctgaggagcctacccctgccccctgggctcgattatgggcccttcag
gatggatttgccaatcttgaatgtgtgaatgacaactactggtttgggagggacaaaagc
tgtgaatattgctttgatgaaccactgctgaaaagaacagataaataccgaacatacagc
aagaaacactttcggattttcagggaagtgggtcctaaaaactcttacattgcatacata
gaagatcacagtggcaatggaacctttgtaaatacagagcttgtagggaaaggaaaacgc
cgtcctttgaataacaattctgaaattgcactgtcactaagcagaaataaagtttttgtc
ttttttgatctgactgtagatgatcagtcagtttatcctaaggcattaagagatgaatac
atcatgtcaaaaactcttggaagtggtgcctgtggagaggtaaagctggctttcgagagg
aaaacatgtaagaaagtagccataaagatcatcagcaaaaggaagtttgctattggttca
gcaagagaggcagacccagctctcaatgttgaaacagaaatagaaattttgaaaaagcta
aatcatccttgcatcatcaagattaaaaacttttttgatgcagaagattattatattgtt
ttggaattgatggaagggggagagctgtttgacaaagtggtggggaataaacgcctgaaa
gaagctacctgcaagctctatttttaccagatgctcttggctgtgcagtaccttcatgaa
aacggtattatacaccgtgacttaaagccagagaatgttttactgtcatctcaagaagag
gactgtcttataaaggatgaagacatgaagagaaagtttcaagatcttctgtctgaggaa
aatgaatccacagctctaccccaggttctagcccagccttctactagtcgaaagcggccc
cgtgaaggggaagccgagggtgccgagaccacaaagcgcccagctgtgtgtgctgctgtg
ttgtga

>gi568815576f:28681060_28886323|GENSCAN_predicted_peptide_2|219_aa
MWRGRAGALLRVWGFWPTGVPRRRPLSCDAASQAGSNYPRCWNCGGPWGPGREDRFFCPQ
CRALQAPDPTRDYFSLMDCNRSFRVDTAKLQHRYQQLQRLVHPDFFSQRSQTEKDFSEKH
STLVNDAYKTLLAPLSRGLYLLKLHGIEIPERTDYEMDRQFLIEIMEINEKLAEAESEAA
MKEIESIVKDDFEEAKEILTKMRYFSNIEEKIKLKKIPL

>gi568815576f:28681060_28886323|GENSCAN_predicted_CDS_2|660_bp
atgtggcgggggagagccggggctttgctccgggtgtgggggttttggccgacaggggtt
cccagaaggagaccgctaagctgcgatgctgcgtcgcaggcgggaagcaattatccccgc
tgttggaactgcggcggcccatggggccccgggcgggaggacaggttcttctgcccacag
tgccgagcgctgcaggcacctgaccccactcgagactacttcagccttatggactgcaac
cgttccttcagagttgatacagcgaagctccagcacaggtaccagcaactgcagcgtctt
gtccacccagatttcttcagccagaggtctcagactgaaaaggacttctcagagaagcat
tcgaccctggtgaatgatgcctataagaccctcctggcccccctgagcagaggactgtac
cttctaaagctccatggaatagagattcctgaaaggacagattatgaaatggacaggcaa
ttcctcatagaaataatggaaatcaatgaaaaactcgcagaagctgaaagtgaagctgcc
atgaaagagattgaatccattgtcaaagatgactttgaagaagccaaggaaattttgaca
aagatgagatacttttcaaatatagaagaaaagatcaagttaaagaagattcccctttaa

>gi568815576f:28681060_28886323|GENSCAN_predicted_peptide_3|211_aa
MAALGRPFSGLPLSGGSDFLQPPQPAFPGRAFPPGADGAELAPRPGPRAVPSSPAGSAAR
GRVSVHCKKKHKREEEEDDDCPVRKKRITEAELCAGPNDWILCAHQDVEGHGVNPSVSGL
SIPGILDVICEEMDQTTGEPQCEVARRKLQEIEDRIIDEDEEVEADRNVNHLPSLVLSDT
MKTGLKREFDEVFTKKMIESMQLEIGFFLVH

>gi568815576f:28681060_28886323|GENSCAN_predicted_CDS_3|636_bp
atggctgcgctcggccggcccttcagcggcctccctctgagcggcggctcggacttcctg
cagccgccgcagccggccttccccggccgggccttcccgccgggggctgacggcgccgag
ttggccccgcggccgggacctcgcgcagtccctagcagtcccgctgggagtgcggcgcgc
ggacgtgtttctgttcactgtaaaaagaaacacaagcgagaggaggaggaggatgatgat
tgtccagtaagaaagaaaaggataactgaagcagagctctgtgctggtcctaatgactgg
attctttgtgcacatcaggatgtagaggggcatggagtaaatcccagtgttagtggcctt
tccatacctgggatattagatgttatttgtgaagaaatggatcagacaactggagaacca
cagtgtgaagttgcccgaaggaagcttcaggagattgaggacaggataattgatgaagat
gaagaagttgaagctgacagaaatgttaaccatctccccagtcttgtcctttctgatacc
atgaaaacaggtttgaagagggaatttgatgaagtttttacaaagaaaatgattgagtct
atgcaattggagattggctttttcctagttcattag

>gi568815576f:28681060_28886323|GENSCAN_predicted_peptide_4|336_aa
MVVVAAAPNPADGTPKVLLLSGQPASAAGAPAGQALPLMVPAQRGASPEAASGGLPQARK
RQRLTHLSPEEKALRRKLKNRVAAQTARDRKKARMSELEQQVVDLEEENQKLLLENQLLR
EKTHGLVVENQELRQRLGMDALVAEEEAEAKSDILLGILDNLDPVMFFKCPSPEPASLEE
LPEVYPEGPSSLPASLSLSVGTSSAKLEAINELIRFDHIYTKPLVLEIPSETESQANVVV
KIEEAPLSPSENDHPEFIVSVKEEPVEDDLVPELGISNLLSSSHCPKPSSCLLDAYSDCG
YGGSLSPFSDMSSLLGVNHSWEDTFANELFPQLISV

>gi568815576f:28681060_28886323|GENSCAN_predicted_CDS_4|1011_bp
atggtggtggtggcagccgcgccgaacccggccgacgggacccctaaagttctgcttctg
tcggggcagcccgcctccgccgccggagccccggccggccaggccctgccgctcatggtg
ccagcccagagaggggccagcccggaggcagcgagcggggggctgccccaggcgcgcaag
cgacagcgcctcacgcacctgagccccgaggagaaggcgctgaggaggaaactgaaaaac
agagtagcagctcagactgccagagatcgaaagaaggctcgaatgagtgagctggaacag
caagtggtagatttagaagaagagaaccaaaaacttttgctagaaaatcagcttttacga
gagaaaactcatggccttgtagttgagaaccaggagttaagacagcgcttggggatggat
gccctggttgctgaagaggaggcggaagccaagtctgatatcctgttgggcattctggac
aacttggacccagtcatgttcttcaaatgcccttccccagagcctgccagcctggaggag
ctcccagaggtctacccagaaggacccagttccttaccagcctccctttctctgtcagtg
gggacgtcatcagccaagctggaagccattaatgaactaattcgttttgaccacatatat
accaagcccctagtcttagagataccctctgagacagagagccaagctaatgtggtagtg
aaaatcgaggaagcacctctcagcccctcagagaatgatcaccctgaattcattgtctca
gtgaaggaagaacctgtagaagatgacctcgttccggagctgggtatctcaaatctgctt
tcatccagccactgcccaaagccatcttcctgcctactggatgcttacagtgactgtgga
tacgggggttccctttccccattcagtgacatgtcctctctgcttggtgtaaaccattct
tgggaggacacttttgccaatgaactctttccccagctgattagtgtctaa

>gi568815576f:28681060_28886323|GENSCAN_predicted_peptide_5|107_aa
MDPFLTPYAKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAMATKAKI
DKWDLIKLKSFCTAKETIIRVNRQPTEWEKIFTIYPSEKGLVSRIYK

>gi568815576f:28681060_28886323|GENSCAN_predicted_CDS_5|324_bp
atggatcccttccttacaccttatgcaaaaattaattctagatggattaaagacttaaat
gttagacctaaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacata
ggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaatt
gacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaacgatcatcaga
gtgaacaggcaacctacagaatgggagaaaatttttacaatctacccatctgaaaaaggg
ctagtatccagaatctacaaataa

>gi568815576f:28681060_28886323|GENSCAN_predicted_peptide_6|125_aa
MPSVIGFSTGPAKELCEGDTLDRAYFLALGTDPWSSDAHLKSGLSCPHGMRNVSRQKGLD
LWGLLPFIPFILGEGPCQPPRNGFLHLAWENGKQLHLTLSLQLMILDEVKPPVSQFPYIK
PQRSL

>gi568815576f:28681060_28886323|GENSCAN_predicted_CDS_6|378_bp
atgcccagtgtcattggcttttctacaggacctgccaaagagctgtgtgaaggagatacc
ttagaccgagcatacttcctggccctgggcacagatccttggtcctcagacgcgcacttg
aagtccggcctcagctgcccacatgggatgaggaatgtgtcccgtcagaaaggtctagat
ctctgggggctactgccatttatcccttttattctgggagaaggtccatgtcagccacca
cgaaatggcttcctccacctggcctgggaaaatggcaagcagctccaccttacactgtcc
ttacagcttatgatcttagacgaagtgaagccccctgtttcccagtttccttatatcaag
cctcagagaagtctttga

>gi568815576f:28681060_28886323|GENSCAN_predicted_peptide_7|146_aa
MKKKKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEQAMTMRPRSGGRPGATGR
RRRRLRRRPRGLRCSRLPPPPPLPLLLGLLLAAAGPGAARAKETAFVEVVLFESSPSGDY
TTYTTGLTGRFSRAGATLSAEGEIVQ

>gi568815576f:28681060_28886323|GENSCAN_predicted_CDS_7|438_bp
atgaagaagaagaaagaagaagaagaagaagaagaagaagaagaggaagaggaagaggaa
gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gaagaacaggccatgaccatgaggccgcgctcgggcgggcgcccaggggccacgggccgc
cgccgccgccgcctgcgccgccgcccccgcggcctccggtgcagccgcctgccgccgccg
ccgccgctgccgctgctgctcgggctgctgctggcggccgcggggcccggcgcggcgcgg
gccaaggagacggcgttcgtggaggtggtgctgttcgagtcgagcccaagcggcgattac
accacctacaccaccggcctcacgggccgcttctcgcgggccggggccacgctcagcgcc
gagggcgagatcgtgcag