GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:13:03 Sequence gi568815576r:28695549_28900524 : 204976 bp : 45.37% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Intr - 1439 1353 87 2 0 44 116 17 0.031 0.27 1.08 Intr - 4389 4290 100 2 1 118 98 102 0.894 14.31 1.07 Intr - 8018 7957 62 2 2 94 87 68 0.964 4.83 1.06 Intr - 14511 14458 54 0 0 74 103 48 0.942 4.08 1.05 Intr - 16469 16361 109 1 1 105 92 74 0.999 9.79 1.04 Intr - 23937 23847 91 1 1 97 94 26 0.959 3.15 1.03 Intr - 29576 29429 148 2 1 55 85 43 0.742 0.51 1.02 Intr - 29819 29695 125 0 2 88 73 124 0.752 11.20 1.01 Init - 39173 38855 319 2 1 63 91 176 0.864 12.90 1.00 Prom - 39576 39537 40 -4.66 2.00 Prom + 40031 40070 40 -4.46 2.01 Init + 46548 46783 236 2 2 50 72 130 0.496 3.21 2.02 Intr + 48334 48430 97 1 1 84 70 64 0.719 4.21 2.03 Intr + 49067 49156 90 1 0 59 61 125 0.992 7.09 2.04 Intr + 50316 50460 145 2 1 79 62 172 0.683 13.56 2.05 Term + 61530 61621 92 1 2 101 32 63 0.082 -0.12 2.06 PlyA + 61646 61651 6 1.05 3.00 Prom + 65849 65888 40 -5.36 3.01 Init + 77302 77486 185 0 2 72 89 146 0.996 10.50 3.02 Intr + 78177 78230 54 0 0 99 80 76 0.976 5.99 3.03 Intr + 85400 85624 225 2 0 99 99 133 0.978 12.60 3.04 Intr + 87960 88097 138 0 0 46 92 125 0.863 8.28 3.05 Term + 88208 88241 34 2 1 90 50 8 0.386 -5.74 3.06 PlyA + 88480 88485 6 1.05 4.05 PlyA - 89550 89545 6 1.05 4.04 Term - 100184 99627 558 2 0 88 49 167 0.909 7.15 4.03 Intr - 101657 101529 129 2 0 76 94 112 0.941 11.49 4.02 Intr - 103605 103509 97 2 1 123 83 123 0.909 15.31 4.01 Init - 104976 104750 227 0 2 74 64 246 0.978 17.21 4.00 Prom - 107259 107220 40 -5.26 5.00 Prom + 107584 107623 40 -4.36 5.01 Sngl + 130559 130882 324 1 0 87 32 136 0.654 3.90 5.02 PlyA + 134945 134950 6 1.05 6.00 Prom + 136172 136211 40 -4.86 6.01 Init + 138465 138708 244 2 1 103 74 97 0.530 7.70 6.02 Term + 141461 141594 134 0 2 100 38 75 0.529 1.95 6.03 PlyA + 143200 143205 6 1.05 7.00 Prom + 147011 147050 40 -6.36 7.01 Init + 148845 148978 134 2 2 56 56 466 0.407 39.81 7.02 Intr + 188215 188518 304 1 1 107 70 496 0.572 46.29 7.03 Term + 201566 201667 102 1 0 90 37 80 0.092 1.38 7.04 PlyA + 203555 203560 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:28695549_28900524|GENSCAN_predicted_peptide_1|365_aa MSRESDVEAQQSHGSSACSQPHGSVTQSQGSSSQSQGISSSSTSTMPNSSQSSHSSSGTL SSLETVSTQELYSIPEDQEPEDQEPEEPTPAPWARLWALQDGFANLECVNDNYWFGRDKS CEYCFDEPLLKRTDKYRTYSKKHFRIFREVGPKNSYIAYIEDHSGNGTFVNTELVGKGKR RPLNNNSEIALSLSRNKVFVFFDLTVDDQSVYPKALRDEYIMSKTLGSGACGEVKLAFER KTCKKVAIKIISKRKFAIGSAREADPALNVETEIEILKKLNHPCIIKIKNFFDAEDYYIV LELMEGGELFDKVVGNKRLKEATCKLYFYQMLLAVQYLHENGIIHRDLKPENVLLSSQEE DCLIK >gi568815576r:28695549_28900524|GENSCAN_predicted_CDS_1|1095_bp atgtctcgggagtcggatgttgaggctcagcagtctcatggcagcagtgcctgttcacag ccccatggcagcgttacccagtcccaaggctcctcctcacagtcccagggcatatccagc tcctctaccagcacgatgccaaactccagccagtcctctcactccagctctgggacactg agctccttagagacagtgtccactcaggaactctattctattcctgaggaccaagaacct gaggaccaagaacctgaggagcctacccctgccccctgggctcgattatgggcccttcag gatggatttgccaatcttgaatgtgtgaatgacaactactggtttgggagggacaaaagc tgtgaatattgctttgatgaaccactgctgaaaagaacagataaataccgaacatacagc aagaaacactttcggattttcagggaagtgggtcctaaaaactcttacattgcatacata gaagatcacagtggcaatggaacctttgtaaatacagagcttgtagggaaaggaaaacgc cgtcctttgaataacaattctgaaattgcactgtcactaagcagaaataaagtttttgtc ttttttgatctgactgtagatgatcagtcagtttatcctaaggcattaagagatgaatac atcatgtcaaaaactcttggaagtggtgcctgtggagaggtaaagctggctttcgagagg aaaacatgtaagaaagtagccataaagatcatcagcaaaaggaagtttgctattggttca gcaagagaggcagacccagctctcaatgttgaaacagaaatagaaattttgaaaaagcta aatcatccttgcatcatcaagattaaaaacttttttgatgcagaagattattatattgtt ttggaattgatggaagggggagagctgtttgacaaagtggtggggaataaacgcctgaaa gaagctacctgcaagctctatttttaccagatgctcttggctgtgcagtaccttcatgaa aacggtattatacaccgtgacttaaagccagagaatgttttactgtcatctcaagaagag gactgtcttataaag >gi568815576r:28695549_28900524|GENSCAN_predicted_peptide_2|219_aa MWRGRAGALLRVWGFWPTGVPRRRPLSCDAASQAGSNYPRCWNCGGPWGPGREDRFFCPQ CRALQAPDPTRDYFSLMDCNRSFRVDTAKLQHRYQQLQRLVHPDFFSQRSQTEKDFSEKH STLVNDAYKTLLAPLSRGLYLLKLHGIEIPERTDYEMDRQFLIEIMEINEKLAEAESEAA MKEIESIVKDDFEEAKEILTKMRYFSNIEEKIKLKKIPL >gi568815576r:28695549_28900524|GENSCAN_predicted_CDS_2|660_bp atgtggcgggggagagccggggctttgctccgggtgtgggggttttggccgacaggggtt cccagaaggagaccgctaagctgcgatgctgcgtcgcaggcgggaagcaattatccccgc tgttggaactgcggcggcccatggggccccgggcgggaggacaggttcttctgcccacag tgccgagcgctgcaggcacctgaccccactcgagactacttcagccttatggactgcaac cgttccttcagagttgatacagcgaagctccagcacaggtaccagcaactgcagcgtctt gtccacccagatttcttcagccagaggtctcagactgaaaaggacttctcagagaagcat tcgaccctggtgaatgatgcctataagaccctcctggcccccctgagcagaggactgtac cttctaaagctccatggaatagagattcctgaaaggacagattatgaaatggacaggcaa ttcctcatagaaataatggaaatcaatgaaaaactcgcagaagctgaaagtgaagctgcc atgaaagagattgaatccattgtcaaagatgactttgaagaagccaaggaaattttgaca aagatgagatacttttcaaatatagaagaaaagatcaagttaaagaagattcccctttaa >gi568815576r:28695549_28900524|GENSCAN_predicted_peptide_3|211_aa MAALGRPFSGLPLSGGSDFLQPPQPAFPGRAFPPGADGAELAPRPGPRAVPSSPAGSAAR GRVSVHCKKKHKREEEEDDDCPVRKKRITEAELCAGPNDWILCAHQDVEGHGVNPSVSGL SIPGILDVICEEMDQTTGEPQCEVARRKLQEIEDRIIDEDEEVEADRNVNHLPSLVLSDT MKTGLKREFDEVFTKKMIESMQLEIGFFLVH >gi568815576r:28695549_28900524|GENSCAN_predicted_CDS_3|636_bp atggctgcgctcggccggcccttcagcggcctccctctgagcggcggctcggacttcctg cagccgccgcagccggccttccccggccgggccttcccgccgggggctgacggcgccgag ttggccccgcggccgggacctcgcgcagtccctagcagtcccgctgggagtgcggcgcgc ggacgtgtttctgttcactgtaaaaagaaacacaagcgagaggaggaggaggatgatgat tgtccagtaagaaagaaaaggataactgaagcagagctctgtgctggtcctaatgactgg attctttgtgcacatcaggatgtagaggggcatggagtaaatcccagtgttagtggcctt tccatacctgggatattagatgttatttgtgaagaaatggatcagacaactggagaacca cagtgtgaagttgcccgaaggaagcttcaggagattgaggacaggataattgatgaagat gaagaagttgaagctgacagaaatgttaaccatctccccagtcttgtcctttctgatacc atgaaaacaggtttgaagagggaatttgatgaagtttttacaaagaaaatgattgagtct atgcaattggagattggctttttcctagttcattag >gi568815576r:28695549_28900524|GENSCAN_predicted_peptide_4|336_aa MVVVAAAPNPADGTPKVLLLSGQPASAAGAPAGQALPLMVPAQRGASPEAASGGLPQARK RQRLTHLSPEEKALRRKLKNRVAAQTARDRKKARMSELEQQVVDLEEENQKLLLENQLLR EKTHGLVVENQELRQRLGMDALVAEEEAEAKSDILLGILDNLDPVMFFKCPSPEPASLEE LPEVYPEGPSSLPASLSLSVGTSSAKLEAINELIRFDHIYTKPLVLEIPSETESQANVVV KIEEAPLSPSENDHPEFIVSVKEEPVEDDLVPELGISNLLSSSHCPKPSSCLLDAYSDCG YGGSLSPFSDMSSLLGVNHSWEDTFANELFPQLISV >gi568815576r:28695549_28900524|GENSCAN_predicted_CDS_4|1011_bp atggtggtggtggcagccgcgccgaacccggccgacgggacccctaaagttctgcttctg tcggggcagcccgcctccgccgccggagccccggccggccaggccctgccgctcatggtg ccagcccagagaggggccagcccggaggcagcgagcggggggctgccccaggcgcgcaag cgacagcgcctcacgcacctgagccccgaggagaaggcgctgaggaggaaactgaaaaac agagtagcagctcagactgccagagatcgaaagaaggctcgaatgagtgagctggaacag caagtggtagatttagaagaagagaaccaaaaacttttgctagaaaatcagcttttacga gagaaaactcatggccttgtagttgagaaccaggagttaagacagcgcttggggatggat gccctggttgctgaagaggaggcggaagccaagtctgatatcctgttgggcattctggac aacttggacccagtcatgttcttcaaatgcccttccccagagcctgccagcctggaggag ctcccagaggtctacccagaaggacccagttccttaccagcctccctttctctgtcagtg gggacgtcatcagccaagctggaagccattaatgaactaattcgttttgaccacatatat accaagcccctagtcttagagataccctctgagacagagagccaagctaatgtggtagtg aaaatcgaggaagcacctctcagcccctcagagaatgatcaccctgaattcattgtctca gtgaaggaagaacctgtagaagatgacctcgttccggagctgggtatctcaaatctgctt tcatccagccactgcccaaagccatcttcctgcctactggatgcttacagtgactgtgga tacgggggttccctttccccattcagtgacatgtcctctctgcttggtgtaaaccattct tgggaggacacttttgccaatgaactctttccccagctgattagtgtctaa >gi568815576r:28695549_28900524|GENSCAN_predicted_peptide_5|107_aa MDPFLTPYAKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAMATKAKI DKWDLIKLKSFCTAKETIIRVNRQPTEWEKIFTIYPSEKGLVSRIYK >gi568815576r:28695549_28900524|GENSCAN_predicted_CDS_5|324_bp atggatcccttccttacaccttatgcaaaaattaattctagatggattaaagacttaaat gttagacctaaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacata ggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaatt gacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaacgatcatcaga gtgaacaggcaacctacagaatgggagaaaatttttacaatctacccatctgaaaaaggg ctagtatccagaatctacaaataa >gi568815576r:28695549_28900524|GENSCAN_predicted_peptide_6|125_aa MPSVIGFSTGPAKELCEGDTLDRAYFLALGTDPWSSDAHLKSGLSCPHGMRNVSRQKGLD LWGLLPFIPFILGEGPCQPPRNGFLHLAWENGKQLHLTLSLQLMILDEVKPPVSQFPYIK PQRSL >gi568815576r:28695549_28900524|GENSCAN_predicted_CDS_6|378_bp atgcccagtgtcattggcttttctacaggacctgccaaagagctgtgtgaaggagatacc ttagaccgagcatacttcctggccctgggcacagatccttggtcctcagacgcgcacttg aagtccggcctcagctgcccacatgggatgaggaatgtgtcccgtcagaaaggtctagat ctctgggggctactgccatttatcccttttattctgggagaaggtccatgtcagccacca cgaaatggcttcctccacctggcctgggaaaatggcaagcagctccaccttacactgtcc ttacagcttatgatcttagacgaagtgaagccccctgtttcccagtttccttatatcaag cctcagagaagtctttga >gi568815576r:28695549_28900524|GENSCAN_predicted_peptide_7|179_aa MKKKKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEQAMTMRPRSGGRPGATGR RRRRLRRRPRGLRCSRLPPPPPLPLLLGLLLAAAGPGAARAKETAFVEVVLFESSPSGDY TTYTTGLTGRFSRAGATLSAEGEIVQIHPSKVVFGHISLLIKIFRGVLLLTTELCEEVT >gi568815576r:28695549_28900524|GENSCAN_predicted_CDS_7|540_bp atgaagaagaagaaagaagaagaagaagaagaagaagaagaagaggaagaggaagaggaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaacaggccatgaccatgaggccgcgctcgggcgggcgcccaggggccacgggccgc cgccgccgccgcctgcgccgccgcccccgcggcctccggtgcagccgcctgccgccgccg ccgccgctgccgctgctgctcgggctgctgctggcggccgcggggcccggcgcggcgcgg gccaaggagacggcgttcgtggaggtggtgctgttcgagtcgagcccaagcggcgattac accacctacaccaccggcctcacgggccgcttctcgcgggccggggccacgctcagcgcc gagggcgagatcgtgcagattcatccttctaaagtagtgtttggtcacatcagtctcttg atcaaaatctttcgtggcgtcctgttgcttactactgagctgtgtgaagaggtgacctag