GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:21:12 Sequence gi568815596f:238748216_238948695 : 200480 bp : 47.73% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 14012 14051 40 -2.76 1.01 Init + 27734 27922 189 1 0 62 65 163 0.451 8.67 1.02 Term + 29970 30125 156 2 0 47 52 117 0.458 1.93 1.03 PlyA + 33561 33566 6 1.05 2.00 Prom + 41467 41506 40 -4.36 2.01 Init + 43336 43427 92 0 2 81 103 37 0.038 2.50 2.02 Intr + 65066 65207 142 2 1 114 17 128 0.348 8.66 2.03 Intr + 65817 65881 65 2 2 100 55 18 0.107 -2.78 2.04 Intr + 70484 70687 204 2 0 61 30 148 0.198 4.62 2.05 Intr + 84411 84499 89 0 2 56 115 46 0.137 3.71 2.06 Term + 93639 93802 164 1 2 22 47 194 0.420 6.80 2.07 PlyA + 94317 94322 6 1.05 3.00 Prom + 96252 96291 40 -5.16 3.01 Sngl + 100001 100483 483 1 0 115 43 1116 0.999 105.88 3.02 PlyA + 102449 102454 6 1.05 4.29 PlyA - 102581 102576 6 1.05 4.28 Term - 105930 105854 77 1 2 92 41 66 0.512 0.30 4.27 Intr - 112031 111858 174 0 0 84 53 154 0.731 11.41 4.26 Intr - 117490 117376 115 1 1 50 94 -4 0.078 -3.58 4.25 Intr - 118996 118855 142 0 1 123 96 54 0.745 10.06 4.24 Intr - 120986 120323 664 0 1 49 17 184 0.166 -1.77 4.23 Intr - 124755 124522 234 1 0 32 10 190 0.157 3.26 4.22 Intr - 127201 127078 124 1 1 45 25 113 0.186 0.86 4.21 Intr - 134669 134506 164 0 2 66 92 61 0.158 3.99 4.20 Intr - 136091 135950 142 2 1 74 89 5 0.060 -0.87 4.19 Intr - 136815 136537 279 0 0 68 73 75 0.031 1.67 4.18 Intr - 142077 141902 176 1 2 72 66 72 0.036 3.06 4.17 Intr - 143366 143195 172 2 1 70 21 74 0.102 -1.58 4.16 Intr - 145545 145430 116 1 2 16 70 117 0.104 2.87 4.15 Intr - 147391 147292 100 1 1 25 86 63 0.027 -0.52 4.14 Intr - 149030 148903 128 0 2 31 77 90 0.186 2.60 4.13 Intr - 152580 152316 265 0 1 3 61 149 0.212 0.79 4.12 Intr - 161475 161289 187 2 1 57 80 110 0.841 6.79 4.11 Intr - 162837 162706 132 2 0 34 84 108 0.369 4.76 4.10 Intr - 163156 162999 158 1 2 53 91 53 0.042 0.91 4.09 Intr - 165898 165774 125 2 2 93 51 33 0.038 0.40 4.08 Intr - 166603 166475 129 2 0 95 71 46 0.027 4.27 4.07 Intr - 167272 167143 130 1 1 115 70 -12 0.024 -0.03 4.06 Intr - 172365 172235 131 1 2 38 98 46 0.222 1.01 4.05 Intr - 173991 173807 185 2 2 -6 75 113 0.139 0.03 4.04 Intr - 177492 176643 850 1 1 27 93 278 0.062 12.63 4.03 Intr - 177843 177723 121 0 1 84 86 63 0.023 5.77 4.02 Intr - 179951 179839 113 0 2 89 64 66 0.007 4.40 4.01 Init - 182051 181979 73 2 1 77 47 22 0.003 -1.77 4.00 Prom - 185402 185363 40 -7.06 5.00 Prom + 186475 186514 40 -2.96 5.01 Init + 193772 194250 479 1 2 87 72 172 0.355 10.66 5.02 Term + 195564 195756 193 0 1 30 46 144 0.374 1.29 5.03 PlyA + 196345 196350 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 99138 98447 692 1 2 102 50 287 0.960 20.05 S.002 Init - 99260 99221 40 2 1 77 37 47 0.870 -1.14 S.003 Term - 171301 171194 108 1 0 102 46 80 0.941 3.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:238748216_238948695|GENSCAN_predicted_peptide_1|114_aa MRKKDNSAAFPVPVLSSWMACGPGVVQSQAHREREAMQVEMPKQQRDAAGDKLGNPELPG ACRERVHLTGDFVQKRAGGNGKSARKIPPGPRGPAWLMKNMPVFPAKVVSCQDK >gi568815596f:238748216_238948695|GENSCAN_predicted_CDS_1|345_bp atgagaaagaaggataactcggctgccttcccagttcctgttctgagcagctggatggcg tgtggacctggagtggtgcagtcacaggcacatcgagagcgagaggcaatgcaagtggaa atgccaaagcaacagagggatgcagcaggtgacaagctgggcaacccggagttacctggg gcctgtcgagaaagggttcatctcaccggtgactttgtgcagaagagagcaggcgggaac gggaaaagcgcacgtaagatcccgccaggccctcgaggccccgcatggctgatgaagaac atgcctgtgtttccggccaaggtggtttcctgccaggacaaatga >gi568815596f:238748216_238948695|GENSCAN_predicted_peptide_2|251_aa MSTDGPAPAAALPVVSMLTGREGRPGIIWIILTASLSPGPSEDRKHQCRFLRCKEPAVLD PLLSPSHEALKTPLHGHMGPGGLHSGALSFVSNRTEHSLRTLKLEVSARAIRQEKEIRVI QTGNEVKLYLFTNDVTLYLENPHDSTKRLLELTNNFSKVPGYKINVRNHRQLNLSTLLVG MQNGLAILENSLAVSSKEPPGIKEPLLLQPASELQRSCYTSLQEPDVQTTPLDIWCLDTM LPRKPNTSTLG >gi568815596f:238748216_238948695|GENSCAN_predicted_CDS_2|756_bp atgagcacagacgggccagctcccgcagccgcccttccggtagtgtccatgctcacgggg cgggagggcaggccggggattatctggatcatcctcacagccagtttgtcaccaggtccc tctgaggacaggaaacatcagtgccgtttcttgcgttgcaaggagccagctgtgctggac ccattactgtcaccttcccacgaagccctcaagacaccgctgcatggtcacatgggccct ggaggcctccacagtggtgctctgtcttttgtctccaaccggacagagcactcgctgagg actcttaagctggaagtctcagccagagcaatcaggcaggagaaggaaataagagtcatc caaacaggaaatgaagtcaagctatatctcttcaccaacgatgtgactctatacctagaa aaccctcatgactccaccaaaaggctcctggaactgacaaacaacttcagtaaagttcca ggatacaaaatcaatgttcgaaaccatcggcagctgaacttgtctacattgctggtggga atgcaaaatggtctggccattctggaaaatagtttggcagtttcctctaaagaaccacct ggaatcaaggagccgctcctgctgcagccagcttccgagctgcagcgctcctgctatacg tctctccaggaaccggatgtgcagaccacacctctggacatctggtgcctggacaccatg ctgccaagaaaaccaaacacctccaccctgggctga >gi568815596f:238748216_238948695|GENSCAN_predicted_peptide_3|160_aa MEEGSSSPVSPVDSLGTSEEELERQPKRFGRKRRYSKKSSEDGSPTPGKRGKKGSPSAQS FEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQV LQSDEMDNKMTSCSYVAHERLSYAFSVWRMEGAWSMSASH >gi568815596f:238748216_238948695|GENSCAN_predicted_CDS_3|483_bp atggaggagggctccagctcgcccgtgtcccccgtggacagcctgggcaccagcgaggag gagctcgagaggcagcccaagcgcttcggccggaagcggcgctacagcaagaagtcgagc gaagatggcagcccgaccccgggcaagcgcggcaagaagggcagccccagcgcgcagtcc ttcgaggagctgcagagccagcgcatcctggccaacgtgcgcgagcgccagcgcacccag tcgctcaacgaggccttcgcggcgctgcgcaagatcatccccacgctgccctctgacaag ctgagcaagatccagacgctcaagctggccgccaggtacatagacttcctctaccaggtc ctgcagagcgacgagatggacaataagatgaccagctgcagctacgtggcccacgagcgc ctcagctacgccttctccgtgtggcgcatggagggcgcgtggtccatgtccgcctcccac tag >gi568815596f:238748216_238948695|GENSCAN_predicted_peptide_4|1801_aa MGRLRGCERGPKRCLDKAMYRKTKELFSPPLKAFPGDGGHGDCIGQLCALSMKHLLQDSL RNTFFYMEGPVLHRSQPQLTLGSQEALRRFAGRFPRTPGYNKVAFCFFQEHSMTRAVNLS CETTEGETTFSKNRAQTVPVSLSPNCSMAWSAANVSAALKGKALLWSQHAELASTPLHAA PHQHWLGLNLSSLAPRCKPWNANPKGQTPRGSQVPGADTFLALAVMAQMEAKTTPLSTGR LVLLPREQEPRHRLCTEFSIYFSQGGYAKAVCLSLSGSHLQPVPVDSVGSTVMLGKDPNT RTKHVDSCQCQNGRKQGKGSWLRPFANRVQRCRHLEPARSYQTGSHLCLQRLLSQRQDRG AWLCQVSFWDEPILLSQVGHDGEARRPDGFCQTPSRSPWLMEEEDKWRRGGGRATLSTAS SNCTRVWAPPECCKPPCIIKTEAQARAAPPSKVEMMSAKGILKQMEWYRWWVTKTQVEGT CCHGLSQEDGKVIPSLMSLSGAVFVPCGCCTKQPQTDSFRSGFSQSSGGRNLNQVGPVSW VTESASDETCWKYVWRMGESTQPWMGRQEKGSPEPSPDGPGPRRRGILACGLKASVGQIA SVHVGPYVWRPTGIPMPHTLHPADPGTRCPPGDMRLVAQGSDLVTLSLQSWKPPPWGPQA TPSPQCSWVLACQLPGELMVNMARMHVEQTVTRSEKQGGGGTNECHAWESREYHPGHWAG DCYGASPGFRNPCPLGQAKPFPSPECPFLPPSALSHTHHPGASPWPVLSAAMNAVWLWSR ATSYGFQTQTTSRCMLASKQAHHPHSVTLPECPECWWKEQGSPESPYPASPGTMRESERS NAAAECAPLSSCSREIPLEMPSVQRKGPSKNQRLQPREKCGRRNAGIPSLTATFKGGVGG FGEYFCGTMRQRGLASHEQPDGRRASHRSQYLGPASPGTLSTPKVPEAANNGGRRSALPE CRTPHCMSGIRLKTLNHLTQSICQGKDVCQTVLSPSSSPYTGTQTPGSPQASGQHTETQK SRPKCRPEQGIGHAGQAPEQRSASVGIQAGVATSMAGGLADPSPAGSWLCGIPWCCHLPD HVQRPMQAFAPDAPLPQASQEKRVSTSVWTFSSMGGRSTDAPGLLCPLPTPLRGAGVTCR LVGLGRVNWVCAAQPWNLPLAAVSRPQGAGTFRPCFRKAPCAPDLREACEVKVESPDSES CNPRPRSLGIPGADLAAHWHFSPGTTGSQRPHLKELASLTAPASSRATQKASFDFVERSF DHGGLLKSRYHFPGLQADRLPGSSLIMGQKEYRESCWKLSDKSDPVGGAVGGVLEEAVFF LEKTHSCTFCLICGETDSSHSYVSSTQQKDCKQPGLSKCEGNTKDPAAAPHDTSYAKPSA ERVPLGRMPVTSRLSICRMGTTLVTCPVNRPVLLTALGPISHSVQTSPRCSLRKQALQES MPFSQWRAPQPRPLQGNHLNCQAFAARAPTRAICCLMLVRTTLSLTTQTPMPTDPSEMNF LSLENARECRWMPPPSLDHLENQMDTSPCTGRHAALPPALLSRGLVSRPYHACGSAHTGT TLSPANNARVCGQQRILVMVAAGDHGQSPTLEHGLSHRHSHSQDCLVSTVGNSIFHLCLK KGQFGFLWSHRMIRNSGVVAPPMLQPASFPPAAGFEDGKGPRNKRLAPLFGANTRPPGCQ QKKKKVPKDQMAQLALALQIQRLNLQACGWATQSQETPGVSLSSGSSTCTHEEQESHDSG GGTNPLRASEPRTSLKRSTERADEPWAVDSLLMEARVKGKSELKKGEMCPSLKALLFGSE K >gi568815596f:238748216_238948695|GENSCAN_predicted_CDS_4|5406_bp atgggcaggttaagaggatgcgagagaggccctaaaagatgtctggacaaggctatgtac cgtaaaacaaaagagctattctcacctcccctgaaggccttcccaggtgatgggggccat ggagactgcattggacagctctgtgcactgtccatgaagcacttgctgcaggactccctt agaaacactttcttctacatggaaggccccgtcctccacaggtcccagcctcagctgaca ctcgggtcccaggaggctctgcgaaggtttgctgggcggttcccccgcacgccgggctac aacaaagtggctttctgcttcttccaggaacacagcatgacacgtgctgtgaacctcagc tgtgagacaaccgaaggagagacaacattcagcaaaaacagagcccaaactgtccccgtt tctctcagccccaactgctccatggcttggagtgcagcaaatgtttcagccgcgctcaaa gggaaggctttgctctggagtcagcatgcagagctcgcctcgactcccctccatgccgct cctcaccagcactggctgggtttgaatctcagctccctggctcctcgctgtaagccctgg aatgcaaaccccaagggccaaacaccccgaggctctcaggtcccaggcgcggacaccttc ctggccttggctgtcatggcccagatggaagcaaagacaactcccctctccacaggtcgg ctggtcctcctgccccgagagcaggagcccaggcacaggctctgtacagagttctccatc tacttctcccagggtggctatgcaaaggccgtctgcctctcgctctctgggtcccacctg cagcctgtcccagtggattctgtaggctccacagtgatgctgggaaaagacccaaacacc cggacaaagcacgtggacagctgccaatgccaaaacggcaggaagcaagggaaaggcagc tggctccgaccctttgccaaccgtgtgcaacgctgcaggcacctagagccagcgaggagc taccaaactggttcccatctatgcttgcagcggctgctgtcgcagaggcaggacaggggt gcatggctatgccaagtatccttctgggatgagccgattctcttgtcccaggtcgggcat gatggagaggccagaagacctgatggcttctgccagacacccagccgatccccatggctt atggaggaagaagacaagtggagacggggaggtggtcgggcgaccctttccacggccagc agcaactgcacccgtgtctgggcacctcctgagtgctgtaagcccccatgcatcatcaag acggaggcccaggcaagggcagcaccaccgtccaaagtggagatgatgtcagcaaagggg atattgaagcagatggagtggtacagatggtgggtcactaaaacgcaggttgaaggtacg tgctgccacgggctaagccaagaagatgggaaggtcattccttctttgatgtcactttcg ggggctgtatttgttccttgtggctgctgcaccaaacaaccacaaactgatagcttcaga agtggattctcccagagttctgggggcagaaatttgaatcaagtcgggcctgtttcctgg gtcactgagtcagcgagtgatgagacctgctggaaatatgtgtggaggatgggcgagtcc actcagccttggatggggcggcaggaaaaaggcagccccgagcccagcccagatggtcct ggccccagacgacgtggcatcttagcctgtgggttgaaggcatctgtggggcagattgcc tctgtacacgttggcccctacgtgtggcggcccacaggcatacccatgccacacacactc cacccagcagaccccggtaccagatgtcccccaggagacatgcgtttggtggctcagggt agtgacctggtgactctcagtctccagagctggaagcctccaccctggggtccacaagca acacccagcccccagtgctcatgggtgctggcatgccagctgcctggtgagctcatggtc aacatggcacggatgcatgtggagcagacggtgactcgttcagagaagcagggaggtggt ggcacaaatgaatgccatgcttgggagagcagggaataccacccaggacactgggcaggt gactgctacggggccagcccaggcttccgcaacccctgccccctgggccaggccaagccc tttccctcccccgagtgccctttcctgcccccgagtgccctttctcacactcaccacccc ggggcctctccctggcccgtgctcagcgcagccatgaatgcggtctggctgtggagtagg gcaacctcctacggcttccagacccagacaaccagcagatgcatgctggcttccaaacaa gctcaccacccccactcagtgacactcccagaatgtccagagtgttggtggaaagagcag ggcagccccgaaagtccatacccagcgagtccagggaccatgcgggagagcgagagaagc aatgcagctgcagaatgtgcaccgctcagcagctgctccagggaaattccactagaaatg ccttctgtccaaaggaaaggaccatctaagaatcaacgtctgcaaccaagggagaagtgc ggcaggaggaacgcagggataccgtccctgacagcgactttcaaagggggcgttggagga tttggtgaatatttctgtgggacgatgagacagcgagggctggccagccacgagcagcca gatggcaggagagcatcccaccgctcccagtacctgggccctgcaagtccaggaaccctt tcaaccccgaaagtgccagaagctgcgaacaatggcggccggcgatcggcgcttcctgag tgccggaccccacactgtatgtctgggattcggctgaagactctcaaccatctcacgcaa tcgatctgccaggggaaggacgtttgccagaccgtcctctcaccctccagctccccctac actggaacacagacacctgggagcccgcaggcctcaggacagcacacagagacacagaag tccaggccgaaatgcagaccagaacaaggaataggccatgccggccaagccccagagcag cggtcagcgtctgtggggatacaggctggagtggccacaagcatggcaggtggcctggct gaccccagcccagcaggctcctggctgtgtggcatcccctggtgctgccacctccctgac cacgtgcagaggccaatgcaggcctttgctccggatgctccactgccccaggcttctcag gagaaacgtgtctccacgagcgtgtggacattctcttccatggggggacgctccactgat gcccctggcctgctctgccccttacccacaccactcaggggagctggagtgacctgcaga ctggtgggtttgggaagagtcaactgggtgtgcgcagctcagccctggaatcttcctctg gcagcagtgagccggccccagggagcaggcactttccggccatgcttccggaaggctccc tgtgctcctgacctgcgggaggcctgcgaggtgaaggtggagagccctgactcagagtct tgcaatcccaggcccaggagccttgggatccctggggcagacttggcagcccactggcac ttcagcccaggcacaactggcagccaacgaccccatcttaaagaactggcttctctgact gccccagcttccagcagagcaacccagaaagcatcctttgactttgttgaaagaagcttt gatcatggaggacttttaaaatcaagatatcacttcccaggactccaagcagacaggctc ccaggaagttccctcatcatgggacaaaaggaatatagagaaagctgctggaagctcagt gacaagtctgacccagtgggtggtgctgttggaggagtcttggaagaggccgtcttcttc ttggagaagacccacagctgcaccttctgtctcatctgtggagagacagattctagtcat tcttatgtctccagcacccagcagaaggactgcaaacagccaggactcagtaaatgtgag ggcaacacgaaggaccctgcagctgccccacatgataccagctacgcaaaaccatctgcg gagcgtgtgcctctgggccgtatgcccgtgacctccagactttccatctgcagaatggga acaacactggtcacctgtccagtaaatcgtcctgtactcctcaccgcacttggacctatt tcccattctgtgcaaacaagtccacgttgctcgcttcggaagcaggctttgcaggagtcg atgcctttctctcagtggcgtgctccccagcctcgtcctttgcaaggaaaccacttaaac tgccaagcatttgctgcccgcgcacctacccgtgccatctgctgcctgatgctggtgcgg acgacactgtctctcacaacccagacaccgatgccaactgacccctctgagatgaacttc ctttctctagaaaatgcgagagaatgcaggtggatgccacctccatcactggaccatctt gagaatcaaatggacacaagcccctgcaccggccggcacgcagcgctccctccagccttg ctgagccgtgggcttgtgtcccggccctaccatgcctgtggatcagcccacactggcacc acactgagcccagccaacaatgcccgagtgtgtgggcaacagaggatactggtcatggtg gcagctggtgatcacggacagtcaccaaccttggaacacggcttgtcccacagacacagt cacagccaggactgcttggtgtccactgtgggaaacagcatcttccatctctgtctcaag aagggccagtttggtttcctgtggtcccatagaatgataagaaattcgggggttgtggcc cctccaatgttacagccagcctccttccctccagctgctggcttcgaggacggaaaaggc cctcgaaacaagaggctggctcctctgtttggggcaaacacaagacccccaggctgccag caaaaaaaaaaaaaagttccaaaagaccagatggcacagctggccctggccctgcaaatt caaagactcaacctgcaggcatgtggctgggccacccaaagccaggagacgccaggtgtc tccctgtcttctgggagcagcacctgcacacacgaagagcaggaatcccatgactctgga ggtggcaccaaccccttgagggcctctgagcccagaaccagcctgaagcggtccaccgag cgggcagacgagccctgggctgtggacagcctgctgatggaggcgagagtgaagggaaaa tctgaactaaagaaaggagagatgtgtccaagtttaaaggcactgctcttcgggtctgag aagtaa >gi568815596f:238748216_238948695|GENSCAN_predicted_peptide_5|223_aa MDVYVLEVLHRRHNTGWVPRAKCSKQKLSAELLLLGEGALLPHRTSMSQEAQELVATQAG QGTGSPRTRPGSDPEAWRCAAPEAAPGIQAWILSLAQHQQSNREALLATGKEVNTVNGPE QETTKVEGSPHGACMDGWYCGQGACPRQCLEGAGSSCRARPCYVWVPPQILLPTGLSDWA QAQAILPVSTKVKAQLEQRTGMLRYGLGIAKSMHMKMWEPWMP >gi568815596f:238748216_238948695|GENSCAN_predicted_CDS_5|672_bp atggacgtgtatgtcctggaagtgctccatcgcaggcacaacacaggctgggtccccaga gcaaaatgcagcaagcagaagctttcggctgagctcctgttgctgggtgaaggagccctt cttccccatcggacgtccatgagccaggaagcccaggagcttgtagcaactcaagccgga caaggaacaggatctcctagaacacgacccggctctgatccggaggcctggagatgtgca gccccggaggcagctcccgggatacaggcatggatactctccttagcacagcatcagcag agcaaccgggaggcccttctggctactgggaaggaggtgaacacagtgaacgggccagag caggaaacaaccaaggtggagggcagcccccatggcgcctgtatggatgggtggtactgt ggccagggagcctgcccgcggcaatgcctggaaggagccggctcctcctgcagagccagg ccctgctacgtctgggtcccaccacagatcctgctcccaactgggctgtctgattgggct caggcccaggccatcctgccagtctccaccaaagtcaaggcccaactggaacagcggact gggatgctgagatatggactgggaatagccaagtccatgcacatgaaaatgtgggaaccc tggatgccctga