GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:09:01 Sequence gi568815579f:16561218_16788368 : 227151 bp : 49.50% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 5440 5315 126 1 0 93 97 197 0.993 21.98 1.03 Intr - 6952 6815 138 1 0 112 63 191 0.997 19.66 1.02 Intr - 10365 10295 71 1 2 99 86 50 0.965 4.80 1.01 Init - 11147 10727 421 2 1 100 94 748 0.999 71.15 1.00 Prom - 13741 13702 40 -4.96 2.06 PlyA - 14070 14065 6 -0.45 2.05 Term - 16465 14810 1656 1 0 79 47 1835 0.996 167.93 2.04 Intr - 17192 17118 75 2 0 110 105 63 0.939 9.91 2.03 Intr - 22266 22211 56 1 2 112 61 38 0.585 2.20 2.02 Intr - 24124 24039 86 0 2 82 41 57 0.339 -0.14 2.01 Init - 26486 26407 80 2 2 81 77 63 0.487 3.52 2.00 Prom - 38315 38276 40 -4.56 3.00 Prom + 44995 45034 40 -4.16 3.01 Init + 46708 46926 219 0 0 47 110 143 0.952 11.03 3.02 Intr + 57497 57568 72 1 0 96 86 8 0.125 1.00 3.03 Intr + 62952 63043 92 2 2 98 121 38 0.041 6.89 3.04 Intr + 66436 66546 111 1 0 88 64 85 0.000 5.59 3.05 Term + 90356 90539 184 2 1 60 54 194 0.562 10.22 3.06 PlyA + 91778 91783 6 1.05 4.00 Prom + 93767 93806 40 -4.56 4.01 Init + 100001 100124 124 1 1 115 99 329 0.999 35.03 4.02 Intr + 118767 118923 157 1 1 74 107 153 0.432 14.87 4.03 Intr + 119180 119364 185 2 2 80 94 213 0.991 20.53 4.04 Intr + 119704 119814 111 2 0 -11 94 98 0.515 0.85 4.05 Intr + 121204 121291 88 2 1 51 59 157 0.394 8.13 4.06 Intr + 125071 125188 118 1 1 43 121 129 0.380 12.17 4.07 Term + 126927 127154 228 2 0 88 50 276 0.997 20.43 4.08 PlyA + 128787 128792 6 1.05 5.00 Prom + 130976 131015 40 -4.86 5.01 Init + 131901 131955 55 2 1 114 79 43 0.904 7.45 5.02 Term + 138938 138999 62 0 2 112 43 12 0.340 -2.93 5.03 PlyA + 139502 139507 6 1.05 6.00 Prom + 151210 151249 40 -5.36 6.01 Init + 166339 166461 123 0 0 80 56 123 0.099 6.69 6.02 Intr + 169975 170061 87 0 0 111 78 32 0.174 4.67 6.03 Intr + 175417 175533 117 0 0 106 62 47 0.545 4.56 6.04 Intr + 183204 183501 298 2 1 65 94 304 0.990 25.15 6.05 Intr + 187922 189194 1273 0 1 53 131 1279 0.960 116.00 6.06 Intr + 198008 198211 204 2 0 92 96 341 0.999 33.62 6.07 Intr + 200762 200921 160 2 1 58 100 212 0.999 19.39 6.08 Intr + 202611 202728 118 2 1 92 82 114 0.921 11.24 6.09 Intr + 203817 203975 159 1 0 112 92 164 0.994 19.16 6.10 Intr + 211909 212106 198 2 0 74 113 41 0.785 4.52 6.11 Intr + 218126 218248 123 0 0 63 94 172 0.834 15.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 47534 47647 114 1 0 56 45 83 0.887 -0.63 S.002 Term - 92908 92814 95 2 2 82 47 182 0.897 11.49 S.003 Intr - 98230 98178 53 0 2 82 110 65 0.889 6.65 S.004 Init - 98893 98868 26 1 2 76 58 65 0.886 0.57 S.005 Init + 166339 166456 118 0 1 80 110 128 0.836 12.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:16561218_16788368|GENSCAN_predicted_peptide_1|252_aa MAAAAVGAGHGAGGPGAASSSGGAREGARVAALCLLWYALSAGGNVVNKVILSAFPFPVT VSLCHILALCAGLPPLLRAWRVPPAPPVSGPGPSPHPSSGPLLPPRFYPRYVLPLAFGKY FASVSAHVSIWKVPVSYAHTVKATMPIWVVLLSRIIMKEKQSTKVYLSLIPIISGVLLAT VTELSFDMWGLVSALAATLCFSLQNIFSKKVLRDSRIHHLRLLNILGCHAVFFMIPTWVL VDLSAFLVSSDL >gi568815579f:16561218_16788368|GENSCAN_predicted_CDS_1|756_bp atggcggcggccgcggtgggcgcgggccacggcgcggggggcccgggcgcagcgagcagc agtggtggggcgcgcgagggcgcgcgggtggcggcgctgtgcctgctgtggtacgcgctg agcgcgggcggcaacgtggtcaacaaggtgatcctgagcgccttcccgttcccggtgacc gtgtcgctgtgccacatcctggctctgtgcgctgggctcccgccgctgctgcgcgcctgg cgcgtgccccccgcgccgcccgtctcgggccccggacccagtccgcatccgtcgtccggc ccgctgctgccgccgcgcttctacccgcgctacgtgctaccgctcgccttcggcaagtac ttcgcgtccgtgtcagcgcacgtcagcatctggaaggtgcccgtgtcctatgcacacacc gtcaaggccaccatgcccatctgggtggtcctcctgtcccggatcattatgaaggagaag cagagcaccaaggtatacttgtcactcatccccatcatcagcggtgtcctgctggccacc gtcaccgagttgtcttttgacatgtggggactcgtcagcgccctcgccgccacgctgtgc ttctcgcttcagaacattttctccaaaaaggtcttgcgagattcacggatccaccatctc cggctgctcaacatcctgggctgccacgccgtcttctttatgatccccacctgggttctg gtggacctctcggctttcctggtcagcagcgacttg >gi568815579f:16561218_16788368|GENSCAN_predicted_peptide_2|650_aa MDMTDHSGGRLLLGVAGGPQQEVLFKWSGLSLQEGTVDLACAPGSGSLWFPLEERRTSRS NEEAKLKQIKWAGAIRNMVAVLEVISSLEKYPITKEALEETRLGKLINDVRKKTKNEELA KRAKKLLRSWQKLIEPAHQHEAALRGLAGATGSANGGAHNCRPEVGAAGPPRSIHDLKSR NDLQRLPGQRLDRLGSRKRRGDQRDLGHPGPPPKVSKASHDPLVPNSSPLPTNGISGSPE SFASSLDGSGHAGPEGSRLERDENDKHSGKIPVNAVRPHTSSPGLGKPPGPCLQPKASVL QQLDRVDETPGPPHPKGPPRCSFSPRNSRHEGSFARQQSLYAPKGSVPSPSPRPQALDAT QVPSPLPLAQPSTPPVRRLELLPSAESPVCWLEQPESHQRLAGPGCKAGLSPAEPLLSRA GFSPDSSKADSDAASSGGSDSKKKKRYRPRDYTVNLDGQVAEAGVKPVRLKERKLTFDPM TRQIKPLTQKEPVRADSPVHMEQQSRTELDKQEAKASLQSPFEQTNWKELSRNEIIQSYL SRQSSLLSSSGAQTPGAHHFMSEYLKQEESTRQGARQLHVLVPQSPPTDLPGLTREVTQD DLDRIQASQWPGVNGCQDTQGNWYDWTQCISLDPHGDDGRLNILPYVCLD >gi568815579f:16561218_16788368|GENSCAN_predicted_CDS_2|1953_bp atggacatgacagaccactcaggtggccggctgctcctgggagtagcaggagggccccag caggaagtgcttttcaagtggtcaggcctctccctgcaggaaggaactgtcgacctagcg tgtgcccctggttcgggctctctgtggtttcctttagaagagaggcgcacttcccgcagc aatgaagaggccaaactgaagcaaattaagtgggcaggggcgatccggaacatggtggcg gtgctggaagtcatctccagcctggagaaataccctattaccaaagaggcacttgaggaa acacgacttgggaagctcatcaacgacgtccgcaagaaaaccaagaacgaggagctcgcc aagcgggccaagaagctgctgcggagctggcagaagctcatcgagccggcacaccagcat gaggcggcgctgcgggggctggcgggggccaccggctctgccaacgggggcgcacacaac tgccggccggaggtgggggcggctggcccacccaggagcatccatgacctgaagagccgc aatgacctccagaggctgcccgggcagcggctggacaggctgggcagccgcaagcgccgg ggtgaccagcgtgacctcggccacccagggccgccacccaaggtctccaaagctagccac gaccccctggtccccaactcatcccccctccccaccaacgggatcagtgggagtccagag agcttcgccagctccctggatggcagtgggcatgcaggcccagagggcagccgcctggag cgtgacgagaatgacaagcacagtggcaagatccccgtcaacgccgtgcgaccgcacacc agctccccgggcctgggcaagccccctggaccctgcttgcagccaaaggcttcggtgctg cagcagctggacagggtggacgagactccggggcctccccatcccaagggaccccctcgc tgctctttcagtcctcggaactcacggcatgagggctcctttgcccggcagcagagcttg tatgcacccaagggctccgtgcccagcccctcaccgcggccccaggcactcgatgccaca caggtgccgtcaccgcttccactggcacagccgtccacaccccccgtacggcggctcgag ctgctgcccagtgcggaaagcccagtgtgctggcttgagcagcctgagagccaccagcgg ctggcggggccgggctgcaaggcagggctgtccccagccgagcccctcctgtcccgggca ggcttttccccagactcctccaaggcggacagtgatgctgcctcctcagggggctcggac agtaaaaagaagaagaggtaccgacctcgagactatacggttaacttggacgggcaggtg gctgaggcgggcgtcaagcctgtccggttaaaagagcggaagctcacctttgaccccatg acgagacagatcaaacctctgacccagaaagagccagtgcgggcagacagccctgtgcac atggagcagcagtccaggacagagctggacaagcaggaggccaaggccagcctccagagc cccttcgaacagacgaactggaaggagctgtcacgcaacgagatcatccagtcctacctg agccggcagagcagcctgctctcatcatcgggcgcgcagaccccaggggctcaccacttc atgtctgagtacctgaagcaggaggagagcacccggcaaggggccaggcagctgcatgtg ctggtgcctcaaagcccgcccacggacctccctggtctgacccgggaggtcacacaggac gatctcgacagaatccaggccagccagtggccgggggtgaacgggtgtcaggacacacag ggtaactggtatgactggacgcagtgcatatcgctcgatccgcacggcgacgacgggcgc ttgaacattctgccttatgtctgcttggactga >gi568815579f:16561218_16788368|GENSCAN_predicted_peptide_3|225_aa MGNYDHENCPSSQLQRRRHHLDKEPVCGLKPGKQAKGHQGPGGQGSRPRPDLRLHTFQYI SSSGAASLQHPSQDLDVSWEHPLYCGTVWVEPTADSGDLNLDLEGALWGAYTSSQSHPVD KKSAGKPRPHGSLGLTYRALPGCPTCPRPGPEKREAGMAAALEGRLADEGVKMLVKQERM MKQVHRKDENERSTLTSSLPQDLCKDENENHADFFPAPGPLQRRE >gi568815579f:16561218_16788368|GENSCAN_predicted_CDS_3|678_bp atgggaaactacgatcacgagaactgtcccagcagccagctgcaaaggcgcagacaccat ctggacaaggaacctgtctgtggcctgaagcccggcaagcaggccaaggggcaccagggc cctggtgggcaaggctccaggcccaggccagatctccgtctgcacacgttccaatacatc tcctcttctggggccgcgagccttcagcatccaagccaggaccttgatgtctcctgggaa catcccctctattgtgggactgtttgggtggaacccacagcagattcaggggatctgaac ttagatctggaaggagctttatggggtgcatacaccagttcccaatcccaccctgtggac aagaaatctgcaggcaaaccaaggccgcacggaagcctcggcctcacgtaccgagcgctg ccgggctgcccgacctgcccccggcccgggcccgagaagcgagaggcagggatggcagcg gccctcgaggggagactagcagatgagggagtgaagatgctggtcaaacaggagaggatg atgaagcaggtgcacaggaaagacgagaatgagagaagcacactgacttcttccctgccc caggacctttgcaaagatgagaatgagaaccatgctgacttcttccctgccccaggacct ttgcaaagacgagaatga >gi568815579f:16561218_16788368|GENSCAN_predicted_peptide_4|336_aa MELLSALSLGELALSFSRVPLFPVFDLSYFIVSILYLKYEPGAVELSRRHPIASWLCAML HCFGSYILADLLLGEPLIDYFSNNSSILLASAVWYLIFFCPLDLFYKCVCFLPVKLIFVA MKEVVRVRKIAVGIHHAHHHYHHGWFVMIATGWVKAPLASTHEMPVAHLSPAVTKKLSPD IAKRLLEGKMAAGSGVALMSNFEQLLRGVWKPETNEILHMSFPTKASLYGAILFTLQQTR WLPVSKASLIFIFTLFMVSCKVFLTATHSHSSPFDALEGYICPVLFGSACGGDHHHDNHG GSHSGGGPGAQHSAMPAKSKEELSEGSRKKKAKKAD >gi568815579f:16561218_16788368|GENSCAN_predicted_CDS_4|1011_bp atggagctgctctcggcgctgagcctgggcgaactggcgctcagcttctcgcgggtgccg ctcttccccgtcttcgacctcagttacttcatcgtctccatcctctacctcaagtatgag ccaggagcagtcgaactgtcccggcgccaccccatcgcgtcctggctgtgcgccatgctg cattgcttcgggagctacatcctggctgatctgctccttggggagccactgatcgattac ttcagcaacaactccagcatcctgctggcctcagctgtctggtacttgattttcttctgc cccctggacctcttctacaagtgtgtctgcttcctgcctgtgaaactcatcttcgtggcc atgaaggaggtggtgcgagtccgcaagatcgcggtgggcatccatcacgcccatcaccac taccaccacgggtggttcgtcatgattgcaactgggtgggtcaaagcacccctggcctcc acccacgagatgccagtagcacacttgtccccagctgtgacaaagaaattgtctccagac attgctaaacgcctcctagagggcaaaatggctgcaggttctggtgtcgccctcatgtcc aactttgagcagctgctccgaggggtctggaagccagagaccaacgagatcctgcacatg tctttccccaccaaggccagcctgtatggagccatcctcttcaccctccagcagacccgc tggctcccagtgtccaaagccagcctcatcttcatcttcaccttgttcatggtgtcctgt aaggtgtttctgacagccacccactcacacagctccccctttgatgccctggagggctac atctgccccgtgctgtttggttcggcctgcgggggtgaccatcaccacgacaaccatggt gggtcccacagcggtggtgggccaggagctcagcattcggccatgcccgccaagtccaag gaggagttgagcgagggctccaggaagaagaaggccaagaaggcggattag >gi568815579f:16561218_16788368|GENSCAN_predicted_peptide_5|38_aa MPSQIHIFSHEKNANQNHTLGSTIVLCLSEFDDSRNLI >gi568815579f:16561218_16788368|GENSCAN_predicted_CDS_5|117_bp atgcccagtcagatacacatcttcagccatgagaagaatgcaaatcaaaaccacaccctt ggcagcaccattgtactttgtctctctgaatttgatgactctaggaacctcatatag >gi568815579f:16561218_16788368|GENSCAN_predicted_peptide_6|954_aa MSVGPLSPSTRRRLLRGEAGPLPAPPPSGVTIFISSTVSGKIWMQRGKPCRALPTLKCQT FCQRHGLMFEVVDLRWGIRNIEATDHLTTELCLEEVDRCWKTSIGPAFVALIGDQYGPCL IPSRIDEKEWEVLRDHLTARPSDLELVARYFQRDENAFPPTYVLQAPGTGEACEPEEATL TSVLRSGAQEARRLGLITQEQWQHYHRSVIEWEIERSLLSSEDREQGATVFLREIQDLHK HILEDCALRMVDRLADGCLDADAQNLLSSLKSHITDMHPGVLKTHRLPWSRDLVNPKNKT HACYLKELGEQFVVRANHQVLTRLRELDTAGQELAWLYQEIRHHLWQSSEVIQTFCGRQE LLARLGQQLRHDDSKQHTPLVLFGPPGIGKTALMCKLAEQMPRLLGHKTVTVLRLLGTSQ MSSDARGLLKSICFQVCLAYGLPLPPAQVLDAHTRVVQFFHTLLHTVSCRNFESLVLLLD AMDDLDSVRHARRVPWLPLNCPPRVHLILSACSGALGVLDTLQRVLLDPEAYWEVKPLSG NQGQQMIQLLLAAARRTLSPVHTDLLWASLPECGNPGRLRLAFEEARKWASFTVPVPLAT TAEEATHQLCTRLEQTHGQLLVAHVLGYIVSSRHGLSEAELKDVLSLDDEVLQDVYRDWT PPSKELLRFPPLLWVRLRRDLGYYLARRPVDGFTLLAIAHRQLVEVVRERYLSGSERAKR HGVLADFFSGTWSQGTKKLITLPLVGKPLNLDRKVAPQPLWFSHTVANLRKLKELPYHLL HSGRLEELKQEVLGSMSWISCRGISGGIEDLLDDFDLCAPHLDSPEVGLVREALQLCRPA VELRGMERSLLYTELLARLHFFATSHPALVGQLCQQAQSWFQLCAHPVLVPLGGFLQPPG GPLRATLSGCHKGITAMAWGVEEKLLVIGTQDGIMAVWDMEEQHVIHMLTGHTX >gi568815579f:16561218_16788368|GENSCAN_predicted_CDS_6|2862_bp atgagcgtgggccctctcagcccgagtacccgccggcggcttctgcggggtgaagccggc cccctcccagcacctccacccagcggtgtgaccatcttcatcagttccacagtctcaggt aagatatggatgcagagagggaagccctgcagagcactgcctaccctgaagtgccagacc ttctgccagaggcacggcttgatgtttgaggtcgttgatctgaggtggggtattcggaac attgaagccactgaccacttgaccacagaactctgcttggaggaggttgaccggtgttgg aaaacatccatagggccagcttttgttgccctcatcggtgatcagtacggcccctgtctg attccctcgcggatcgatgagaaggagtgggaggtattgagggaccatctgactgccagg ccaagtgacctggagctggtggcacgatacttccagagggacgagaatgcgtttcctccc acctacgtcctgcaggcaccaggtactggggaggcctgtgaaccagaggaggccacctta acttctgtcctacgctctggagcccaggaggcccggaggctggggctcatcacccaggag cagtggcagcactaccaccggtcagtcattgagtgggagatagagcggagcctgctgagc tcagaggaccgggaacagggagccaccgtcttccttagagagatccaagacctccacaaa cacatccttgaagactgcgcccttaggatggtggaccggctcgcggatggctgcctggac gctgatgcccagaaccttctcagcagcctcaaaagtcacatcactgacatgcacccaggg gtcctcaagacccaccgcctgccgtggagccgcgacttggtgaaccccaagaacaagact cacgcctgctacctgaaggagctgggtgagcagtttgtggtgagggccaatcaccaggtc ctcacacgcctccgtgagctggatacggccggacaggagttggcgtggctctaccaagag atccgccaccacctttggcagagctcggaggtcattcagaccttctgcggacgccaggaa ctcctggcccggcttgggcagcagctcaggcacgatgacagcaagcagcacacccccctg gtactctttgggcccccaggcattggaaagacagccctgatgtgcaagctggctgagcag atgccaaggctgctggggcacaagacagtgaccgtcctgcggctgctggggacgtcacaa atgagctcagatgcccgtggcctgctgaagagcatctgcttccaggtgtgcctggcctat gggctgcccttgccccctgcccaggttctggacgcccacaccagggtggtccagtttttc cataccctcctccacactgtctcttgcagaaacttcgagtctctcgtgctcctgctggat gctatggatgacctggactctgtccgccatgctcggagggttccctggctgcctctcaac tgccccccgagggtgcacctcatcctctcagcttgctcgggggcactgggggttttggac accttgcagcgggtgctcctggacccggaggcctactgggaggtgaagcccctttccgga aaccaaggccagcagatgatccaactcctgctggcagctgcaaggaggacgctgagcccg gtgcacacagatttgctctgggccagcctcccagagtgtgggaacccagggcggctgagg ctggcgtttgaggaagcccggaaatgggcctctttcaccgtgcctgtcccgctggccacc accgcagaggaagccacgcaccaactctgcacccgcctggagcagacacacgggcagctc ctcgtggcccacgtgctgggctacattgtgtcttcccgacacggtctctcggaggcggag ctgaaggatgttttgtccctggacgacgaggtcctgcaggatgtgtaccgagattggacc ccgcccagcaaggagctgctgcgcttcccgcccctgctgtgggtgcggcttcgtcgggat ctgggatactacttggcccggcggcccgtggatggcttcaccctcctggccattgcccac agacagctggtcgaggtggtccgtgagcgctacctgtcaggatccgagagagccaagagg catggcgtcctggccgacttcttctcagggacctggagccagggtaccaagaagctcatc actctgccacttgtggggaaaccactgaacttggaccgaaaggtggccccgcagcctctg tggttctcacatacggttgcaaacctgcggaagctgaaggagttgccctatcacctgctt cactcgggccgcctggaggagctgaaacaggaggttctgggcagcatgagctggatttcc tgccggggcatctctgggggcattgaagacctgctggatgactttgacctgtgtgcccct cacctggactcccctgaggttggcctggtccgtgaagccctccagctctgccgccctgct gtggagctccgaggcatggagaggagcctcctgtacacagaactgctggccagactccat ttcttcgccacctcacatccagcactggtgggacagctatgccaacaggcccagagctgg ttccagttgtgcgcacaccctgtgctggtgcccctcggaggattcctccagcccccggga ggacccctccgggcaactctcagcggctgtcacaaaggcatcaccgccatggcatggggt gtggaggagaagctgctggtgattggcacccaggatggcatcatggctgtgtgggacatg gaagagcagcatgtgatccacatgctaactggacacacagnn