GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:14:00 Sequence gi568815582f:8635740_8881427 : 245688 bp : 48.05% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3352 3423 72 2 0 94 78 65 0.781 5.48 1.02 Intr + 5392 5445 54 2 0 98 89 66 0.988 6.65 1.03 Intr + 6388 6468 81 2 0 80 116 72 0.997 8.81 1.04 Intr + 6724 6826 103 2 1 110 83 143 0.999 15.23 1.05 Intr + 8818 8986 169 1 1 59 82 306 0.933 27.05 1.06 Intr + 17269 17416 148 0 1 42 37 67 0.007 -3.19 1.07 Term + 25008 25177 170 1 2 41 48 404 0.916 29.74 1.08 PlyA + 25906 25911 6 -0.45 2.05 PlyA - 25979 25974 6 1.05 2.04 Term - 26915 26873 43 1 1 105 48 30 0.045 -2.67 2.03 Intr - 27138 27103 36 2 0 62 100 45 0.029 0.48 2.02 Intr - 30401 30363 39 1 0 114 96 0 0.085 0.74 2.01 Init - 38916 38780 137 0 2 58 53 154 0.492 6.53 2.00 Prom - 45572 45533 40 -2.96 3.00 Prom + 49627 49666 40 -5.26 3.01 Init + 56705 56731 27 1 0 112 94 18 0.251 4.58 3.02 Intr + 59770 59893 124 0 1 18 100 62 0.108 0.56 3.03 Intr + 83265 83441 177 1 0 55 69 63 0.013 0.99 3.04 Intr + 99960 100070 111 1 0 104 107 115 0.169 15.35 3.05 Intr + 110262 110359 98 1 2 87 109 87 0.732 10.43 3.06 Intr + 112369 112398 30 0 0 105 111 7 0.784 3.03 3.07 Intr + 114683 114800 118 1 1 71 82 121 0.995 9.84 3.08 Intr + 122018 122067 50 0 2 100 78 22 0.656 0.80 3.09 Intr + 128330 128410 81 1 0 106 109 78 0.913 11.53 3.10 Intr + 128999 129091 93 1 0 136 76 115 0.999 15.26 3.11 Intr + 130469 130531 63 1 0 88 109 27 0.929 3.71 3.12 Intr + 132454 132517 64 0 1 117 82 69 0.876 7.49 3.13 Intr + 133086 133234 149 1 2 52 80 174 0.943 12.95 3.14 Intr + 137041 137178 138 0 0 54 86 325 0.972 29.56 3.15 Intr + 139151 139318 168 1 0 86 68 276 0.999 25.54 3.16 Intr + 140605 140751 147 0 0 80 92 258 0.991 25.83 3.17 Intr + 143740 143851 112 0 1 106 116 79 0.999 12.45 3.18 Term + 145570 145691 122 2 2 124 43 41 0.660 1.84 3.19 PlyA + 147637 147642 6 1.05 4.02 PlyA - 147928 147923 6 1.05 4.01 Sngl - 160683 160213 471 0 0 1 47 546 0.639 37.92 4.00 Prom - 161330 161291 40 -9.75 5.00 Prom + 161473 161512 40 -14.13 5.01 Init + 161874 162209 336 2 0 84 111 150 0.117 14.36 5.02 Intr + 166060 166171 112 0 1 60 54 74 0.207 1.15 5.03 Intr + 166551 166622 72 1 0 74 48 92 0.680 3.18 5.04 Intr + 169028 169104 77 0 2 46 96 56 0.778 1.43 5.05 Intr + 170577 170668 92 2 2 67 90 30 0.810 -0.11 5.06 Intr + 175340 175500 161 2 2 59 46 156 0.840 8.13 5.07 Intr + 177252 177367 116 1 2 65 100 97 0.996 8.77 5.08 Intr + 178601 178670 70 1 1 95 57 64 0.269 2.75 5.09 Intr + 196064 196139 76 0 1 100 86 -35 0.022 -3.93 5.10 Intr + 196382 196561 180 2 0 86 16 96 0.247 1.18 5.11 Term + 197090 197297 208 2 1 120 44 119 0.393 7.51 5.12 PlyA + 197757 197762 6 1.05 6.11 PlyA - 198127 198122 6 1.05 6.10 Term - 199713 199537 177 0 0 75 33 124 0.156 3.29 6.09 Intr - 205524 205390 135 0 0 60 81 52 0.155 2.46 6.08 Intr - 208758 208615 144 0 0 74 90 11 0.113 0.38 6.07 Intr - 213206 212897 310 1 1 44 86 144 0.176 6.12 6.06 Intr - 219587 219450 138 1 0 106 6 186 0.025 11.78 6.05 Intr - 222733 222611 123 0 0 101 61 145 0.992 12.80 6.04 Intr - 223596 223432 165 2 0 115 69 64 0.264 6.28 6.03 Intr - 233115 233028 88 1 1 114 47 -11 0.000 -3.57 6.02 Intr - 233351 233227 125 1 2 87 49 74 0.001 3.63 6.01 Init - 244048 243984 65 1 2 65 97 56 0.575 4.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 219587 219425 163 1 1 106 49 198 0.974 15.11 S.002 Intr + 233874 234047 174 1 0 85 77 107 0.942 9.21 S.003 Term + 234077 234279 203 0 2 70 42 95 0.976 0.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:8635740_8881427|GENSCAN_predicted_peptide_1|265_aa XVGADLLSMCQRNIALNSHLAATGGGIVRVKELDWLKDDLCTDPKVPFSWSQEEISDLYD HTTILFAAEVFYDDDLTDAVFKTLSRLAHRLKNACTAILSVEKRLNFTLRHLDVTCEAYD HFRSCLHALEQLADGKLRFVVEPVEASFPQLLVYERLQQLVTASSNPLLLLNQCMRRANG GAADIGSRQWIPASTREAFQTVLFLTWQCGGEIGPASGQVLEEEEEEEEEEEEEEEEGEE GEEGEEEEEEEEEEEEEEEEEEELC >gi568815582f:8635740_8881427|GENSCAN_predicted_CDS_1|798_bp natgtcggtgcagatctcttgtccatgtgccagcgaaacattgccctcaacagccacctg gctgccactggaggtggtatagttagggtcaaagaactggactggctgaaggacgacctc tgcacagatcccaaggtccccttcagttggtcacaagaggaaatttctgacttgtacgat cacaccaccatcctgtttgcagccgaagtgttttacgacgacgacttgactgatgctgtg tttaaaacgctctcccgactcgcccacagattgaaaaatgcctgcacagccatactgtcg gtggagaagaggctcaacttcacattgagacacttggacgtcacatgtgaagcctacgat cacttccgctcctgcctgcacgcgctggagcagctcgcagatggcaagctgcgcttcgtg gtggagcccgtggaggcctccttcccacagctcctggtttacgagcgcctccagcaactg gtgactgccagttcaaacccacttctgctgctcaaccaatgcatgcgccgggcaaacggt ggtgctgctgatattggaagcaggcagtggatcccagcttcaaccagggaggccttccag actgtgctgtttcttacatggcagtgtggtggggagattgggccggcaagtgggcaagtc ttggaggaggaggaggaggaggaggaggaggaggaggaggaggaggaggagggggaggag ggggaggagggggaggaggaagaggaggaggaggaggaagaggaggaggaggaggaggag gaggaggagctgtgctga >gi568815582f:8635740_8881427|GENSCAN_predicted_peptide_2|84_aa MRAPSSGSAGCLLDFPQRLAGLPGSAERPGQSAPPAPALAFATANGLSRLSGNQRTPSGC LTDITKSNFEKTQTISFMLFLGEL >gi568815582f:8635740_8881427|GENSCAN_predicted_CDS_2|255_bp atgcgcgccccgagttccggcagcgctgggtgtctgctcgacttcccccagcgactcgcc ggactccctggctccgccgagcgtcccggccaatcagcgccgccagccccggccctcgct ttcgccacagccaatgggttgagcagactctcaggcaaccaaagaacaccctcagggtgc ttgacagacattaccaagtccaactttgagaagacccaaaccatcagcttcatgctcttt cttggtgagctctga >gi568815582f:8635740_8881427|GENSCAN_predicted_peptide_3|623_aa MAKKEAVEQNRMSFAQPYVSLSFSVCKLALLHYIRRRLKASVMSGNQLASAVSGVRESGL FSLNGKCVSSNHTDLICLLLVGPPHGNTRSHRTQQDVSLFTEESLGLGTGWQHAKGVPVP QGVMASMLLAQRLACSFQHSYRLLVPGSRHISQAAAKVDVEFDYDGPLMKTEVPGPRSQE LMKQLNIIQNAEAVHFFCNYEESRGNYLVDVDGNRMLDLYSQISSVPIGYSHPALLKLIQ QPQNASMFVNRPALGILPPENFVEKLRQSLLSVAPKGMSQLITMACGSCSNENALKTIFM WYRSKERGQRGFSQEELETCMINQAPGCPDYSILSFMGAFHGRTMGCLATTHSKAIHKID IPSFDWPIAPFPRLKYPLEEFVKENQQEEARCLEEVEDLIVKYRKKKKTVAGIIVEPIQS EGGDNHASDDFFRKLRDIARKHGCAFLVDEVQTGGGCTGKFWAHEHWGLDDPADVMTFSK KMMTGGFFHKEEFRPNAPYRIFNTWLGDPSKNLLLAEVINIIKREDLLNNAAHAGKALLT GLLDLQARYPQFISRVRGRGTFCSFDTPDDSIRNKLILIARNKGVVLGGCGDKSIRFRPT LVFRDHHAHLFLNIFSDILADFK >gi568815582f:8635740_8881427|GENSCAN_predicted_CDS_3|1872_bp atggctaagaaggaggcagttgagcagaataggatgagttttgctcaaccttatgtcagc ctcagcttctcagtctgcaaactggcactactccactacatcagaagacgtttaaaagca tcagtgatgtctggaaatcagttagcatcggcggtttctggagtccgtgaaagtggtttg ttttctctgaacgggaagtgcgtttcctccaatcacaccgacctgatttgtttgttgctt gttgggcctccccacgggaacacgagatcccacagaacacagcaggacgtgtccttgttt actgaggaatctctggggctcggcacagggtggcagcacgcaaagggtgtccctgtccct caaggggtcatggcctccatgttgctcgcccagcgcctggcctgcagcttccagcacagc taccgcctgctggtgcctggatccagacacattagtcaagctgcagccaaagtcgacgtt gaatttgattatgatgggcctctgatgaagacggaagtcccagggcctagatctcaggag ttaatgaaacagctgaatataattcagaatgcagaggctgtgcattttttctgcaattac gaagagagccgaggcaattacctggttgatgtggacggcaaccgaatgctggatctttat tcccagatctcctctgtccccataggttacagccaccccgccctgctgaaactcatccaa cagcctcaaaatgcgagcatgtttgtcaacagacccgccctcggaatcctgcctccggag aactttgtggagaagctccggcagtccttgctctcggtggctcccaaagggatgtcccag ctcatcaccatggcctgcggctcctgctccaatgaaaacgccttaaagaccatcttcatg tggtaccggagcaaggaaagagggcagaggggcttctcccaggaggagctggagacgtgc atgattaaccaggcccctggctgccccgactacagcatcctctccttcatgggcgcgttc catgggaggaccatgggttgcttagcgaccacgcactctaaagccattcacaagatcgac atcccttcctttgactggcccatcgcaccgttcccacggctgaaataccctctggaagag tttgtgaaagagaaccaacaggaggaggcccgctgtctggaagaggtggaggatctgatt gtgaaatatcggaaaaagaagaagacggtggccgggatcatcgtggagcccatccagtcc gagggtggagacaaccacgcatccgatgacttctttcggaagctgagagacatcgccagg aagcatggctgcgccttcttggtggacgaggtccagaccggaggaggctgcacgggcaag ttctgggcccatgagcactggggcctggatgacccagcagacgtgatgaccttcagcaag aagatgatgactgggggcttcttccacaaggaggagttcaggcctaatgctccctaccgg atcttcaacacctggctgggggacccgtccaagaacctgttgctggctgaggtcatcaac atcatcaagcgggaggacctgctaaataatgcagcccatgccgggaaggccctgctcaca ggactgctggacctccaggcccggtacccccagttcatcagcagggtgagaggacgaggc accttttgctccttcgatactcccgatgattccatacggaataagctcattttaattgcc agaaacaaaggtgtggtgttgggtggctgtggtgacaaatccattcgtttccgtcccacg ctggtcttcagggatcaccacgctcacctgttcctcaatattttcagtgacatcttagca gacttcaagtaa >gi568815582f:8635740_8881427|GENSCAN_predicted_peptide_4|156_aa MFYRFDAIRTFGFLSRLKLAQTALTVVALPPGYYLYSQGLLTLNTVCLMSGISGFALTML CWMSYFLRRLVGILYLNESGTMLRVAHLNFWGWRQDTYCPMADVIPLTETKDRPQEMFVR IQRYSGKQTFYVTLRYGRILDRERFTQVFGVHQMLK >gi568815582f:8635740_8881427|GENSCAN_predicted_CDS_4|471_bp atgttttaccgttttgatgccatcagaaccttcgggttcctgtcacgactgaagttggca cagactgccctgacagtggtagctttgccaccaggctattacttgtactcccagggcctc ctcactctcaacaccgtgtgcctcatgagtgggatatcgggctttgccctgaccatgctg tgctggatgagctatttcttacggagactggttggtatcctgtatctgaatgagtctggc accatgctgcgggtggcccatctgaacttctggggctggcggcaggacacatactgtccc atggcagatgtgattcccctgacagaaaccaaggaccggcctcaggagatgtttgtgcgt atccagcggtacagtgggaaacagaccttctacgtcaccctgcgctatggacgcatcctg gacagagagcgtttcacacaggtgtttggggtacatcagatgctcaagtga >gi568815582f:8635740_8881427|GENSCAN_predicted_peptide_5|499_aa MAPPPAAGSKSHRPPEPLRAHARMYKAGVICVAPWELRSRNRGCRYSQALWERSPLLFPT CPATQRPNPEVPGRVPRANVSCKVRLETGDMAAPGPALCLFDVDGTLTAPRQKITKEMDD FLQKLRQKIKIGVVGGSDFEKVQEQLGNDESPAQDPEKPERLIDPFTGESDLIVVEKYDY VFPENGLVAYKDGKLLCRQNIQSHLGEALIQDLINYCLSYIAKIKLPKKRGTFIEFRNGM LNVSPIGRSCSQEERIEFYELDKVRLSEISLVNGWVYGNKIWPGGQISFDVFPDGWDKRY CLRHVENDGYKTIYFFGDKTMPAGGVGYLAISRPVSGFLAILHLADFGTIRDLNINARIS LSPGLSLVPVSYSGMQARHSPKKAAPQEGAHHALRVTKKEADPWRERKEGLRQVVFVMQM LLRAVRLSESGVFSDGHMACALTSLRCQGSLTLQAWQPFPPFAQQGAQELSVDDWSPQGS CDVVLVCAVSPAAKAMPGG >gi568815582f:8635740_8881427|GENSCAN_predicted_CDS_5|1500_bp atggccccaccacctgctgccggaagtaaatcccaccggcccccggaacccctaagagcg catgctcgaatgtacaaggcgggcgtgatctgcgttgcaccctgggagttgcggtccagg aatcgtggctgccgctactcccaggcgttatgggaacggagtcccctcctcttcccgacg tgccctgcgactcagcggccgaacccggaagttccgggccgagttcctcgtgccaacgtg tcttgtaaggtgcggctagaaactggggacatggcagcgcctggcccagcgctctgcctc ttcgacgtggatgggaccctcaccgccccgcggcagaaaattaccaaagaaatggatgac ttcctacaaaaattgaggcagaagatcaaaatcggagtggtaggcggatcggactttgag aaagtgcaggagcaactgggaaatgatgaatctccagcacaggaccccgaaaaaccagag aggcttattgaccccttcactggagagtctgacttgatagtggttgaaaaatacgattat gtgtttccagaaaatggcttggtagcatacaaagatgggaaactcttgtgtagacagaat attcaaagtcatctgggtgaggccctaatccaagatttaatcaactactgtctgagctac attgcgaaaattaaactcccgaagaagaggggtactttcattgaattccgaaatgggatg ttaaacgtgtcccctattggaagaagctgcagccaagaagaacgcattgagttctacgaa ctcgataaagtacgtctttctgaaatatctttggtgaatggctgggtttatggaaataag atatggcctggaggccagatcagctttgatgtctttcctgatggatgggacaagagatac tgtctgcgacatgtggaaaatgacggttataagaccatttatttctttggagacaaaact atgccagctgggggtgttggttaccttgccatcagcaggcccgtgtcaggctttctcgcc atcctgcacctagcagactttgggactattagggatttaaatattaatgcgcggatttca ctgagccctggcctctctttggtgcctgtcagctactctggtatgcaagctcgacacagc cccaagaaggctgcaccccaggaaggagcccaccatgctctgagagtcacaaagaaagaa gcagacccttggcgggagagaaaggaaggcctgaggcaggttgtgtttgtgatgcagatg ctcctgcgagccgtgcggctctctgaaagcggcgtcttctctgatggccacatggcctgt gccctcacgtccctcagatgccagggttccctcaccctccaggcctggcagcccttccct ccctttgctcaacaaggcgcccaagagctgtctgttgatgactggtctccccagggcagc tgtgatgtggttttggtttgcgccgtatccccagctgctaaagcaatgcctggcgggtag >gi568815582f:8635740_8881427|GENSCAN_predicted_peptide_6|489_aa MKLVKLWALSELYDSEQVSSPRPIGRELLGAGIGAPPPLGPRPLPSSALSVGAKRLAEQN GLQGSSSSAEEGSGLGFSGPVWGTRVAPPQGKGSAMSSEPPPPPQPPTHQASVGLLDTPR SRERSPSPLRGNVVPSPLPTRRTRTFSATVRASQGPVYKGVCKCFCRSKGHGFITPADGG PDIFLHISDVEGEYVPVEGDEVTYKMCSIPPKNEKLQAVEVVITHLAPGTKHETWKVLFH RVLAETRDPSSFTKNVQKMKRVRDADLAVAALSRLRGKVWVKAWVSADRRAAPLTVLWLG SRSIPEKLEANASRKPARGNQKVLLVKSDIPRWFLWTWGQEPPNPFSFTLSGKSRFSGGE ASTPNSYLCAPIPYFHAPTPFPLFWRSDNGPAFTSQITQAVSQALSIQWNLHIPYSPQSS GKVERTNGLLKLDDPDTHQAQQITWAVRPQGFTDSPHYFSQAQISSPSVTYLGIILIKTH VLSLLIVSD >gi568815582f:8635740_8881427|GENSCAN_predicted_CDS_6|1470_bp atgaagttggtcaaactgtgggctctctccgagctgtatgactctgaacaggtgtcttct cctcgccccattggccgggagctcctgggggcggggatcggcgccccaccccccctcggg ccacgccccctccccagctccgcgctctcagtcggagcgaagcggctggcggagcagaac ggattgcaggggtcttctagctcggcagaggagggctccgggctgggtttctcggggcct gtttggggaacgcgggttgctcctccccaggggaaggggtcagccatgtcatctgagcct cccccaccaccacagccccccacccatcaagcttcagtcgggctgctggacacccctcgg agccgtgagcgctcaccatcccctctgcggggcaacgtggtcccaagcccactgcccact cgccggacgaggaccttctcggcgacggtgcgggcttcacagggccccgtctacaaagga gtctgcaaatgcttctgccggtccaagggccatggcttcattactccagctgatggcggc cccgacatcttcctgcacatctctgatgtggaaggggagtatgtcccagtggaaggcgac gaggtcacctataaaatgtgctccatcccacccaagaatgagaagctgcaggccgtggag gtcgtcatcactcacctggcaccaggcaccaagcatgagacctggaaagttctcttccac cgtgttctagctgagacccgggatccttccagtttcacgaagaacgtgcagaagatgaaa agagtgagggacgcggacctagctgtagctgcactcagccgcctccgtggtaaagtctgg gtgaaggcctgggtgagtgcagataggagagcagctcctcttactgttctctggctgggc tctcgctccatccctgagaaactagaggcgaacgcctccaggaagccagccaggggcaac cagaaagtccttttggtgaagtctgacatcccgcgttggtttctctggacctgggggcaa gaaccccccaaccccttctccttcacccttagcggcaagtcccgcttttctgggggagag gcaagtaccccaaactcgtatctctgtgccccaatcccttatttccatgccccaacccct ttcccgcttttctggaggtctgataacggaccagcctttactagtcaaatcacccaagca gtttctcaggctcttagtattcagtggaaccttcatatcccttacagtcctcaatcttca ggaaaggtagaacggactaatggtcttttaaagttagatgaccctgacacccatcaggct cagcaaattacctgggctgtacggccacaaggcttcacagacagcccccactacttcagt caagcccaaatttcatccccatctgttacctatctcggcataattctcataaaaacacac gtgctctccctgttgatcgtgtctgattaa