GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:50:17 Sequence gi568815596f:111023746_111264228 : 240483 bp : 43.18% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 127 136 10 0 1 83 98 2 0.474 1.46 1.02 Term + 624 784 161 1 2 44 34 136 0.511 1.90 1.03 PlyA + 1527 1532 6 1.05 2.06 PlyA - 1841 1836 6 1.05 2.05 Term - 6870 6833 38 1 2 109 38 57 0.650 0.30 2.04 Intr - 8039 7966 74 1 2 104 109 28 0.651 5.55 2.03 Intr - 12441 12344 98 0 2 121 13 35 0.207 -1.89 2.02 Intr - 22552 22429 124 0 1 119 73 31 0.289 5.29 2.01 Init - 30933 30632 302 0 2 84 64 111 0.111 3.03 2.00 Prom - 43655 43616 40 -2.16 3.00 Prom + 44668 44707 40 -2.96 3.01 Sngl + 57505 58017 513 0 0 58 38 195 0.652 7.54 3.02 PlyA + 58193 58198 6 1.05 4.00 Prom + 58964 59003 40 -2.56 4.01 Init + 65769 65819 51 2 0 55 102 0 0.292 -0.74 4.02 Intr + 69120 69221 102 2 0 59 110 39 0.491 3.57 4.03 Term + 93871 94071 201 0 0 101 43 184 0.990 12.59 4.04 PlyA + 94779 94784 6 1.05 5.05 PlyA - 96357 96352 6 -0.45 5.04 Term - 97841 97087 755 0 2 -28 42 538 0.752 31.21 5.03 Intr - 98115 98024 92 2 2 60 101 34 0.989 1.54 5.02 Intr - 99159 98207 953 0 2 81 53 699 0.924 55.61 5.01 Init - 115740 115678 63 0 0 70 102 35 0.337 4.25 5.00 Prom - 130364 130325 40 -1.86 6.11 PlyA - 130413 130408 6 1.05 6.10 Term - 159672 159278 395 1 2 72 45 206 0.389 9.90 6.09 Intr - 164757 164624 134 2 2 26 61 69 0.024 -1.71 6.08 Intr - 168414 168303 112 1 1 55 44 107 0.232 2.44 6.07 Intr - 175244 175132 113 1 2 86 57 74 0.155 4.12 6.06 Intr - 177040 176914 127 2 1 60 83 55 0.094 1.94 6.05 Intr - 198991 198943 49 1 1 91 76 17 0.261 -0.95 6.04 Intr - 201518 199445 2074 1 1 68 29 1815 0.698 160.94 6.03 Intr - 209994 209937 58 1 1 60 81 83 0.733 2.74 6.02 Intr - 224496 224403 94 0 1 125 67 68 0.673 7.94 6.01 Init - 227838 227755 84 0 0 67 19 146 0.140 4.83 6.00 Prom - 231183 231144 40 -1.86 7.02 PlyA - 231975 231970 6 1.05 7.01 Term - 239995 239790 206 2 2 83 38 143 0.714 6.33 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 99530 99480 51 2 0 67 86 37 0.811 1.18 S.002 Sngl + 181446 181640 195 2 0 54 47 203 0.828 7.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:111023746_111264228|GENSCAN_predicted_peptide_1|56_aa MVEEEQRLMEEGFMLVEAAMGVKQLQAKECWRPLATTRSQKRKERTFRGGMELLTP >gi568815596f:111023746_111264228|GENSCAN_predicted_CDS_1|171_bp atggtggaagaagagcagagactcatggaagaaggctttatgctggtggaggcagcaatg ggagtgaagcagctgcaagccaaggaatgttggagaccactggcaaccaccagaagccag aagaggaaagaaagaaccttcagaggtggcatggaactgctgacaccttga >gi568815596f:111023746_111264228|GENSCAN_predicted_peptide_2|211_aa MVAGDVEGGLASLLPHAPRRLCAFGGSIHTWLVRNPKEASSSIQPPHLPSVEPSWGAVSA GLLNLGCSSWQLYPPTLMGPCMSHWVDSSYEYCSTCLLRSSCVSLDKSHSLAEPRFLISD VWSPEPTCWSRCEAVCKETVLAASTSSSNRKTLQKPNSQRMRSVVLQLLVLPQVGAVSPG TRRPLSHSITNIQQEPTDLALRFLRGLEIHM >gi568815596f:111023746_111264228|GENSCAN_predicted_CDS_2|636_bp atggtggcaggggacgtggagggaggcctggcgtccctccttccacacgctcccaggcgt ctctgtgcatttgggggcagcattcatacttggctggtaagaaatcctaaggaagcttcc agcagcattcagcccccacacctcccatcagtggagccttcctggggagctgtctctgca ggcttgctcaacctgggctgcagctcttggcagctgtacccaccgaccctgatgggcccc tgcatgtcccactgggtagacagcagctatgaatactgttctacatgtctgcttagatca agctgtgtgtccttagacaagtcacacagccttgctgagcctcgcttcctcatcagtgac gtgtggagcccggagcccacttgctggagcaggtgtgaggctgtgtgtaaagaaacggtg ctggcagcctccacctccagtagtaacagaaaaacactgcagaaacccaacagtcaaaga atgagaagtgttgtcttacaacttctggttctaccccaagtcggagctgtgtctccaggg acccgtagacccctgagtcactcaattacaaatattcaacaggagccaactgacctcgct ctgcggtttctgcggggcctggagatccacatgtaa >gi568815596f:111023746_111264228|GENSCAN_predicted_peptide_3|170_aa MIVYLENPIVSAQNLLKRISNFSKVSGYKINVQKSQTFLYTNNIQTESQIMSELPFKIAT KRIKYLGIQLTRDVKDIFKENYKLLLNEIREDTNKWKNIPCSWIGRINIMKMAILPEVIY RFNAIPTKLPLTFFTELEKNYFKFHMEPKKSPYSQYNPKQKEQSWRHHTT >gi568815596f:111023746_111264228|GENSCAN_predicted_CDS_3|513_bp atgattgtatatttagaaaaccccatcgtctcagcccagaatctccttaagcgtataagc aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaaacattcctatac accaataacatacaaacagagagccaaatcatgagtgaactcccatttaaaattgctaca aagagaataaaatatctaggaatacaacttacaagggatgtgaaggacatcttcaaggag aactacaagctgctgctcaacgaaataagagaggacacaaacaaatggaaaaacattcca tgctcctggataggaagaatcaatatcatgaaaatggccatactgcctgaagtaatttat agattcaatgctatccccaccaagctaccattgactttcttcacagaattagaaaaaaac tactttaaatttcatatggaaccaaaaaagagcccgtatagccaatacaatcctaagcaa aaggaacaaagctggaggcatcacactacctga >gi568815596f:111023746_111264228|GENSCAN_predicted_peptide_4|117_aa MSGDLTKSHKELLGEFKFCLLYGTKLVFQERAWYLEHKYLTPMASTRIRNQLLDLCDSVK DDARRVISTFNIPHTYLHAPIAGISNPRAAWAFYPAPLQPRPREEARSRRPKLGAKL >gi568815596f:111023746_111264228|GENSCAN_predicted_CDS_4|354_bp atgtcaggtgacttaaccaagagtcataaagaactccttggggagttcaagttttgtctg ttgtatggaaccaagctggtgtttcaggagcgggcctggtatttagaacataaatacttg actcccatggccagcacgaggatcaggaatcagttgctggatttgtgcgactcggtgaag gatgatgcccggagggtgatctcgacctttaacattccacacacctacctccacgcacca atcgccggaatctccaacccgcgggccgcgtgggctttctaccctgcaccgctgcagccg cggccacgggaagaggcgcgctcccggcggcccaagctgggagccaagctctaa >gi568815596f:111023746_111264228|GENSCAN_predicted_peptide_5|620_aa MNTPNVKSTSNQCIPYFGKKYSAASGPGTKPRGVRPAPFADRAPPDRHHESSAGDKARDS GSESPAQRKARAFPPARRRAPPGSRGLRTPRLWPALPRPQPPGTSSASSRSSPRALRPPP PAPPTSPRRWRRVPAFPIGPRAVAPRRSQSAAAVGAAPTGQAAKGEAIVDKLASERCSGR GRGGCRFVGGFEGRLARPPCPSLCSFRTQALAAPAVLDSQSPGDKAGLAPRPDPQGHPQA QSSPLVLVLARRGQTLTAVTALAVATILLQQLPLSPGAPQTRVKAEEPPLAADAHVRDHR VNSHPPPWRPSGPPPAAPTLGQQVPEGTAPPACTANRVGWSGEDLDFSLPGNRADGTGRR TPAEQAGAPGRRSRRRDFPAGRTARYYEQVLSSRAGAERHALPLLLPGPTTLHQARAQHP VPSRTSAKSLQVPRERLFFKENLPEREPDPHGSPPPAGDKRRRCKRLRSHVHGKRAVGGA VRAPALVFLRCQARAPGFGGIYLAFLSPRVRVRHLGEQGPRKGAEAVAAGTQRTEYRDDA IAVWQGSQGGCKNQVVVVAAAAAAAAAAAAAAAAAALDAELQQTADQAAARSEVKPARPC SPALAGGPAPTPNHCGPRPR >gi568815596f:111023746_111264228|GENSCAN_predicted_CDS_5|1863_bp atgaacactcctaatgttaaatccacgagcaaccaatgtatcccttattttgggaaaaag tacagcgcagcgtccgggccagggaccaagccccgcggcgtccggcccgcgcccttcgcc gaccgggcgccccctgaccgtcaccacgaaagcagcgcaggagacaaagcccgggactcg ggttcagagtccccggcgcagcgcaaagcccgcgccttcccgccagcccgccgccgcgct ccgccgggcagccgagggctccgcacgccgcgcctctggcccgcgctgccccggccgcag ccgccgggaacatcctccgcctcctcccgctcctcccctcgcgctctgcgcccgccgccg ccggcaccgccgacctccccgcgccgctggcgccgggtgcccgcgttcccaattggtccg cgcgcggtcgctccgcgccgcagccaatcggcagccgcggtgggggcggcgcctacgggt caggccgccaaaggcgaggcgattgttgacaaactcgcctccgagcgctgctctggccgt ggtcgtggcggctgccgcttcgtcggaggatttgagggccggctggcccggccaccctgc ccatccctgtgctccttccggacacaagccttagcagctcccgccgtcctggacagccag tcacctggagacaaagcaggacttgcgccgcgcccggacccgcagggccatccccaggcc cagtcctcgccacttgtccttgttttggcccgtcgggggcagaccctaacggccgtcacc gccctggctgtggcaactattctcctccaacaacttcctctctctcctggtgcgccgcag acacgagtcaaagccgaagagccacccctggccgcggacgcgcacgtccgcgaccatcga gtaaattcacaccctccgccctggcgtccgtcgggccctccaccagcagcgccaaccctg gggcaacaggtcccagagggcacggcgccgccagcctgcaccgcgaaccgggtcgggtgg tcaggggaggacctagatttctcgctgccagggaatcgtgcggatggcacaggccgcagg actcctgcagagcaagcaggggccccagggcgccggagccggcggcgagactttccagct gggcgcacagcccgatactacgagcaggtcctcagctcccgggccggcgccgaaaggcac gcgctgcctctcctcctgcccggacctacgacccttcaccaggcccgcgcgcagcatccc gtcccaagccgcacttcggccaagagcctgcaggttccgcgtgaacgacttttctttaaa gagaacctgcccgagagggaacccgatccacacggatccccacctcccgccggcgacaag cggcgccgatgcaagcgtctccgctcgcacgtgcacgggaagcgggcagtaggtggtgcc gttcgggctccagcgctagtcttccttcggtgccaggcccgcgccccgggatttggaggt atttaccttgcgtttctcagtccgagagtcagagtcagacatttgggggaacaagggcca agaaagggcgccgaggcggtggcggcgggaacgcagcgaaccgaataccgcgatgatgcg atcgcagtgtggcagggttcgcagggtggctgcaagaatcaagtggtggtagtggcggcg gcggcggcggcggcggcggcggcggcggcggcggcagcggcagcggcgctggacgcagag ctccaacaaactgcagaccaggcggctgcgcggagcgaagtgaaacctgcgcggccctgc agcccagctctggctggcggccccgctcctacgcccaatcactgcggaccccgcccccgc tag >gi568815596f:111023746_111264228|GENSCAN_predicted_peptide_6|1079_aa MVMVVVVMVMVEVVMVAVVVVMVVVAVMAGQKADPGRIAHVFGYTGEGIVPAPQHIFIQV LDEPEDVYKWKKRSHWSRRCPEYAICKTELVLFPQNVSPGCPEYAICKTELVLFPQNVSP GCPEYAICKTELVLFPPNVSSPRCPEYAICKTELVLFPPNVSPRCPEYAICKTELILFPQ NVSPGCREYAICKTELVLFPPNVSPRCPEYAICKTELVLFPPNVSSPRCPEYAICKTELI LFPQNVSRGCPEYAICKTELVLFPPNVSPGCPEYAICKTELVLFPQNVSPGCPEYAICKT ELVLFPPNVSPGCPEYTICKTELVLFPPNVSPGCPEYAICKTELVLFPPNVSPGCPEYAI CKTELVLFPPNVSPGCPEYAICKTELVLFPPNVSPGCPEYAICKTELVLFPQNVSPGCPE YSICKTELILFPPNVSPGCPEYAICKTELVLFPPNVSPGCTEYAICKTELVLFPPNVSPG CPEYAICKTELVLFPQNVSPGCPEYAICKTELVLFPPNVSPGCPEYAICKTELVLFPPNV SPGCPEYAICKTELVLFPPNVSPGCPEYAICKTELVLFPQNVSPGCPEYSICKTELVLFP PNVSPGCPEYSICKTELVLFPQNVSPGCPEYAICKTELVLFLPNVSPGCPEYAICKTELV LFPPNVSPGCPEYAICKTELVLFPPNVSPGCPEYAIGKTELVLFPQNVSPGCPEYAICKT ELVLFPPNVSPGCPEYAIGKTELVLFPPNVSPGCPEYAICKTELVLFPPNLAKFLADITL IMNFVASREEQSQKESKATHAERLPVSIQTPTSQTGSQAPEPGYCSPKWAELIQREVEMV LAVVIFPPLEGLKLGCPTAPGSWLALVLKAEESEINVLAAVGVLFLVRGQPSSHCVLTWQ KRQEREDVGLVLSKLMQTYLTDNILVLFGKDGIAQALKMVCAWVSECVRMAKVELGGWKV CSECASLEISEISKWKCQQAIESASQKEAQRSPGWGYRFRMIGIRIVFKSMEFHELTCEV KKEDPRGPKGDQGGVRKRRRTCKGGPASEERGEPGEQNVVETERQEFAEKGVNIVTGLR >gi568815596f:111023746_111264228|GENSCAN_predicted_CDS_6|3240_bp atggtgatggtggtggtggtgatggtgatggtggaggtggtgatggtggcagtggtggtg gtgatggtggtggtggcggtgatggcaggccagaaagcagacccagggaggatagcacat gtcttcggctacactggtgaaggcatcgtgcctgctccacagcacatttttattcaagtc ttggacgagcctgaggatgtctacaagtggaagaaaagaagccactggagcagaagatgt ccagaatacgccatctgcaaaactgaactcgtcctctttccccagaatgtatctcccgga tgtccagaatacgccatctgcaaaactgaactcgtcctctttccccagaatgtatctccc ggatgtccagaatacgccatctgcaaaactgaactcgtcctctttcccccgaatgtatca tctcccagatgtccagaatacgccatctgcaaaactgaactcgtcctctttcccccgaat gtatctcccagatgtccagaatacgccatctgcaaaactgaactcatcctctttccccag aatgtatctcccggatgtcgagaatacgccatctgcaaaactgaactcgtcctctttccc ccgaatgtatctcccagatgtccagaatacgccatctgcaaaactgaactcgtcctcttt cccccgaatgtatcatctcccagatgtccagaatacgccatctgcaaaactgaactcatc ctctttccccagaatgtatctcgcggatgtccagaatacgccatctgcaaaactgaactc gtcctctttcccccgaatgtatctcccggatgtccagaatacgccatctgcaaaactgaa ctcgtcctctttccccagaatgtatctcccggatgtccagaatacgccatctgcaaaact gaactcgtcctctttcccccgaatgtatctcccggatgtccagaatacaccatctgcaaa actgaactcgtcctctttcccccgaatgtatctcccggatgtccagaatacgccatctgc aaaactgaactcgtcctctttcccccgaatgtatctcccggatgtccagaatacgccatc tgcaaaactgaactcgtcctctttcccccgaatgtatctcccggatgtccagaatacgcc atctgcaaaactgaactcgtcctgtttcccccgaatgtatctcccggatgtccagaatac gccatctgcaaaactgaactcgtcctctttccccagaatgtatctcccggatgtccagaa tactccatctgcaaaactgaactcatcctctttcccccgaatgtatctcccggatgtcca gaatacgccatctgcaaaactgaactcgtcctctttcccccgaatgtatctcccggatgt acagaatacgccatctgcaaaactgaactcgtcctctttcccccgaatgtatctcccgga tgtccagaatacgccatctgcaaaactgaactcgtcctctttccccagaatgtatctccc ggatgtccagaatacgccatctgcaaaactgaactcgtcctctttcccccgaatgtatct cccggatgtccagaatacgccatctgcaaaactgaactcgtcctctttcccccgaatgta tctcccggatgtccagaatacgccatctgcaaaactgaactcgtcctgtttcccccgaat gtatctcccggatgtccagaatacgccatctgcaaaactgaactcgtcctctttccccag aatgtatctcccggatgtccagaatactccatctgcaaaactgaactcgtcctctttccc ccgaatgtatctcccggatgtccagaatactccatctgcaaaactgaactcgtcctcttt ccccagaatgtatctcccggatgtccagaatacgccatctgcaaaactgaactcgtcctc tttctcccgaatgtatctcccggatgtccagaatacgccatctgcaaaactgaactcgtc ctctttcccccgaatgtatctcccggatgtccagaatacgccatctgcaaaactgaactc gtcctctttcccccgaatgtatctcccggatgtccagaatacgccatcggcaaaactgaa ctcgtcctctttccccagaatgtatctcccggatgtccagaatatgccatctgcaaaact gaactcgtcctctttcccccgaatgtatctcccggatgtccagaatacgccatcggcaaa actgaactcgtcctgtttcccccgaatgtatctcccggatgtccagaatacgccatctgc aaaacagaacttgtcctctttcccccaaatctagcaaaattcttagctgacattactttg atcatgaactttgttgcctccagggaagagcagagtcaaaaagaaagcaaagccacgcat gcagagaggttaccagtgtccatccagacgcccacttcacagacaggtagccaagcgcca gagcccggctattgtagccctaagtgggctgaactcatccagagggaagtggagatggtg ctggctgtggttatctttcctccactcgagggattgaagctaggctgtcccacggcaccg ggctcgtggctggccctggttctgaaagctgaggagtctgagatcaatgtgctagcagcc gtgggggtcctcttcctggttcgtggtcagccgtcttctcactgtgtcctcacatggcag aagaggcaagagagagaggatgtaggtttggtcttgtctaagctcatgcagacatacctc acagataacatcttggttctttttggcaaagatggaatagcccaggcgttgaagatggta tgtgcatgggtgagtgagtgtgtgagaatggcaaaggtggagttgggaggatggaaagtc tgctctgaatgtgctagtttggagatatctgagatctccaagtggaaatgtcagcaggca atagaatctgcaagtcagaaggaagctcaaagaagtccaggttggggatatagatttaga atgatcggcataagaatagtatttaaatccatggaatttcacgaactcacctgtgaagtg aagaaagaggacccaagagggcctaagggagaccaaggaggagtcagaaaaaggaggagg acctgcaaaggaggaccagccagtgaagaacgaggagagcctggagagcagaatgtggta gaaacagagaggcaagagtttgcagaaaaaggggtcaatattgtcacaggactaagatga >gi568815596f:111023746_111264228|GENSCAN_predicted_peptide_7|68_aa XVGFHVLNSIIQRSSAQSRLRQSFDDPPVSHTKGPAMNKASVTEDPVPLCHKLSDLTASH QAEPTRLL >gi568815596f:111023746_111264228|GENSCAN_predicted_CDS_7|207_bp nctgtaggcttccatgttctcaactctattattcagaggagttcagctcagtcacgtctg cgccagagctttgatgacccacctgtcagccacactaagggccctgccatgaataaggcc tccgtcactgaggatcctgtacccctctgccataaactcagtgacctgactgccagccac caggcagaacctactagacttctctaa